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AGCGACGACG GTCATTAAAA GTGTCATACC CCTCGTGGCT TCCCACTTTA TCGACCAGTA 8280 

TCTCAGCAAT CTTAACCAAC TAGCCGTTAC CGTTTTGCTG GTCTACTATG GTCTCTACAT 8340 

CCTACAAACT GTAGTTCAGT ATGTCGGCAA TCTTCTCTTT GCGCGCGTGT CTTACAGTAT 8400 

TGTTAGGGAT ATTCGTCGGG ATGCCTTTGC CAATATGGAG AAACTGGGCA TGTCTTACTT 8460 

TGACAAGACG CCAGCAGGTT CTATCGTTTC TCGTTTGACC AACGATACCG AGACGATTAG 8520 

TGATATGTTT TCTGGGATTT TATCCAGCTT TATCTCAGCA GTTTTTATCT TTCTGACAAC 8580 

CCTTTATACC ATGTTGGTGC TGGATTTTCG TTTGACGGCT TTAGTCTTGC TCTTTCTTCC 8640 

TTTGATTTTC CTTTTGGTCA ATCTCTATCG AAAAAAGTCA GTGAAAATCA TCGAGAAAAC 8700 

CAGAAGTCTC TTGTCAGATA TCAATAGTAA GCTGGCAGAG AATATCGAGG GAATCAGGAT 8760 

TATTCAGGCC TTTAATCAAG AGAAGCGCCT GCAGGCAGAA TTTGATGAAA TCAACCAAGA 8820 

ACACTTGGTC TACGCCAACC GTTCTGTAGC CTTGGATGCC CTCTTTTTGA GACCTGCCAT 8880 

GAGTTTGCTG AAACTTCTAG GCTATGCAGT CTTGATGGCC TACTTTGGCT ACCGTGGTTT 8940 

TTCTATCGGG ATAACGGTCG GGACCATGTA TGCCTTTATC CAGTACATCA ACCGCCTTTT 9000 

TGACCCCTTG ATTGAGGTGA CGCAAAACTT TTCAACTCTG CAAACGGCTA TGGTTTCTGC 9060 

AGGTCGTGTC TTTGCCCTGA TAGACGAGAG GACCTATGAA CCTCTTCAAG AAAATGGGCA 9120 

AGCCAAAGTC CAAGAAGGCA ATATCCGTTT TGAACATGTG TGTTTCTCAT ATGACGGTAA 9180 

ACATCCGATT CTGGATGACA TTTCTTTCTC TGTTAATAAG GGTGAAACCA TTGCCTTTGT 9240 

AGGTCATACA GGTTCAGGGA AATCGTCTAT TATCAATGTC CTCATGCGCT TTTATGAATT 9300 

CCAGTCAGGG AGAGTTCTCT TGGATGATGT GGATATCAGG GATTTCAGTC AAGAAGAGCT 9360 

GAGAAAAAAC ATCGGTTTGG TCTTGCAGGA ACCCTTCCTC TATCATGGAA CTATTAAGTC 9420 

CAATATCGCC ATGTACCAAG AAACCAGTGA TGAGCAGGTT CAGGCTGCGG CAGCCTTTGT 9480 

GGATGCAGAT TCCTTTATTC AAGAACTTCC TCAGGGGTAC GACTCCCCTG TTTCCGAGCG 9540 

TGGTTCGAGC TTCTCTACTG GGCAACGCCA GCTTCTTGCC TTTGCTAGAA CAGTCGCCAG 9600 

CCAGCCTAAA ATCCTGATTT TGGATGAAGC GACAGCCAAT ATTGACTCTG AAACAGAAAG 9660 

CTTGGTTCAA GCTTCTCTGG CGAAGATGAG ACAGGGCCGA ACAACTATTG CTATCGCTCA 9720 

CCGCCTTTCT ACTATTCAAG ATGCCAACTG CATCTATGTC TTGGATAAGG GACGCATTAT 9780 

CGAGAGTGGA ACCCATGAGG AACTCTTGGC TCTGGGAGGA ACCTATCACA AGATGTATAG 9840 

TTTGCAGGCA GGGGCCATGG CCGATACTCT TTGAAAATCT CTTTAAACCA TGTCAGCTTT 9900 

ATCTGCAATC TCAAAGCTGT ACTTTGATTT TCATTGAGTA CTAGAAGGAA ATCCTTCAAA 9960 

TTACAGATTT CTTTCACCGC CTTTTCCATT TTGTGGTATA ATGAAAAATG , TTGAC AAATA 10020 
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GTATAATAAA AACAAAGGAG AACAGCATGC TGAAATGGGA AGACTTG CCT GTGGAAATGA 10080 

AATCAAGCGA GGTTGAGTCT TACTACCAGC TTGTCTCTAA AAGGAAGGGT TCGCTGATTT 1014 0 

TCAAGCGTTG CTTGGACTGG GTTTTGGCCT TGGTCTTACT GGTTCTGACC TCTCCCATCT 10200 

TTCTCATCTT GAGCATTTGG ATCAAGTTGG ATAGCAAAGG GCCAGTGATT TACAAGCAAG 10260 

AGCGTGTGAC CCAGTACAAC CGTCGGTTCA AGATTTGGAA GTTTCGTACC ATGGTGACGG 10320 

ATGCGGATAA AAAAGGAAGT CTGGTGACTT CTGCTAACGA TAGCCGCATT ACCAAGGTTG 10380 

GAAATTTCAT CCGACGTGTC CGTTTGGACG AACTGCCTCA GTTGGTCAAT GTCCTTAAAG 10440 

GTGAGATGTC CTTTGTCGGT ACACGACCTG AAGTGCCACG TTATACAGAG CAGTATAGCC 10500 

CTGAAATGAT GGCAACCTTG CTCTTGCAAG CAGGGATTAC CTCTCCAGCC AGCATCAACT 10560 

ACAAGGATGA GGACACAATT ATCAGTCAAA TGACGGAGAA AGGTCTGTCA GTTGATCAGG 10620 

CCTATGTGGA GCATGTTCTT CCTGAAAAGA TGCGCTATAA CCTCGCCTAT CTCCGAGAGT 10680 

TTAGTTTCTT TGGGGACATC AAAATCATGT TTCAAACCGT GTTTGAGGTA CTAAAATAAA 10740 

GTAGTCATAA GAAAATGAGT ACAGATAAAA GGAGCAAATC AATGCCAAAT TACAATATTC 10800 

CATTTTCACC GCCTGATATC ACAGAAGCAG AAATTACTGA AGTAGTGGAT ACCCTGCGTT 10860 

CTGGTTGGAT CACAACAGGT CCTAAAACAA AAGAACTGGA GCGCCGCTTG TCTCTTTACA 10920 

CACAGACACC TAAGACTGTT TGTCTCAACT CTGCGACAGC CGCTCTGGAG TTGATTTTAC 10980 

GCGTTTTGGA AGTGGGACCT GGTGATGAAG TCATCGTTCC AGCCATGACC TATACGGCTT 11040 

CATGTAGTGT CATTACGCAC GTGGGAGCAA CCCCTGTCAT GGTGGATATC CAAGCAGATA 11100 

CGTTTGAGAT GGACTATGAC CTGCTTGAGC AAGCTATCAC TGAGAAAACT AAGGTGATTA 11160 

TTCCAGTAGA GCTCGCAGGG ATTGTTTGCG ATTATGACCG TTTGTTCCAA GTCGTGGAGA 11220 

AAAAACGTGA CTTCTTTACC GCTTCAAGCA AGTGGCAAAA GGCCTTTAAC CGTATTGTCA 11280 

TTGTCTCTGA TAGTGCCCAC GCTTTGGGAT CTATTTATAA AGGACAACCT TCTGGTTCTA 11340 

TCGCTGACTT TACTTCCTTC TCATTCCATG CAGTTAAGAA CTTTACAACG GCAGAAGGTG 11400 

GAAGTGCGAC TTGGAAAGCC AATCCAGTGA TTGATGACGA AGAGATGTAC AAGGAATTCC 11460 

AAATCCTTTC CCTTCACGGG CAAACTAAGG ATGCTCTTGC CAAGATGCAA CTGGGGTCAT 11520 

GGGAATACGA TATCGTTACA CCAGCCTATA AGTGCAACAT GACCGATATC ATGGCTTCAC 11580 

TTGGTTTGGT ACAATTGGAC CGCTATCCAA GTTTGTTGCA ACGCCGTAAG GACATTGTGG 11640 

ACCGCTATGA TAGTGGTTTT GCAGGTTCTC GCATCCATCC TTTGGCACAC AAGACTGAAA 11700 

CTGTCGAATC TTCACGCCAC CTCTACATCA CCCGTGTAGA AGGAGCAAGC CTAGAAGAAC 11760 
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GCAACCTCAT CATCCAAGAA TTGGCTAAAG CAGGAATTGC AAGTAATGTT CACTACAAAC 11820 

CGCTTCCTCT CTTGACAGCC TATAAGAATC TTGGATTTGA TATGACGAAC TATCCTAAGG 11880 

CCTATGCCTT CTTTGAGAAT GAAATTACCC TCCCTCTTCA TACTAAATTA AGCGATGAAG 11940 

AAGTAGACTA TATCATTGAG ACTTTCAAAA CAGTTTCTGA AAAAGTGCTA ACTTTATCAA 12000 

AAAAATGACA AACTACAGTC AAGCGAAAGT GATCCTGCCC CTAAAAAGTC TAATTGAGTG 12060 

TAAAAACTGT TGTTTTCAAT TGATAATAGT TTACACCTGT AGTTGAGGCC CCTTTCTCCT 12120 

CAGAGAGAGA ATTTTTATAG GATTTTCCTT TCTTGTGGGA GTCCCGTGGT TTGAAATAAG 12180 

ATGTGAGCAA TTTAGTGTAG CATTTAGAAT CCTTACTAGA CATCATTTAG AAAATCTAGT 12240 

GTCTTGTTCT AGTTTTCAAT TCACCCTATT TTTTGAAAGA CGTGAGTTTC CATGAGTGAG 12300 

ATTGTGGAAA CTCGCGTCTT TTTTTGTTTT CAGAATATTG TTCAAAATTT TGTGCCTGTC 12360 

TTTCATGTTC TAGTCATTCT TTTGCATGAT AGAATTTATA GCATGTTGAT ATTATAATAA 12420 

TACAAATATT CTATATGTTT AGTGATGCTT GCTATACATT ATTAGATCTC CTGCGAGACA 12480 

ATCTATAAAA CACTTGTCTA CGATTACCTA TATGCCCTAT TCCAGTATTT TAGAAGCACT 12540 

GCATCTATTT TTATCGAGGT TAAATCTAGC TTTTATAGAA GGTCTATTTA AGAAATATAT 12600 

TGTAGTGTTT TAGTTTCAAT CCGCCATATG AGCGATATTC AGGTAAATAT CCCTGGCGAA 12660 

TGCTTGTATG ACAAGGTATT TGTTCTTTCA TTTATAATTT ACAACATATC AACAAATTTA 12720 

AATATAGTAA ATGGGATATT TTATATTCAA GCTAAGAAAG ATAGCATCAC TTTTGAATGG 12780 

AAGGCTAAAG AGCAAACTAG GAAGTTGGCC ATAGATAGCT CAAAACCCTG CTTTGAGGTT 12840 

GTAGATATAG TAAAATGAAA TGAGAATAGG ACAAATTGAT CGGGACAGTC AAATCGATTT 12900 

CTAACAATGT TTTAGAAGTA GAGGTGTACT ATTTTAGTTT CAGTCTACTA TAGAACTGAC 12960 

CAAGTCAGTA ACCTAGACTT AGGGCAAGGC GGCACTGACC TAGTTTGAAG AGATTTCCGA 13020 

AGAGTATAAA TTTTAATATT TTCTTGTGTT ATTCCTTGAC AATTCAATTT GGAAAATATA 13080 

TGATAAAGAT AATGACAGCG GTGTCATTCT ATCTATTTTA AGAAAAGTAA TAATCAATTG 13140 

TTAAAAATAG TAAAAAAATT GGAGGTTCTG ATGAAATATT TTGTTCCG 13188 
(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32768 base pairs 

(B) TYPE: nucleic acid / 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 
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AACGAGTGCA TCAGTCTCAG CAAGCACCAG TGCGTCGGCC TCAGCAAGCA CCAGCGCGTC 60 

TGAATCCGCA TCAACCAGTG CCTCAGCTTC AGCAAGTACC TCAGCATCTG AATCAGCATC 120 

AACAAGTGCA TCGGCTTCAG CAAGCACAAG TGCTTCAGCC TCAGCAAGTA TCTCAGCGTC 180 

TGAATCGGCA TCAACGAGTG CGTCCGCTTC AGCAAGTACT AGCGCCTCAG CATCAGCGTC 240 

AACAAGTGCT TCGGCTTCAG CGTCAACGAG TGCGTCTGAG TCAGCATCAA CGAGTACGTC 300 

AGCCTCAGCA AGCACATCAG CTTCTGAATC TGCATCAACC AGTGCGTCAG CCTCAGCATC 360 

GACAAGCGCC TCAGCTTCAG CAAGTACCAG TGCGTCAGCC TCAGCAAGTA CCAGTGCTTC 420 

AGCCTCAGCG TCGACAAGTG CGTCGGCCTC AACCAGTGCA TCTGAATCGG CATCAACCAG 480 

TGCGTCAGCC TCAGCAAGTA CTAGCGCCTC AGCCTCAGCA TCAACGAGTG CGTCCGCTTC 540 

AGCAAGTACT AGTGCATCAG CATCAGCATC AACGAGTGCA TCGGCTTCAG CAAGTACCAG 600 

CGCCTCAGCT TCAGCAAGCA CCAGTGCGTC AGCCTCAGCA AGTACCAGCG CCTCAGCCTC 660 

AGCAAGCACC AGTGCCTCAG CTTCAGCAAG TACCAGTGCG TCAGCCTCAG CGTCGACAAG 720 

TGCGTCGGCT TCAGCAAGTA CCTCAGCGTC TGAATCAGCA TCAACGAGTG CATCAGCTTC 780 

AGCATCAACA AGTGCTTCAG CTTCAGCAAG TATCTCAGCG TCTGAATCGG CATCAACGAG 840 

TGCGTCCGCT TCAGCAAGTA CTAGCGCCTC AGCATCAGCG TCAACAAGTG CTTCGGCTTC 900 

AGCGTCAACG AGTGCGTCTG AGTCAGCATC AACGAGTACG TCAGCCTCAG CAAGCACATC 960 

AGCTTCTGAA TCTGCATCAA CCAGTGCGTC AGCCTCAGCA TCGACAAGCG CCTCAGCTTC 1020 

AGCAAGTACC AGTGCGTCAG CCTCAGCAAG TACCAGTGCT TCAGCCTCAG CGTCGACAAG 1080 

TGCGTCGGCC TCAACCAGTG CATCTGAATC GGCATCAACC AGTGCGTCAg CCTCAGCAAG 1140 

TACTAGCGCC TCAGCCTCAG CATCAACGAG TGCGTCCGCT TCAGCAAGTA CTAGTGCATC 1200 

AGCATCAGCA TCAACGAGTG CATCGGCTTC AGCAAGTACC AGCGCCTCAG CTTCAGCAAG 1260 

CACCAGTGCG TCAGnCTCAG CAAGTACCAG CGCCTCAGCC TCAGCAAGCA CCAGTGCCTC 1320 

AGCTTCAGCA AGTACCAGTG CGTCAgCCTC AGCGTCGACA AGTGCGTCGG CTTCAGCAAG 1380 

TACCTCAGCG TCTGAATCAG CATCAACGAG TGCATCAGCT TCAGCATCAA CAAGTGCTTC 1440 

AGCTTCAGCA AGTACCAGTG CGTCGGCTTC AGCATCAACG AGTGCTTCAG TCTCAGCGTC 1500 
AACCAGTGCC TCTGAATCAG CATCAACAAG TGCCTCGGCT TCAGCAAGCA CCAGTGCGTC ' 1560 

GGCTTCAGCA AGTACTAGTG CATCGGCTTC AGCATCGACA AGTGCGTCTG AATCGGCATC 1620 

AACGAGTGCT TCGGCTTCAG CATCAACGAG TGCGTCAGCC TCAGCAAGCA CATCAGCTTC 1680 

TGAATCTGCA TCAACCAGTG CGTCCGCTTC AGCGTCAACC AGTGCGTCGG CTTCAGCGTC 1740 
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GACAAGTGCT 
AGCGTCAGct 
AGCAAGTATC 
TACGTCAGCC 
AGCATCGACA 
TGCGTCGGCC 
TACTAGTGCA 
GGCTTCAGCG 
AACAAGTGCT 
TGAATCAGCG 
AACCAGCGCC 
GGCCTCAGCA 
AACGAGTGCT 
AGCTTCAGCG 
TATCTCAGCG 
AGCCTCAGCA 
GACAAGCGCC 
GGCCTCAACC 
TGCATCGGCT 
AGCAAGTACC 
TGCCTCGGCT 
AGCAAGTACC 
TGCATCAGCT 
AGCAAGTACC 
TGCGTCGGCT 
AGCAAGTATC 
CGCCTCAGCC 
GGCATCAACG 
TGCATCGGCT 
AGCGTCAACA 



TCGGCTTCAG 
TCCGCCTCAA 
TCAGCGTCTG 
TCAGCAAGCA 
AGCGCCTCAG 
TCAACCAGTG 
TCAGCTTCAG 
TCAACCAGTG 
TCAGCCTCAG 
TCAACCAGTG 
TCGGCCTCAG 
AGCACCTCAG 
TCGGCTTCAG 
TCAACCAGTG 
TCTGAATCGG 
AGCACCTCAG 
TCAGCTTCAG 
AGTGCATCTG 
TCAGCATCAA 
AGTGCTTCAG 
TCAGCAAGCA 
AGTGCGTCAG 
TCAGCATCAA 
AGTGCGTCAg 
TCAGCAAGTA 
TCAGCGTCTG 
TCAGCGTCAA 
AGTGCGTCCG 
TCAGCATCAA 
AGTGCATCGG 



CATCAACGAG 
CCAGTGCGTC 
AATCGGCATC 
CATCAGCTTC 
CTTCAGCAAG 
CATCTGAATC 
CATCAACGAG 
CGTCAGCTTC 
CATCGACAAG 
CTTCGGCTTC 
CAAGCACCTC 
CTTCTGAATC 
CAAGCACAAG 
CTTCAGCCTC 
CATCAACGAG 
CTTCTGAATC 
CAAGTACCAG 
AATCGGCATC 
CCAGTGCCTC 
TCTCAGCATC 
CATCAGCATC 
CCTCAGCGTC 
CGAGTGCATC 
CTTCCGCATC 
CTAGCGCCTC 
AATCGGCATC 
CAAGTGCATC 
CTTCAGCAAG 
CGAGTGCGTC 
CTTCAGCGTC 



576 
TGCGTCGGCC 



GGCTTCAGCA 
AACGAGTGCG 
TGAATCTGCA 
TACCAGTGCT 
GGCATCAACC 
TGCATCGGCT 
AGCAAGTACC 
TGCCTCGGCT 
AGCAAGTACC 
AGCTTCTGAA 
GGCCTCAACC 
CGCCTCGGGT 
AGCATCAACA 
TGCGTCTGAG 
GGCCTCAACC 
TGCTTCAGCC 
AACCAGTGCG 
GGCTTCAGCG 
AACAAGTGCT 
TGAATCAGCG 
GACAAGTGCG 
GGCTTCGGCG 
AACAAGTGCC 
AGCCTCAGCC 
AACGAGTGCG 
GGCTTCAGCG 
TACTAGCGCC 
CGCTTCAGCA 
AACGAGTGCG 



TCAGCAAGCG 
AGCACAAGTG 
TCTGAGTCAG 
TCAACCAGTG 
TCAGCCTCAG 
AGTGCGTCAG 
TCAGCATCAA 
AGTGCTTCAG 
TCAGCAAGCA 
AGTGCTTCAG 
TCGGCCTCAA 
AGCGCCTCAG 
TCAGCATCAA 
AGTGCGTCAG 
TCAGCATCAA 
AGTGCGTCAG 
TCAGCGTCGA 
TCAGCCTCAG 
TCAACCAGTG 
TCAGCCTCAG 
TCGACAAGCG 
TCAGCCTCAG 
TCAACCAGTG 
TCGGCTTCAG 
TCAACCAGTG 
TCCGCTTCAG 
TCAACGAGTG 
TCAGCCTCAG 
AGTACTAGCG 
TCTGAGTCAG 



CAAGTACCTC 
CGTCAGCCTC 
CATCAACGAG 
CGTCAGCCTC 
CGTCGACAAG 
CCTCAGCAAG 
CCAGTGCCTC 
TCTCAGCATC 
CATCAGCATC 
CTTCAGCATC 
CCAGCGCCTC 
CCTCAGCATC 
CGAGTACGTC 
CCTCAGCAAG 
CGAGTACGTC 
CCTCAGCATC 
CAAGTGCGTC 
CAAGTACTAG 
CGTCAGCTTC 
CATCGACAAG 
CCTCAGCTTC 
CAAGTACTAG 
CATCAGAGTC 
CAAGCACCAG 
CGTCAGCCTC 
CAAGTACTAG 
CGTCTGAATC 
CGTCAACAAG 
CCTCAGCCTC 
CATCAACGAG 
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TGCGTCAGCC TCAGCAAGCA CATCAGCTTC TGAATCTGCA TCAACCAGTG CGTCAGCCTC 
AGCATCGACA AGCGCCTCAG CTTCAGCAAG TACCAGTGCG TCAGCCTCAG CGTCGACAAG 
TGCGTCGGCT TCAGCAAGTA CCAGTGCGTC AGCCTCAGCA AGTACCAGTG CGTCAGCCTC 
AGCGTCGACA AGTGCGTCGG CCTCAACCAG TGCATCTGAA TCGGCATCAA CCAGTGCGTC 
AGCCTCAGCA AGTACTAGTG CATCAGCTTC AGCATCAACG AGTGCATCGG CTTCAGCATC 
- AACCAGTGCA TCAGAGTCAG CAAGTACCAG TGCGTCAGCT TCCGCATCAA CAAGTGCCTC 
GGCTTCAGCA AGTACTAGCG CCTCAGCCTC AGCGTCAACA AGTGCTTCAG CTTCCGCGTC 
AACCAGCGCC TCGGCCTCAG CAAGTATCTC AGCGTCTGAA TCGGCATCAA CAAGTGCCTC 
GGCTTCAGCA TCAACGAGTG CATCAGTCTC AGCAAGCACC AGTGCGTCGG CCTCAGCAAG 
CACCAGCGCG TCTGAATCCG CATCAACCAG TGCCTCAGCT TCAGCAAGTA CCTCAGCATC 
TGAATCAGCA TCAACAAGTG CCTCGGCTTC AGCAAGCACA AGTGCTTCAG CCTCAGCAAG 
TATCTCAGCG TCTGAATCGG CATCAACGAG TGCGTCCGCT TCAGCAAGTA CTAGCGCCTC 
AGCATCAGCG TCAACAAGTG CTTCGGCTTC AGCGTCAACG AGTGCGTCTG AGTCAGCATC 
AACGAGTACG TCAGCCTCAG CAAGCACATC AGCTTCTGAA TCTGCATCAA CCAGTGCGTC 
AGCCTCAGCA TCGACAAGCG CCTCAGCTTC AGCAAGTACC AGTGCGTCAG CCTCAGCAAG 
TACCAGTGCT TCAGCCTCAG CGTCGACAAG TGCGTCGGCC TCAACCAGTG CATCTGAATC 
GGCATCAACC AGTGCGTCAG CCTCAGCAAG TACTAGCGCC TCAGCCTCAG CATCAACGAG 
TGCGTCCGCT TCAGCAAGTA CTAGTGCATC AGCTTCAGCA AGTACTAGCG CCTCAGCCTC 
AGCGTCGACA AGCGCCTCAG CTTCAGCAAG TACCAGTGCG TCAGCCTCAG CGTCGACAAG 
TGCGTCGGCT TCAGCAAGTA CCTCAGCCTC TGAATCAGCA TCAACAAGTG CGTCGGCTTC 
AGCATCAACG AGTGCATCAG CTTCAGCATC AACAAGTGCT TCAGCTTCAG CAAGTACCAG 
TGCGTCGGCT TCAGCATCAA CGAGTGCTTC AGTCTCAGCG TCAACCAGTG CCTCTGAATC 
CGCATCAACA AGTGCCTCGG CTTCAGCAAG CACCAGTGCT TCGGCTTCAG CGTCAACGAG 
TGCGTCTGAG TCAGCATCAA CCAGTGCGTC AGCCTCAGCA AGCACATCAG CTTCTGAATC 
TGCATCAACC AGTGCGTCAG CTTCCGCATC AACAAGCGCC TCGGCCTCAG CAAGTACAAG 
TGCTTCAGCC TCAGCATCAA CCAGTGCATC AGCTTCAGCC TCAACAAGTG CTTCAGCCTC 
AGCGTCAACC AGTGCCTCGG CTTCAGCAAG TACCAGTGCG TCAGCTTCAG CAAGCACAAG 
TGCGTCAGCT TCAGCATCAA CCAGTGCTTC GGCTTCGGCA TCAACAAGTG CCTCAGCATC 
AGCATCAACG AGTGCGTCAG CCTCAGCAAG TACTAGTGCA TCAGCATCAG CATCAACCAG 
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TGCATCAGCC 


TCAGCAAGTA 


TCTCAGCGTC 


TGAATCGGCA 


TCAACGAGTG 


CATCAGCATP 




AGCATCAACG 


AGTGCATCGG 


CTTCAGCGTC 


AACCAGTGCA 


TCAGTCTCAG 


CAAGCACPAP, 


5400 


TGCGTCGGCT 


TCAGCATCAA 
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AGTATPTPAfi 


PP/PPTY2A ATP 


J^DU 


GGCATCAACG 


AGTGCGTCAG 


CCTCAGCAAG 


TACTAP.TPPA 


TPPPPTTPaP 


pa spp&ppip 


con 


TGCGTCGGCT 


TCAGCATCAA 


CCAGTGCCTP 


AP.PPTPAPPA 


/\v7 lAlv.1 LAb 


PPTPTP a A m/~» 


5580 


GGCATCAACG AGTGCGTCAG 


CCTPAP.PAAP, 


x 1 AO 1 


TP S PPniTP & C 


LA x L AACGAG 


5640 


TGCATCGGCT 


TCAGCAAGTA 




AP.PTTP afipa 


AuLALLAu 1 L> 


L(j 1 L*AvjL.t. 1L 


5700 


AGCAAGTACC 


AGCGCCTCAG 


ppt p a p. p a a n. 


tALLAu luLL 


TCAGCTTCAG 


CAAGTACCAG 


5760 


TGCGTCAGcT 


CAGCATCAAC 


AAGTGPTTPA 


vjv- I 1 LuuvL 1 


CAACAAGTGC 


GTCAGCTTCA 


5820 


GCATCAACGA 


GTGCGTCGGC 


TTPAGPAAP.P 


& PP & PTYIPPT 


CGGCCTCAGC 


AAGCACCAGT 


5880 


GCTTCAGCTT 


CAGCATCAAC 


A AP.TP*PPTP A 


PPTTPUPPft & 


GTACATCAGT 


TTCAAATTCA 


5940 


GCAAACCATT 


CGAACTCACA 


aPtTtp.p.aart 




CGACAGGTAA 


ATCCCAAAAA 


6000 


GAATTGCCTA 


ATACAGGTAC 


TGAGTCGTPA 


ATTGP. A TPTP. 


TGTTACTTGG 


AGTTCTAGCA 


6060 


GCTGTTACAG 


GTATTGGATT 


GGTTGCGAAA 


PP.PPP.TAAAP 


GTGATGAAGA 


AGAGTAAGAC 


6120 


AACCTGTAAA 


GTTAGGCTAA 


ACTAACTPGP 


1 nnn i 


CAAGGAGAAA ATTGCTAGTG 


6180 


GATGATAAAA 


TAACAGTCAT 


TGTACCAGTA 


TAPAATP.TP.P. 


AAAACTATCT 


GAGGAAGTGC 


6240 


CTAGATAGTA 


TTATTACTCA 


AA CAT AT AAA 


AATATTGAfiA 
nn x At x x unun 


TTGTTGTCGT 


TAATGATGGT 


o J UU 


TCTACGGATG 


CTTCAGGTGA 


AATTTGTAAA 


p, AAT^PTT P A P. 
\wi x x x x la\j 


AAATGGATCA 


CCGAATTCTC 


O J OU 


TATATAGAAC 


AAGAAAATGC 


TGGTCTTTCT 


GCCGCACGAA 


ACACCGGTCT 


GAATAATATG 


04ZU 


TCCGGAAATT 


ATGTGACCTT 


TGTGGACTCG 


GATGATTGGA 

w«b X Un X X 


TTGAGCAAGA TTATGTAGAA 


QHOVJ 


ACTCTATATA 


AAAAAATAGT 


AGAGTATCAG 


GPTfiATATTV": 


CAGTTGGTAA 


TTATTATTCT 


6540 


TTCAACGAAA 


GTGAAGGAAT 


GTTCTACTTT 


CATATATTGG 


GAGACTCCTA 


TTATGAGAAA 


OOUU 


GTATATGATA 


ATGTTTCTAT 


CTTTGAGAAC 


TTGTATGAAA 


CTCAAGAAAT 


GAAGAGTTTT 


6660 


GCTTTGATAT 


CTGCTTGGGG 


TAAACTCTAT 


AAGGCAAGAT 


TGTTTGAGCA 


GTTGCGCTTT 


6720 


GACATAGGTA 


AATTAGGAGA 


AGATGGTTAC 


CTCAATCAAA 


AGGTATATTT 


ATTATCAGAA 


6780 


AAGGTAATTT 


ATTTAAATAA 


AAGTCTTTAT 


GCTTATCGGA 


TTAGAAAAGG 


TAGTTTATCA 


6840 


AGAGTTTGGA 


CAGAAAAGTG 


GATGCACGCT 


TTAGTTGATG 


CTATGTCTGA 


ACGTATTACG 


6900 


CTACTAGCTA 


ATATGGGTTA 


TCCTCTAGAG 


AAACACTTGG 


CAGTTTATCG 


TCAGATGTTG 


6960 


GAAGTCAGTC 


TCGCCAACGG 


TCAAGCTAGT 


GGTTTATCTG 


ACACAGCAAC 


GTATAAAGAG 


7020 


TTTGAAATGA 


AACAAAGGCT 


TTTAAATCAG 


CTATCGAGAC 


AAGAGGAAAG 


TGAAAAGAAA 


7080 
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CCC^c .CCCOC^ CTATGGCTAT GTAGACCAAG 

ATTTGTTATC ATAATCGTTC GATTCGTTTT TATCTGATTC ATAGCGAn, TCCAAATGAA 
I**™** AATTAAATAA GCGC^AGAG AAG^ACT CAGAAATTAT TAA.TGTCGG 
GTAACTTCTG AGCAAATTTC ATGTTATAAA TCGGATATTA GTTACACAGT C^TTTACGC 
TATTTCATAG CTGATTTCGT GCAAGAAGAC AAGGCCCTCT AG^GA^ TGATCTAGTT 
GTAACGAAAA ATCTGGATGA CT^CT ACAGACTTAC AAGATTATCC TTTGGCTGCT 
GTTAGAGATT TTGGGGGCAG AGCTTATTTT GGTCAAGAAA TC^MTOC CGGTGTTCTC 
TTGGTAAACA ATGCTTTTTG GAAAAAAGAG AATATGACCC AAAAATTAAT TGATGTAACC 
AATGAATGGC ATGATAAGGT GGATCAGGCA GATCAGAGCA TCTTGAATAT GC TO AA 
CATAAATGGT TGGAATTGGA CTTTGATTAT AATCATATTG TCATTCATAA ACAGTTTGCT 
GATTATCAAT rGCCTCAGGG TCAGGATTAT CC^A^A TTCACTATCT TTCTCATCGG 
-CCG^GA AAGATTTGGC GGCCCAAACC TATCGTGAAG TTTGGTGGTA CTATCATGGG 
CTTGAATGGA CAGAATTGGG ACAAAACCAT CA^ACATC CATTACAAAG ATCTCACATC 
TATCCAATAA AGGAACCTTT CACTTGTCTA A^ATAC. CCTCAGACCA TATTGAACAA 
ATTGAGACAT TGGTTCAATC C^AT ATTCAGTTTA AGATAGCAGC TAGAGTAATA 
GTTAGTGATC GATTGGCTCA GATGACAATT TATCCAAACG ^ACTATATT TAACGGAATT 
a ™ TAGATGTCGA TAATGAA^ GTAGAAACCA GTCAAGTACT TTTAGATATT 
AATCATGGCG AAAAGACAGA AGAAATTCTC GATCAATTTG CTAATC^c 
TTATCCTTTG AAAATACTAA AACCTA^AA GTAGGTCAGG AGGCATATGC TGTTGACCAA 
GTTCAAGCAA TGATTGAAAA ATTGAGAGAA ATAAGCAAAT GAAGAAAAAT CATTTAGTAG 
GAGATGCTCT GTTAGTGATC AGATTGAAGA GTTGGATTAT TTTTTATAAA 

ATTTCTCCGT TCATCATATA TGAAAGTTGT TCAAACATCA GAGTGCTTTA TAAAATATAA 
ATAGACCTAA AGATATTTAA TATGAACTGC ACCCCAAAAG TTAGACAGAA AAAATCTAAC 
TTTTTGGsGT CAGTACAATA TTAGGGTGTG ATTAATTATC TTTTTAGGTG AAAATGATTC 
TATATTATAG C**^ CGAAATTTAT TATAAGGAAA TTATGTTAAT GAATACAAAA 
TCTATAGTTT TTAATGCAGA TAATGATTAT GTAGATAAAT TAGAAACTGC AATTAAATCT 
ATTOTCTT ATAATAATTG TTTAAAATTT TATGTATTTA ATGATGATAT TGCGTCAGAG 
™ nmGA GCG ~ ACTATACAAT CTGAAATCGT TAATGTAAAG 

a^gtagatk: A^AA AAAGTTTCAT ^ACCGTTAA AGAATTTAAG TTATGCCACT 
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ATTTTATACC 


TAATTTTGTC 


AAAGAAAGTC 


GTGCTTTATA 


CCTAGATTCT 


8880 




1 IACAGGAAG 


TTTAGACTAT 


TTATTTGATA 


TAGAACTAGA 


TGGTTATGCC 


8940 


TTGGCAGCAG 


TAGAAGATTC 


TTTTGGTGAT 


GTTCCTTCTA 


CCAATTTTAA 


CTCCGGAATG 


9000 


TTATTAGTTA 


ATGTAGATAC 


TTGGAGAGAT 


GAAGATGCTT 


GTTCGAAACT 


GTTAGAACTG 


9060 


ACCAATCAAT 


ATCATGAAAC 


AGCATATGGA 


GATCAAGGAA 


TTTTAAATAT 


GTTATTCCAT 


9120 


GATAGATGGA 


AAAGATTAGA 


CCGAAATTTT 


AATTTTATGG 


TGGGGATGGA 


TAGCGTCGCA 


9180 


CACATAGAAG 


GAAATCATAA 


ATGGTATGAG 


ATTTCTGAGT 


TGAAAAATGG 


AGATTTACCT 


9240 


AGTGTTATAC 


ATTATACTGG 


GGTAAAACCT 


TGGGAAATAA 


TTTCCAATAA 


TCGCTTTAGA 


9300 


GAAGTTTGGT 


GGTTTTATAA 


TCTGTTAGAA 


TGGTCTGATA 


TTTTATTGAG 


AAAAGACATT 


9360 


ATTAGTCGTA 


GTTTCGAAGA 


ACTTGTATAC 


AGTCCTAAAG 


CTCATACAGC 


AATTTTTACA 


9420 


GCTAGTTGTG 


AGATGGAGCA 


TGTAGAATAT 


TTGATAGAAA 


ATTTACCAGA 


GGTACATTTT 


9480 


TCTATACTAG 


CACATACATA 


TTTTGCGTCT 


AGTGTCGTTG 


CTTTATTAAG 


ATATAGCAAT 


9540 


GTTACGATTT 


ATCCTTGTTT 


TTCTCCATTT 


GATTATCGAA 


AAATTTTGGA 


TAATTTAGAT 


9600 


TTTTATTTAG 


ATATTAATCA 


TTATAAAGAA 


GTGGATAATA 


TTGTATCCGT 


TGTTCAACAA 


9660 


CTATCTAAAC 


CAATTTTTAC 


CTTTGAAAAT 


ACTAGTCATG 


ATATAGGCAA 


TCAAACTAAT 


9720 


ATATTTTCTT 


CAACCGAACC 


AAACAAAATG 


GTAGAGGCTA 


TTAGACAATT 


TATAGGAGAA 


9780 


TAAGTTTATG 


GCAGACGAAC 


TAATTAGTAT 


TGTAGTTCCA 


ATCTACAACG 


TTGAGAATTA 


9840 


TTTGCGAATG 


TGTTTGGATA 


GCATTCAGAA 


TCAGACGTAT 


CAAAATTTTG 


AGTGTTTATT 


9900 


AATCAATGAT 


GGCTCTCCAG 


ATCATTCATC 


CAAAATATGT 


GAAGAATTTG 


TAGAGAAAGA 


9960 


TTCTCGTTTC 


AAATATTTTG 


AGAAAGCAAA 


CGGCGGTCTT 


TCATCAGCTC 


GTAACCTAGG 


10020 


TATTGAATGT 


TCGGGGGGGG 


GCGTACATTA 


CTTTTGTAGA 


CTCTGATGAT 


TGGTTGGAAC 


10080 


ATGATGCTTT 


AGACCGATTA 


TATGGTGCTT 


TGAAAAAGGA 


AAACGCAGAT 


ATTAGTATCG 


10140 


GGCGTTATAA 


TTCTTATGAT 


GAAACACGCT 


ATGTGTATAT 


GACTTATGTT 


ACGGATCCAG 


10200 


ATGATTCTCT 


AGAAGTGATA 


GAAGGTAAAG 


CAATTATGGA 


TAGGGAAGGT 


GTCGAAGAAG 


10260 


TCAGAAATGG 


GAACTGGACT 


GTAGCTGTCT 


TGAAGTTATT 


CAAGAGAGAG 


TTACTACAAG . 


10320 


ATTTACCATT 


TCCTATAGGA 


AAAATTGCAG 


AGGATACTTA 


CTGGACATGG 


AAGGTACTTC 


10380 


TAAGAGCTTC 


GAGGATAGTC 


TATTTGAATC 


GTTGTGTTTA 


CTGGTACCGT 


GTTGGTTTAT 


10440 


CTGATACTTT 


ATCGAATACA 


TGGAGTGAAA 


AGCGTATGTA 


TGATGAAATT 


GGGGCTAGGG 


10500 


AAGAAAAGAT 


AGCTATTTTA 


GCAAGTTCAG 


ACTATGACTT 


GACCAATCAT 


ATTTTGATTT 


10560 


ATAAAAATAG 


ATTACAAAGA 


GTGATAGCAA 


AATTAGAAGA 


ACAAAATATG 


CAGTTCACAG 


10620 
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AGATTTACAG 


AAGAATGATG 


GAAAAATTGT 


CTTTACTTCC 


GTAGATAGTA 


ATAAAAAATG 


10680 


AGATAGCGTA 


ATATGAAACT ACATTTAACA 


AATTTATACG 


GCATGGCTGG 


TGATAGTACG 


10740 


GTTATCTTAG 


CTCAAAATGC 


TGTTCAAAAG 


ATAGCTAGTC 


AACTGGGATT 


TAGAGAGGTT 


10800 


GGTATTTATT 


TTTACAACAT 


TGCTTCAGAT 


AGTCCTTCTG 


AAATGAATAA 


GCGTCTGGAT 


10860 


GGTATTATGG 


CCAGTATCTC 


TATTGGGGAT 


ATTTTAGTCT 


TTCAGTCTCC 


AACCTGGAAT 


10920 


GGTTTTGAAT 


TTGATCGTCT 


CTTGTTTGAT 


AAGCTAAAGG 


ATATGCAGGT 


GAAAATTATT 


10980 


TGCTTTATCC 


ATGATGTTGT 


TCCCCTCATG 


TTTGATAGTA 


ACTATTATCT 


CATGAAAGAT 


11040 


TATCTGTATA 


TGTATAATCT 


ATCAGATGTT 


TTGATAGTGC 


CGTCAGAGAG 


AATGAAAACA 


11100 


CGCCTGATGG 


AAGAAGGATT 


GACGACTAAG 


AAGATTCTTG 


TTCAAGGGAT 


GTGGGATCAT 


11160 


CCTCATGATT 


TATCCTTATA 


CACCCCTGCT 


TTTAAAAAAG 


AACTTTTTTT 


TGCTGGAAGT 


11220 


TTAGAGCGTT 


TTCCAGACTT 


ACAAAATTGG 


TCTCAAGATA 


CGCCTTTGAG 


AGTATTTTCA 


11280 


AATAAAGGGG 


AAGCTAGTTC 


TAGTGCTAGA 


AGTCTCAGCA 


TCGAAGGATG 


GAAAAAAGAT 


11340 


GAGGAATTGT 


TGCTAGAATT 


ATCAAAGGGT 


GGATTTGGCC 


TTGTCTGGGG 


AACCcATCAA 


11400 


AATGAGGGAG 


AAAGTAACCA 


ATACTATACC 


TTGAATATAT 


CTCATAAGGT 


GAGTACCTAT 


11460 


CTAACAGCGG 


GCATTCCAGT 


CATTGTACCA 


AGTAGCTTGT 


CAACTGCTAA 


ATTTATAGTA 


11520 


GATCAAGGCT 


TGGGCTTTAT 


GGCGGATAGT 


CTGGAAGAGG 


TTCATGAGAT 


AGTTGATAAA 


11580 


ATGAATCTAC 


AAGAATATCA 


AGAAATGACG 


AATCGTATCA 


AGACCTTTAG 


CTATTTGTTA 


11640 


AAAGAGGGCT 


ATTTCACTAA 


AAAGTTATTG 


GTAGATGCAA 


TCTATCACTT 


GGGAATTGAT 


11700 


TAAGGGAATG 


AAATGAACAA 


AACAATTGTA 


CTAGCAGGGG 


ATCGCAATTA 


CACCAGGCAG 


11760 


TTAGAAACAA 


CGATAAAATC 


TATTTTATAC 


CACAATCGAG 


ATGTTAAGAT 


TTATATTTTG 


11820 


AATCAAGATA 


TCATGCCAGA 


TTGGTTTCGC 


AAACCACGAA 


AAATAGCTCG 


CATGTTAGGT 


11880 


AGTGAGATTA 


TCGATGTTAA 


ACTACCTGAA 


CAAACTGTGT 


TTCAAGATTG 


GGAAAAGCAA 


11940 


GATCACATTA 


GTAGCATTAC 


TTATGCTAGA 


TATTTTATTG 


CAGATTATAT 


CCAAGAAGAT 


12000 


AAGGTTTTAT 


ATTTAGACAG 


TGATTTGATT 


GTAAATACTT 


CTTTAGAGAA 


ATTATTTAGT 


12060 


ATTTGTTTAG 


AAGAAAAATC 


ACTCGCAGCA 


GTTAAAGATA 


CAGATGGAAT 


TACATTTAAT 


12120 


GCAGGTGTTT 


TATTAATCAA 


CAATAAAAAA 


TGGCGTCAAG 


AGAAATTAAA 


AGAACGACTA 


12180 


ATTGAACAGA 


GCATTGTTAC 


AATGAAGGAA 


GTTGAAGAAG 


GCCGTTTCGA 


GCATTTTAAT 


12240 


GGTGATCAAA 


CGATTTTTAA 


TCAGGTCTTG 


CAAGATGATT 


GGTTAGAACT 


AGGTCGAGCT 


12300 


TATAATTTAC 


AAGTAGGGCA 


TGATATTGTG 


GCTTTGTATA 


ACAATTGGCA 


GGAACATCTG 


12360 
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GCTTTTAATG ATAAACCAGT GGTGATTCAT TTTACGACCT ACAGAAAACC CTGGACTACC 
TTGACAGCCA ATCGTTATCG TGATTTATGG TGGGAATTCC ATGATTTGGA GTGGAGTCAG 
ATTTTACAAC ACCATATGGG AGAATTTGAA CTAATATCGC CTCTAGATAA GGAATTTTCT 
TGCTTAACCT TAACGAATTC CCAAGATTTA GAAGGAATAG AAGAGCTAGT TACAGCTCTA 
CCTGAGGTGG TATTTCATAT CGCAGCTTGG ACGGATATGG GAGATAAATT AAAAAAATTA 
GCTGTATATA ATAATGTGAG ATTGCATCCA CAAATTGTTC CACCGGTCTT AGATAAGCTG 
AAAAAGTCAA CAAATCTATA TTTGGATATC AATCATGGTA GTGCAGATGA GAACTTTTTA 
AAATCTTTGC AAGAACAAGA AAAAACGCTA CTAGCTTTTC AATCGACTCA GCACGGAGAG 
TTAGGACAAA TCGTTTTCGA AAATGGGAAA GTTTCCTTTA TGATTGATAC GATTAAAGAT 
TTTAAGAAAA ACGGACATCT TACCTGTTTT CGACAACTTC CAAGTTTAAC TTGTTTAACG 
TTTACGGCTT CTCAGTATAT CGAACAATTG GATTACTTGG CTGGACAGTT GCCAAATGTT 
GTTTTTCAAA TTGCTGCTTG GACAGCTATG GGGCCAAAAT TATATGATTT GTCTAATCGT 
TATCCTAATA TTCAGCTCTA TCCGGCAATT TCTAGAGATA AGCTAGACGA GTTGAAGGAG 
AAGATGGATG CTTATTTAGA TATCAACCTA CTGACTTCAA CATCCGATAT CGTTGCAGAA 
ATGGCTCATC TATCTAAACC TATACTAGCC TTTTATAAAT CTCAAAATGG GAATAATGGC 
CAAAGGTTGT ATTCAAGTGA ACATCCTGAA CGAATGTTGG CTGATTTGCA AAAATTGATA 
ACTAAGGATA TGCTAGAAAA ACCGCTTGAT ATAATCCAGG TGAAAGGGAT AGATGAAACC 
TTGGATTATA TTATTGAACA CAACTCTTCT TTAGTTCGTT TTGGAGATGG GGAAATCAAT 
ATGCTTGCAG GGCATTCAAT TCCCTACCAG GATTATGATG AAGAGTTGGT TTCAATCATG 
AGGGACATTA TCGGCCAAGA AAGTCGAGAA GATTTAGTAG TGTGCCTTCC TGATGCTTTT 
ACAGATCGTT TTAGGTTTAC ATCGTGGGCG ATTCCATTTT GGAAAGATCA CATGGATCAT 
TATATGGATT TTTACAGAGA GTTATGCAGT GATTCATGGT ATGGCTCAAC CTTTGTATCT 
CGCCCTTATA TCGATTTTGA AGACAAGAGT CAAGCTAAAG CTCAATTTGA AAAATTGAAA 
AGCATTTGGG AAAACCGTGA CTTACTGATA GTCGAAGGTG CGACTTCTCG TTCAGGTGTC 
GGAAATGATT TATTCGATGA GGCAAATTCT ATTAAGCGAA TTATCTGTCC TTCTCATAGT 
GCCTTTTCTA GAGTTCATGA ACTTGAACAA GAAATTGAAA AGTATGCTGG TGGTCGCTTG 
ATTTTATGTA TGCTTGGACC TACAGCAAAA GTTCTGAGTT ATAATCTATG CCAGATGGGC 
TATCAAGTTT TGGATGTAGG CCATATTGAC TCAGAGTATG AATGGATGAA AATGGGAGCT 
AAAACTAAGG TTAAATTTTC TCATAAACAT ACTGCAGAAC ATAATTTCGA CCAAGATATT 
GAATTTATTG ATGATGAAAC CTATAACAGT CAGATTGTTG CACGAATATT AAACTAGACT 
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ATTTAAAATA AATGATAAGG ATTTAAAATG AGAAATACCA AACGCGCTGT AGTATTTGCA 14220 
GGTGATTACG CTTATATTCG ACAAATCGAA ACGGCGATGA AGTCACTCTG TAGACACAAT 14280 
AGTCATTTGA AAATTTATCT GCTAAATCAG GACATTCCTC AGGAATGGTT TAGTCAAATA 14340 

AGAATATATT TACAAGAGAT GGGGGGCGAC TTGATTGACT GCAAGTTAAT TGGCTCACAG 14400 

TTTCAAATGA ATTGGTCTAA TAAATTACCT CATATCAATC ATATGACATT TGCACGCTAT 14460 

. TTTATTCCAG ATTTTGTAAC AGAAGATAAA GTTCTCTATC TAGATAGTGA TTTGATTGTG 14520 

ACTGGTGATT TGACCGATTT GTTTGAATTA GACTTAGGTG AAAATTATTT GGCAGCAGCT 14580 

CGTTCTTGCT TTGGAGCAGG AGTCGGCTTC AATGCTGGTG TTCTCTTGAT TAACAACAAA 14640 

AAATGGGGAT CTGAAACTAT TCGACAAAAA TTGATTGACT TAACAGAAAA AGAACATGAG 14700 

AATGTGGAAG AAGGAGACCA GTCAATTTTG AATATGTTGT TTAAAGATCA ATATAGTTCC 14760 

CTTGAAGATC AATATAATTT TCAAATAGGA TATGATTATG GGGCGGCAAC CTTTAAACAT 14820 

CAATTCATTT TTGATATTCC GCTCGAACCA CTGCCACTAA TTTTACACTA TATTTCTCAG 14880 

GATAAGCCTT GGAATCAATT TTCTGTTGGA CGTCTAAGAG AAGTTTGGTG GGAATACTCT 14940 

TTGATGGATT GGTCTGTTAT TTTAAATGAA TGGTTTTCAA AGAGTGTGAA GTACCCTAGT 15000 

AAATCACAAA TATTTAAGTT GCAATGTGTT AATTTAACGA ATTCTTGGTG TGTCGAGAAA 15060 

ATCGATTATT TGGCGGAGCA ATTGCCAGAA GTTCATTTTC ATATTGTTGC TTATACAAAT 15120 

ATGGCAAATG AACTACTAGC TTTAACGCGT TTTCCTAATG TTACCGTATA TCCAAATTCC 15180 

TTACCAATGT TATTGGAACA AATAGTAATA GCTTCAGATT TGTATTTGGA TTTGAATCAT 15240 

GATCGAAAAT TAGAAGATGC ATATGAGTTT GTGCTTAAGT ACAAAAAACC AATGATAGCT 15300 

TTCGACAATA CTTGCTCTGA AAATCTTTCT GAGATTTCAT ATGAAGGTAT CTATCCAAGC 15360 

TCCATTCCGA AAAAAATGGT TGCAGCAATC AGATCTTACA TGAGGTAGAG AACAGTATGA 15420 

GAAAATCAAT AGTATTAGCG GCAGATAATG CCTATCTTAT TCCTTTAGAG ACGACTATAA 15480 

AGTCTGTATT GTATCACAAT AGAGATGTTG ATTTTTATAT TCTCAACAGT GATATAGCTC 15540 

CTGAATGGTT TAAATTATTG GGGAGAAAAA TGGAAGTTGT GAATTCTACA ATTCGCAGTG 15600 

TACACATTGA TAAAGAACTT TTTGAAAGCT ATAAAACAGG ACCTCATATA AATTATGCTT 15660 

CTTACTTTAG ATTTTTTGCG ACAGAAGTGG TTGAATCTGA TAGGGTATTG TATCTGGATT 15720 

CCGATATCAT TGTAACTGGG GAACTAGCTA CTTTGTTTGA GATAGATCTC AAAGGATATT 15780 

CAATTGGTGC TGTTGATGAT GTCTATGCCT ATGAAGGACG AAAATCTGGA TTTAATACTG 15840 

GTATGTTACT AATGGATGTT GCAAAGTGGA AAGAACATTC TATTGTCAAT AGTTTATTGG 15900 



WO 98/18931 



PCT/US97/19588 



584 



AATTAGCGGP 


HjftbUHbA/i 1 


1 p tv 7\ pHviv^nvrv"* 

LAAGTTGX/TC 


ATCTTGGGGA 


TCAGAGTATT 


TTAAATATTT 


15960 


ATTTTGAGGA 


1 AAl XVj4jI* AA 


Gt-CTTTAGAaA 


TV TV TV i~* TV m TV m TV TV 

AAACATATAA 


TTATATGGTG 


GGTATTGATA 


16020 


X * * ,f\ & J. 


xwi- 1 i. AAWVA 


rnp#iv^ K jt #**f»fiv* 

TGTGAACGTC 


TAGATGACAA 


TCCACCTACA 


ATTGTTCACT 


16080 


ATG PT AfiTP A 


mp jrpj ?V TA pprp 


mpn iv tv m tv /"» tv m 

TGGAATACAT 


ATAGTATATC 


TAGACTACGT 


GAATTATGGT 


16140 


GGGTTTATAG 


nunl 1 1 oVjAl 


hvptip ftp a p a 
X*Uvs X t, AG AGA 


TTGCTTTTCA 


ACGTTCCGAT 


TTAAATTATT 


16200 


TTGAAAGAAG 

x 4 VMVUiVjnnVj 


r*a itprptp'p 


AAAAAACAAG 


TGATGCTTGT 


GACATGGAGT 


GCAGATATAA 


16260 


AAPATTT AG & 


ulAi 1 XAwlA 


P TV IV PPPflVTt H P 

CAAL-GGX X AL 


CTGATTGGCA 


TTTTCATTTG 


GCTGCACCGT 


16320 


u 1 1 1 Vj 1 1 ^ 


mp TV PP TV ppfpp 


iv PPfnPfnom iv m 

ACCTCTCTAT 


CACAGTATAC 


GAATGTAACA 


GTATATCAAA 


16380 


A 1\j 1 A x 1 


rn * pmTV on n nun 
1 AG T ACjAATT 


GATTGGCTAT 


TGGACGATTC 


TATAGTTTAT 


TTAGATATTA 


16440 


a t> a p a rc/^TYip 

A 1 nL nuv 1 VjVj 


AuAviu 1 1 X X X 


Ti iv m^m iv iv 

AATGTAGTTA 


CAAGGGCACA 


AGAAAGTGGC 


AAGAAAATCT 


16500 


TPGPf TTTP & 
x v.Vjv_ Ill 1\jA 


<ri a rpp IPS PPT 


AAAAGTATGG 


ATGATGGACT 


CTATGACGGT 


ATTTTTTCTG 


16560 




ft p iv rn/~* iv mmra * 
AG AX GATTTA 


GTGGATAGAA 


TGAAGAATAT 


AGAGATAGAG 


TAATGAGTGA 


16620 


Mil AA 1 1. 1 


J. l\j 1 AL 


pp ivmivmivPiv h 
LGATATACAA 


TACGGGAAAA 


TATTTAGTGG 


AGTGTGTCGA 


16680 


v>V« A Inl 1 vj 


a ftppsi Bf*p*n 
AAoUAAALTVT 


ATCAAAATAT 


iv ^ iv » iv mm iv mm 

AGAAATTATT 


TTAGTTGATG 


ACGGTTCTAC 


16740 


uon 1 AA 1 1 V. i 


p^p>p«p« 74 iv ammrri 
GGGGAAATTT 


GTGATGCTTT 


TATGATGCAA 


GATAATCGTG 


TGCGAGTATT 


16800 


fiPlTfH Hp* a 
vj^_A A LAAVjAA 


71 TVfT»A TV OPPPP 

AA X AAGGGGG 


GGGCAGCACA 


AGCTAAAAAT 


ATGGGGATTA GTGTAGCTAA 


16860 


OPV* SP*rnm 


ATCACGATTG 


TTGATTCAGA 


TGATATCGTA 


AAAGAAAATA 


TGATTGAAAC 


16920 


rpp^irifprpft rnf ft P 
1 V- 1 1 i A 1 \-f\\j 


p tv iv fmnr* * n 

CAAGTCCAAG 


AAAAGGATGC 


AGATGTTGTT 


ATAGGGAATT 


ACTATAATTA 


16980 


Tnirr.a a ariT 

luALUAAAu x 


p a f*f1flf* ft ft rp»T» 

t»ACGGGAAX X 


TTTATTTTTA 


TGTAACAGGG 


CAAGATTTTT 


GCGTCGAAGA 


17040 


Ail nOV, 1 A 1 A 


P* TA ft P 1 ft TV ft mm ft 

l*AAGAAA i X A 


rip tv iv /"■r>/">m/ , *TA 

TGAACCGTCA 


AGCAGGAGAT 


TGGAAATTCA 


ATAGCTCGGC 


17100 


PTTT ITS TW 
V- 1 1 1A1A1 lu 


r*r*r* apt\ nvrun iv 
H-tiACAXT, f a 


AGTTGATTAA 


AAAAGAATTA 


TTCAATGAAG 


TTCACTTTTC 


17160 


AAATGGTPGP 




amp ft TV PP IV TV P 
A l\iAAuV>AAL 


1 A IajCA XaJGC 


TTTTATCTTT 


TAGCCTCTAA 


17220 


AATCGTCTTT 


ATAAACGATA 


ATCTCTATCT 


GTATAGAAGA 


CGTTCAGGAA 


GCATCATGAG 


17280 


AACGGAATTT 


GATCTTTCCT 


GGGCAAGAGA 


TATTGTTGAA 


GTGTTTTCTA 


AGAAAATATC 


17340 


GGATTGTGTC 


TTGGCTGGTT 


TGGATGTCTC 


CGTTCTGCGT 


ATTCGATTTG 


TCAATCTTTT 


17400 


AAAAGATTAT 


aagcaaactt 


TAGAATACCA 


TCAATTAACA 


GATACTGAGG 


AATATAAAGA 


17460 


TATTTGTTTC 


AGATTAAAGT 


TGTTTTTTGA 


TGCAGAACAA 


AGAAATGGTA 


AAAGTTGAAA 


17520 


TAAAAGAATT 


GTTATTTACC 


ATATCACAAA 


CAATGAAGGT 


GAGGGGAGTG 


TTTTATGACT 


17580 


AAGATTTATT 


CGTCAATAGC 


AGTAaaaaaA 


GGACTATTTA 


CCTCATTTCT 


ACTGTTTATC 


17640 


TATGTATTGG 


GAAGTCGTAT 


TATTCTCCCT 


TTTGTTGACC 


TAAATACTAA 


AGATTTTTTA 


17700 
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GGAGGTTCAA 


CAGCCTATCT 


AGCCTTCTCA 


GCCGCCCTAA 


CAGGTGGGAA 


TCTAAGAAGT 


17760 


TTATCAATTT 


TTTCTGTTGG 


ATTATCCCCT 


TGGATGTCCG 


CCATGATTTT 


ATGGCAGATG 


17820 


TTTTCTTTTT 


CTAAACGGTT 


GGGTTTAACA 


TCTACGTCTA 


TAGAAATACA 


AGATCGCCGT 


17880 


AAAATGTACC 


TGACCTTGCT 


AATTGCTGTG 


ATTCAATCCT 


TGGCAGTTAG 


CTTGAGACTG 


17940 


CCAGTACAAT 


CCTCCTATTC 


TGCAATATTG 


GTTGTTCTAA 


TGAATACAAT 


ATTGCTGATA 


18000 


-GCAGGAACAT 


TTTTTCTTGT 


TTGGTTGTCA 


GATTTAAATG 


CGAGTATGGG 


GATTGGAGGT 


18060 


TCTATTGTAA 


TCCTCCTATC 


CAGTATGGTT 


TTAAATATTC 


CTCAGGATGT 


TTTGGAAACA 


18120 


TTTCAGACAG 


TACACATTCC 


AACAGGGATT 


ATTGTGTTAC 


TTGCTTTATT 


AACCCTTGTC 


18180 


TTTTCTTATT 


TACTTGCCCT 


TATGTATCGA GCTCGCTATT 


TGGTTCCTGT 


TAATAAAATT 


18240 


GGCTTACACA ATCGATTTAA ACGCTATTCT TATCTCGAAA TCATGTTGAA 


TCCTGCAGGT 


18300 


GGGATGCCTT 


ATATGTATGT 


GATGAGTTTT 


CTTAGTGTAC 


CAGCTTATTT 


GTTCATCTTG 


18360 


TTGGGATTTA 


TTTTCCCTAA 


TCATTCAGGG 


TTAGCGGCTT 


TATCAAAGGA 


ATTTATGGTT 


18420 


GGAAAGCCTT 


TGTGGGTCTA TGTTTATATT 


TCGGTCTTAT 


TTTTATTTAG 


TATCATTTTT 


18480 


GCTTTTGTTA 


CGATGAATGG 


AGAAGAGATT 


GCAGACCGTA 


TGAAAAAATC 


TGGAGAATAC 


18540 


ATTTATGGTA 


TTTATCCAGG 


TGCGGATACT 


AGTCGATTTA 


TTAATCGATT 


GGTCCTTCGT 


18600 


TTCTCAGTCA 


TAGGTGGTCT 


CTTTAATGTG 


ATTATGGCAG 


GTGGTCCCAT 


GCTTTTTGTT 


18660 


TTGTTTGATG 


AAAAGTTATT 


ACGATTGGCA ATGATTCCAG 


GCTTATTTAT 


GATGTTCGGG 


18720 


GGCATGATTT 


TTACGATTAG 


AGACGAGGTC 


AAGGCTTTAA 


GGCTAAATGA 


GACCTATAGA 


18780 


CCTTTGATTT 


AGGAGACTTT 


TATGTATTAT 


TTTATTCCAG 


CTTGGTATGG 


GTCAGAAAGA 


18840 


ACATGGCATG 


CAGATATCAC 


TCCATGGTAT 


TTTTCTCATT 


TTCGTCTAGA 


GTTTGATGAT 


18900 


ACCTTTCACC 


AGATTCGGCT 


CTTTCAAGAG 


CAAGATATAG 


ATTCTCGTCT 


ATTAGTATTA 


18960 


GCTTACCAGC 


CTCATCTACG 


TTATTTTTTA 


TATAGACATG 


GTGTGTTAGA 


AATGGATACT 


19020 


TATTCCGTTT 


TTGATGTTAT 


GCAAGATTTT 


CATAATCTCC 


ATACCCAAGT 


TTTAAGCATT 


19080 


AGAGATATTG 


AGTGGGATGA 


TGACTGTGAA 


TTTATTTATA 


GTCCCTTTAC 


GATTATCGTT 


19140 


CAAAAAAATG 


GGAAGAAATT 


TGCTAAGGTT 


GAACATGGAG 


TTGAAGGCTT 


CATCAGTGAT 


19200 


ATACAGTATT 


TTGAACCAAA 


TGGTCAAATA 


CATATGCACC 


ATATCGTGGA 


TGATCGTGGG 


19260 


TTTGTATCGA 


GCATTATCTT 


TTTTGAAGAT 


GGGCAAGCAG 


CCTATCAAGA 


ATATCTGAAC 


19320 


CTCAAGGGAG 


AGTGGCAATT 


CAGAGAGCGT 


TTAAAAGAAG 


GAGGACAGGT 


AGAAGTCAAT 


19380 


CCAATTTTGG 


GTTATCGCTT 


TAAAATGCTT 


ACCTATCAAA 


ATATGGGAGA 


TCTGGTGGCA 


19440 
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GAATTTTTTG AGAATTATCT GCAAACGTAT GTGAAGGATC AGGATATTTT TATGCTTCCT 19500 

TCTCATTCTC ATCATGACCA GTTGGTACTA GATCGTTTAC CTAGTACTAA TCCTAAACTG 19560 

TTGAGTCTGT TCATTGGACG TAATCCTCAA GATACCTTTA GGGATTTAGA TGTAACTTTT 19620 

GAAAAATCGG ATTTGATTTT GGTGGATAGA GAGGATAGTT TACGATTGTT GCAGGAGTTG 19680 

TATCCTGAAC GAATGCATCA ATGTTATCAT TTATCATCTT TTGACACCCG ATTACGATTG 19740 

GGACGAAGCC AAACTAAGAA AGAATCCATC ATTTATTTTC AACTGGATTT TGAGCAGGGG 19800 

ATTGATAATC AAGCTCTGCT TCAAGTCTTG TCCTTTGTCG CTGAAAATAA GGATACTGAG 19860 

GTGATTTTTG GAGCCTTTGC TGCTAGTCAG GAGCAAATGA ATGAGGTTGA AGGGATTGTT 19920 

GAGTCTTTCA TCCAAGAAAA CATTCAATCC GAAAATCTGG GAAAGGCGAT TGATTATGGT 19980 

GATGCAGAAA ATCCTCTGGA AGAAAATCAA CACCAGGACT TACGCTTACA GTTTGTTAAC 20040 

TTGAATGATG AGTTAGATTT GATAAAAACA CTAGAATTTG TCCGTTTGAT TGTGGATTTA 20100 

AATAGACATC CTCATCTCTA CACACAGATT GCTGGGATTA GTGCAGGAAT TCCTCAAATC 20160 

AACCTAGTTG AAACCGTCTA TGTTGAACAT TTAAAAAATG GTTATTTGTT AGCAGATGTT 20220 

ACAGAATTTT CTAAGGCTGC ACATTATTAC ACAGATAGGT TGAAGGAGTG GAATGAGTCC 20280 

TTGATATATT CAATTGATAA GATTAAGGAG CACACAGGAC AACAATTTCT TGGAAAATTA 20340 

GAGAAATGGA TAGAGGAGGT TAAAAATGTC AAAGGAACTT AATATTTTAC AGATAGGACT 20400 

TGCCAATTGG GAAAATCACT ATGACATACC TGAAAATATG AGTTGGTATT ATTTTTACCC 20460 

AAACTCATCA AAAGCCCTTC GTGAAATAAT TGAAAAAGAG GATATTAACC GTTTTCATGC 20520 

AGTTTTAATA GAAGATGGTC AGTATTCCAG AGACTTATTT TCCTATGTAA AATATTTTGA 20580 

ACCTTATACT TTATTTTATA ACCAGAATCT ACAAATAAAT GATAGAGAGG TTGTGGATTT 20640 

TCTAAAAAAA CGATGTGCAC AAGCAATTGA TTTTTTAAGT CCCCAACAAC TAATCAATGA 20700 

TTTAAGTAAA TCTCTTTTTG GCGGTGGGTA TGGTGATAAA CTCTTTCCTC CGACGATACA 20760 

AGTCAATCCA AATTTTACAG GAGCTATTTC TTATCAAGGA TTGGATTATG TCAGTTTGGA 20820 

AGGTGAGTTT GGGCAAGATT TTGCCCAGCT TGCCTATTGG GCTTATAATA TTATGGTGCA 20880 

AAAAACACTC CCTATTGAGT TGTGGCTTGA ATATGAGAAG GAAGGCAATT GTGACTTTCG 20940 

TTTAGTAATC CGTAAAATGT GGAGTGGGTC TGTTGATGAT TTCTTTGAAG AAGTAATAGT 21000 

ATCTGAAAAA GACTTGGAGC AAGCACTTTT TATGGATAGT CGAGACGGAG ACTACTTTCT 21060 

CTCGATATCT GTTGAAGCAA GAGGTCGTGG AACTATCAAA CTAGGTAATC TTCACCAACG 21120 

ATGGAGTCGA AAACAATTTG GTAAGTTTGT ACTTGGTGGA AATATCCTAC ATGATTCCAA 21180 

GCGTGATGAA ATAAACTATT TCTTCCATCC AGGTGATTTT AAACCGCCTT TGACTGTCTA 21240 
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TTTTGCAGGT 
ATGTCCCTTC 
GGATGAGCTA 
TGATCATAAG 
TGGTGCTTCT 
TATAGCTAGT 
AATTCATCAT 
GAAAATTTTT 
TGAAGAAATG 
GAAGATTTTA 
TTGGTTTTTG 
ATTATTACAC 
ACAACTGTAT 
GGTGAGATTC 
CAGCTTCCCT 
CCTGAAAATG 
AAAAAGGTAG 
CGAGTTTCTC 
GAGATCAGAA 
TTTAAAGCAG 
AGCTAAAACA 
GGTTGAAGCT 
TGATGTTCAA 
GGGGGAAGGT 
AGGAGTGATG 
TCAAGTTTAT 
GGAGATGAAA 
TAGTAATTTA 
TTTACGACCG 



TATCGACCTG 
ATTTTATTTT 
GAGGGAAAAG 
GATTTGATTT 
TTTGAACCCC 
CGTGGACGTT 
ACAGGTGGGA 
AAACAAGCAA 
GATCCACAAG 
TCTAAAGGAA 
CACTTTTATA 
AGAGACAGTC 
CTTATTACCC 
TAAAGACTTG 
TATTAAAAAG 
GAGTTTATAC 
AAAAAGTGAA 
TTTTAAGTGC 
GTGTTTAGAC 
ATTAATGCCC 
GTAGAGTTTC 
TTTGCTGTGG 
GTCATGGGAG 
AAGACCTTGA 
GTTGTGACTC 
CGTTTTCTAG 
GCTGAAGAAA 
GGTTTTGATT 
TTTAACTATG 



CAGAAGGATT 
CTGATCCACG 
TAAAGGATAC 
TATCAGGTCT 
ATGCCATCAT 
TGGACGCACC 
CAAGTTCTCA 
ATTTTTCAAA 
CCTATGAACA 
CTGCTGGACG 
GAATGGTCTT 
TATTCATTGG 
TGACAAAAGT 
GTTTTCTAGT 
AAAGCAGGAG 
CAAGATAACT 
AGATTTCATA 
TGGATTTGAG 
GTTTAGGTCA 
TGAAAGGCAA 
GTCAGCGTCT 
TGCGTGAAGC 
CTATTGTCAT 
CAGCTACCAT 
CTAATGAGTA 
GATTGACCAT 
AGAAGCTTAT 
ATCTAAATGA 
TGATTATTGA 



587 
CGAGGGTTAC 
TTTAGAGGGG 
GATCACTCAC 
TTCTATGGGA 
AGTTGGTAAG 
GGGTGTCTCT 
AGATATGACG 
GACAACCTTT 
ATTAGTGTCT 
ACACAATGAT 
AGAGACTGGT 
GGAGAAGTTG 
GTTCGTCTGT 
GTCAATTACC 
TATCAACTAT 
TTTTTTGACC 
TTTACTTATC 
TCCTTAACTT 
AGATTTCCAG 
GATGTCCTCT 
TTCCGAGGGA 
AGATAAGCGG 
GCACTATGGA 
GCCTGTCTAT 
TTTATCAAAG 
TGGTGTACCA 
CTATGCTTCG 
TAACCTAGCC 
TGAAATTGAT 



TTTATGATGA 
GGAGCTTTTT 
TATCTTGATT 
ACGTTTCCGG 
CCCTTGGCTA 
AACTTAGCTT 
GAGTTGGATC 
GGTTTATCCT 
TATCTGTGTA 
GATACAGATA 
TTTGGAAGGG 
GTGGGACTTA 
ATAATCCTCT 
AGGCTGCACG 
CACTGGTTTT 
GCTATGGTGA 
CAGAAGATAG 
TTTATCATTT 
CTTAGGAAAG 
CTTTCGGATC 
GAAAGTCTAG 
ATTTTAGGGA 
AATGTTGCTG 
TTGAACGCTT 
CGTGATGCCG 
TTTACGGAAG 
GATATCATCT 
TCGAATGAAG 
GATATCTTGC 



AAACTCTTGG 
ATCTCGGAAC 
ATTTAGGCTT 
CTCTCTATTA 
ATTTAGGAAC 
TTGATTGTTT 
AGCGTTTTTG 
ATATGAAAGA 
ATACAGGTGC 
CCAATATTTC 
AGAAAAGATG 
TATGTATGGA 
ATTGCCATCT 
AACCCAACCT 
TGACTGTCAG 
TATTTTAGAA 
TTATACTTAT 
TTCTATCAAG 
TGAAAAAGAT 
AAGAATTAGT 
ACGATATTTT 
TGTTTCCTTA 
AGATGAATAC 
TTTCAGGAGA 
AGGAAATGGG 
ATCCAAAGAA 
ACACAACCAA 
AAGGTAAGTT 
TTGATAGTGC 



21300 

21360 

21420 

21480 

21540 

21600 

21660 

21720 

21780 

21840 

21900 

21960 

22020 

22080 

22140 

22200 

22260 

22320 

22380 

22440 

22500 

22560 

22620 

22680 

22740 

22800 

22860 

22920 

22980 



WO 98/18931 



PCT/US97/19588 



588 



ACAAACTPfT 


CTGATTATTG 


CGGGTTCTCC 


TCGTGTTCAG 


TCTAATTACT 


ATGCGATCAT 


23040 


X V*rV X /W_.r\\», 1 1 


GTAACAACCT 


TGGTCGAAGG 


AGAGGATTAT 


ATCTTTAAAG 


AGGAGAAAGA 


23100 




CTCACTACTA AGGGGGCCAA 


GTCTGCTGAG 


AATTTCCTAG 


GGATTGATAA 


23160 


TTTATAPAAf: 


GAAGAGCATG 


CGTCTTTTGC 


TCGTCATTTG 


GTTTATGCGA 


TTCGAGCTCA 


23220 


X nnul. 1 V. X 1 X 


ACTAAAGATA 


AGGACTATAT 


CATTCGTGGA 


AATGAGATGG 


TACTGGTTGA 


23280 


TAAfiCJfiAAr'A 


GGGCGTCTAA 


TGGAAATGAC 


TAAACTTCAA 


GGAGGTCTCC 


ATCAGGCTAT 


23340 


1Y2 & Afii*v^ n a 


GAACATGTCA 


AATTATCTCC 


TGAGACGCGG 


GCTATGGCCT 


CGATCACCTA 


23400 


X Lnunu 1L1 1 


TTTAAGATGT 


TTAATAAGAT 


ATCTGGTATG 


ACAGGGACAG 


GTAAGGTCGC 


23460 




TTTATTGAAA 


CTTACAATAT 


GTCTGTAGTA 


CGCATTCCAA 


CCAATCGTCC 


23520 




ATTGACTATC 


CAGATAATCT 


ATATATCACT 


TTACCTGAAA 


AAGTGTATGC 


23580 




TACATCAAGC 


AATACCATGC 


TAAGGGAAAT 


CCTTTACTCG 


TTTTTGTAGG 


23640 


/"»rn/"* » nnvno » n 

v- 1 CALj I IXjAA 


ATGTCTCAAC 


TCTATTCGTC 


TCTCTTGTTT 


CGTGAAGGGA 


TTGCCCATAA 


23700 




GCTAATAATG 


CGGCGCGTGA 


GGCTCAGATT 


ATCTCCGAGT 


CAGGTCAGAT 


23760 


GGGGGCTGTG 


ACAGTGGCTA 


CCTCTATGGC 


AGGACGTGGT 


ACGGATATCA 


AGCTTGGTAA 


23820 


AGGAGTCGCA 


GAGCTTGGGG 


GCTTGATTGT 


TATTGGGACT 


GAGCGGATGG 


AAAGTCAGCG 


23880 


GATCGACCTA 


CAAATTCGTG 


GCCGTTCTGG 


TCGTCAGGGA 


GATCCTGGTA 


TGAGTAAATT 


23940 


TTTTGTATCC 


TTAGAGGATG 


ATGTTATCAA 


GAAATTTGGT 


CCATCTTGGG 


TGCATAAAAA 


24000 


GTACAAAGAC 


TATCAGGTTC 


AAGATATGAC 


TCAACCGGAA 


GTATTGAAAG 


GTCGTAAATA 


24060 


CCGGAAACTA 


GTCGAAAAGG 


CTCAGCATGC 


CAGTGATAGT 


GCTGGACGTT 


CAGCACGTCG 


24120 


TCAGACTCTG 


GAGTATGCTG 


nnnu Inlunn 


TATACAACGG 


GATATAGTCT 


ATAAAGAGAG 


24180 


AAATCGTCTA ATAGATGGTT 


CTCGTGACTT 


AGAGGATGTT 


GTTGTGGATA 


TCATTGAGAG 


24240 


ATATACAGAA 


GAGGTAGCGG 


CTGATCACTA 


TGCTAGTCGT 


GAATTATTGT 


TTCACTTTAT 


24300 


TGTGACCAAT 


ATTAGTTTTC 


ATGTTAAAGA 


GGTTCCAGAT 


TATATAGATG 


TAACTGACAA 


24360 


AACTGCAGTT 


CGTAGCTTTA 


TGAAGCAGGT 


GATTGATAAA 


GAACTTTCTG 


AAAAGAAAGA 


24420 


ATTACTTAAT 


CAACATGACT 


TATATGAACA 


GTTTTTACGA 


CTTTCACTGC 


TTAAAGCCAT 


24480 


TGATGACAAC 


TGGGTAGAGC 


AGGTAGACTA 


TCTACAACAG 


CTATCCATGG 


CTATCGGTGG 


24540 


TCAATCTGCT 


AGTCAGAAAA 


ATCCAATCGT 


AGAGTACTAT 


CAAGAAGCCT 


ACGCGGGCTT 


24600 


TGAAGCTATG 


AAAGAACAGA 


TTCATGCGGA 


TATGGTGCGT 


AATCTCCTGA 


TGGGGCTGGT 


24660 


TGAGGTCACT 


CCAAAAGGTG 


AAATCGTGAC 


TCATTTTCCA 


TAAAAGGAGA 


AAATATGACA 


24720 


ATTTACAATA 


TAAATTTAGG 


AATTGGTTGG 


GCTAGTAGCG 


GTGTTGAATA 


CGCTCAAGCC 


24780 
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TATCGTGCTG 


GTGTTTTTCG 


GAAATTAAAT 


CTGTCCTCTA 


AGTTTATCTT 


TACAGATATG 


24840 


ATTTTAGCCG 


ATAATATTCA 


GCACTTAACA GCCAATATTG 


GTTTTGATGA 


TAATCAGGTT 


24900 


ATCTGGCTTT 


ATAATCATTT 


CACAGATATC 


AAAATTGCAC 


CTACTAGCGT 


GACAGTGGAT 


24960 


GATGTCTTGG 


CTTACTTTGG 


TGGTGAAGAA 


AGTCACAGAG 


AAAAAAATGG 


CAAGGTTTTA 


25020 


CGTGTATTCT 


TTTTTGACCA 


AGATAAGTTT 


GTAACCTGTT 


ATTTGGTTGA 


TGAGAACAAG 


25080 


- GACTTGGTTC 


AACATGCCGA 


GTATGTTTTT 


AAGGGAAACC 


TGATTCGGAA GGATTACTTT. 


25140 


TCTTATACGC 


GTTATTGTAG 


CGAGTATTTT 


GCTCCCAAGG 


ACAATGTTGC 


AGTCTTATAC 


25200 


CAACGAACTT 


TTTATAATGA 


AGACGGGACT 


CCAGTCTATG 


ATATCTTGAT 


GAATCAAGGG 


25260 


AAGGAAGAAG 


TTTATCATTT 


CAAGGATAAG 


ATTTTCTATG 


GAAAGCAAGC 


TTTTGTGCGT 


25320 


GCCTTTATGA 


AATCTTTGAA 


TTTGAATAAG 


TCTGATTTGG 


TCATTCTCGA 


TAGGGAGACA 


25380 


GGTATTGGAC 


AGGTTGTGTT 


TGAGGAAGCA 


CAGACAGCAC 


ATCTAGCGGT 


AGTTGTTCAT 


25440 


GCGGAGCATT 


ATAGTGAAAA 


TGCTACAAAT 


GAGGACTATA 


TCCTTTGGAA 


TAACTATTAT 


25500 


GACTATCAGT 


TTACCAATGC 


AGATAAGGTT 


GACTTCTTTA 


TCGTGTCTAC 


TGATAGACAA 


25560 


AATGAAGTTC 


TACAAGAGCA 


ATTTGCCAAA 


TATACTCAGC 


ATCAGCCAAA 


GATTGTTACC 


25620 


ATTCCTGTAG 


GCAGTATTGA 


TTCCTTGACA 


GATTCAAGTC 


AAGGGCGCAA 


ACCATTTTCA 


25680 


TTGATTACGG 


CTTCACGTCT 


TGCCAAAGAA 


AAGCACATTG 


ATTGGCTTGT 


GAAAGCTGTG 


25740 


ATTGAAGCTC 


ATAAGGAGTT 


ACCGGAACTA 


ACCTTTGATA 


TCTATGGTAG 


TGGTGGAGAA 


25800 


GATTCTCTGC 


TTAGAGAAAT 


TATTGCAAAT 


CATCAGGCAG 


AGGACTATAT 


CCAACTCAAG 


25860 


GGGCATGCGG 


AACTTTCGCA 


GATTTATAGC 


CAGTATGAGG 


TCTACTTAAC 


GGCTTCTACC 


25920 


AGCGAAGGAT 


TTGGTCTGAC 


CTTGATGGAA 


GCTATTGGTT 


CAGGTCTACC 


TCTAATTGGT 


25980 


TTTGATGTGC 


CTTATGGTAA 


TCAGACCTTT 


ATAGAGGATG 


GGCAAAATGG 


TTATTTGATT 


26040 


CCAAGTTCAT 


CTGACCATGT 


AGAAGACCAA 


ATCAAGCAAG 


CTTATGCCGC 


TAAGATTTGT 


26100 


CAATTGTATC 


AAGAAAATCG 


TTTGGAAGCT 


ATGCGTGCCT 


ATTCTTACCA 


AATTGCAGAA 


26160 


GGCTTCTTGA CCAAAGAAAT 


TTTAGAAAAG 


TGGAAGAAAA 


CAGTAGAGGA 


GGTGCTCCAT 


26220 


GATTGAACTT 


TATGATAGTT 


ACAGTCAAGA 


AAGTCGAGAT 


TTACATGAAA 


GTCTAGGCGC 


26280 


TACTGGTCTT 


TCTCAACTTG 


GAGTGGTCAT 


CGATGCAGAT 


GGTTTTCTGC 


CTGATGGTCT 


26340 


GCTTTCTCCT 


TTTACCTATT 


ATCTAGGTTA 


CGAGGATGGA 


AAACCTCTCT 


ATTTTAATCA 


26400 


AGTTCCCGTT 


TCAGATTTTT 


GGGAAATTTT 


AGGAGATAAT 


CAGTCTGCTT 


GTATTGAAGA 


26460 


TGTGACGCAG 


GAGAGGGCTG 


TCATTCATTA 


TGCTGATGGA 


ATGCAGGCTC 


GCTTGGTTAA 


26520 
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i tjtjAAAGACC 


TAGAAGGTCG 


AGTACGTCAG 


GTTGACCACT 


ACAATCGCTT 


26580 


J. 1 v> X 


TTTPPTIPS a 
I 1 IvjL. 1 AUAA 


HiAL 1 1 A I AG 


CGCAGATAGC 


GAGCCGATTA 


TGACAGTTTA 


26640 


CCAAGATGTC 


AATGGTCAAC 


AAGTTTTACT 


GGAAAACCAT 


GTGACGGGTG 


ATATCTTATT 


26700 


GACTTTGCCA 


GGTCAGTCCA 


TGCGTTACTT 


TGCAAATAAA 


GTTGAATTTA 


TCACCTTCTT 


26760 


TTTGCAAGAT 


TTGGAAATAG 


ATACCAGTCA 


GCTTATCTTT 


AATACTCTAG 


CGACTCCTTT 


26820 


CTTGGTTTCC 


TTCCATCATC 


CAGATAAATC 


TGGCTCGGAT 


GTCTTGGTAT 


GGCAGGAACC 


26880 


TCTCTATGAT 


GCCATTCCAG 


GTAATATGCA 


GTTGATTTTG 


GAAAGTGATA 


ATGTGCGTAC 


26940 


TAAGAAGATC 


ATCATTCCAA 


ATAAGGCGAC 


TTATGAGCGC 


GCTTTAGAGT 


TAACTGACGA 


27000 


GAAATACCAT 


GATCAGTTTG 


TGCACTTGGG 


TTATCATTAC 


CAGTTCAAAC 


GTGATAATTT 


27060 


CCTAAGACGA GATGCCTTAA TCTTGACCAA TTCAGATCAG ATTGAGCAAG TAGAAGCAAT 


27120 


CGCAGGAGCC 


TTGCCTGATG 


TCACTTTCCG 


TATTGCAGCG 


GTGACAGAGA 


TGTCTTCTAA 


27180 


GCTCTTAGAC 


ATGCTTTGCT 


ATCCTAATGT 


GGCCCTTTAC 


CAGAACGCTA 


GTCCACAGAA 


27240 


GATTCAGGAG 


CTGTATCAAC 


TGTCGGATAT 


TTACTTGGAT 


ATAAACCACA 


GTAATGAGTT 


27300 


GCTACAGGCA 


GTGCGTCAGG 


CCTTTGAGCA 


CAATCTCTTG 


ATTCTTGGCT 


TTAATCAGAC 


27360 


GGTGCACAAT 


AGACTTTATA 


TCGCTCCAGA 


CCATCTATTT 


GAAAGTAGTG 


AAGTTGCTGC 


27420 


TTTGGTTGAG 


ACCATTAAAT 


TGGCCCTTTC 


AGATGTTGAT 


CAAATGCGTC 


AGGCACTTGG 


27480 


CAAACAAGGC 


CAACATGCAA 


ATTATGTTGA 


CTTGGTGAGA 


TATCAGGAAA 


CCATGCAAAC 


27540 


TGTTTTAGGA 


GGCTAACATG 


TCAGAGGAAG 


ATTTATTTTA 


CAAAGACGTT 


GAAGGCCGCA 


27600 


TGGAAGAGTT 


GAAACAAAAA 


CCCATCAAGA 


AGGAAAAAGA 


AACCCGAGGG 


GAAAAGATTA 


27660 


GTAAGACTTT 


TTCACTTTTA 


CTGGGTTTGA 


TGATTCTGAT 


TGGTTTGCTC 


TTTACTTTGC 


27720 


TGGGAATTTT 


GAGGTAGATC 


TATGATTGAA ATACTAATTG 


TTTTAGCTAT 


TATCCTATCT 


27780 


CTTGCTTTGA 


TTGTATTGGT 


AACTATACAA 


CCCCGTCAAA ATCAACTATT 


TTCCATGGAT 


27840 


GCCACTAGTA 


ATATTGGTAA 


ACCAAGCTAC 


TGGCAGAGCA 


ACACCTTGGT 


CAAGGTGCTC 


27900 


ACTTTATTGG 


TGAGTTTGGC 


TTTATTTATT 


CTACTATTAA 


CCTTTATGGT 


GATTACTTAT 


27960 


AAATAAAAGA 


AAACTTCAGA 


TATTCACCTT 


TTGTGGATTG 


GTCTGAAGTT 


TTCTTTTTTA 


28020 


TACTCAATGA AAATCAAAGA GCAAACTAGG AAGCTAGCCG CAGGCTgCTC AAAACACCGT 


28080 


TTTGAGGTTG 


TAGATATAAC 


TGACGAAGTC 


AGCTCAAAAC 


ACCGTTTTGA 


GGTTGTAGAT 


28140 


ATAACTGACG 


AAGTCAGCTC 


AAAACACCGT 


TTTGAGGTTG 


TGGATAGAAC 


TGACGAAGTC 


28200 


AGCTCAAAAC 


ACCGTTTTGA 


GGTTGTGGAT 


AGAACTGACG 


AAGTCAGCTC 


AAAACACCGT 


28260 


TTTGAGGTTG 


TGGATAGAAC 


TGACGAAGTC 


AGCTCAAAAC 


ACCGTTTTGA 


GGTTGTGGAT 


28320 
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AGAACTGACG 


AAGc tCAGTA 


ACATATATAC 


AGCAAGGCGA 


CGCTGACGTG 


GTTTGAAGAG 


28380 


TATTACTGTC 


TATATTTTTG 


GTAAAAATCA 


ACTTTTACTT 


GGATGAAGGT 


TTTGGCTTCA 


28440 


CGTAGGAGTT 


GAAGAAGGGT 


GGCGCGGGTT 


TCAAATTCTT 


CTCTTGTCTT 


GGGCAGACTG 


28500 


CGGTTCCGGA 


AGACTTCCAG 


ATAACGTTCA 


ATTTCATCTA 


GCAAATCAGA 


AGCAGGATTG 


28560 


GTCTGGCTCA 


GTTGACCTGC 


AATTTTTGAA 


AAGAGTTGCG 


CTAAGATCAG 


GCTTTCACTG 


28620 


-GCGGCAAGGT 


GACAAGTGTT 


AATCTGTTGG 


GCCATGTTTC 


TCAGGATACG 


ACTTTGTCGC 


28680 


TGTCTCATCT 


CAAAGTAGTG 


GATATGGTAG 


TCTGTCTGGT 


GAAAGAGGTG 


GTCAGAGTGA 


28740 


TCCAAATAGA 


CCAGTCTGAG 


GGCTTCTTTC 


AAAAGCGTGT 


CTAATTCTGC 


TACCAGCTGT 


28800 


GCTCGGTTGC 


GTCCGTCTCC 


TCTGGATAAA 


TAGTATTTGA 


AGCGCTGGAG 


GATATCTTTT 


28860 


AACTTTTCTT 


CCACCAGCGT 


GTGGTAGTGC 


TGGATTTCCT 


CTTCTCGTGA 


AGGCATATAG 


28920 


AGATTAACAA 


GCAAGGCAAA 


TCCTGTACCA 


ATAGCAAAGA 


GAAGGAATTC 


ATTGACTAGA 


28980 


AGGTCTGGAG 


AGGTTGACTC 


TTGAACCAAG 


AGATGGCTAA 


CCAAAACAGT 


GCTTGGTGTG 


29040 


ATGCCAATTT 


CCCAGCCCAT 


CTTGTAGGCT 


AAAGGAACGT 


AGAAGGCCAG 


ATAGAGGCCG 


29100 


AGACTCCAGA 


TATGAAATCC 


GCTCAAGTGA 


AAAGCTAGAA 


CACCGATAGC 


CAGAGCTAGA 


29160 


AGCATAGAAA 


AAAGACGATT 


GCGAGCCAGT 


TTTAAAGTAC 


TTCTACGCGT 


ATCAGATAGG 


29220 


CTCAAGAGAG 


CGATAATTCC 


AGCCGAAACT 


GCTGACGAAA 


GATTGAGAAA 


ATAAGCAAGC 


29280 


AGGCAGGCAA 


GACAGGTAGC 


TAAGATGAGC 


TTGGTCGTAC 


GTTGGCTAAT 


AGACATAAGA 


29340 


ATTTCCTAAT 


AAGTTAGAAT 


AAAAGCGTAA 


AAGACAAGAC 


ATGAGCAGGC 


TTGCCTTGAT 


29400 


GAGTTATTTT 


TTACGGGTTG 


CTGCGTATTC 


GGCAACGGCG 


GTAAAGAGGA 


CATCTGTAGA 


29460 


AGAGTTAAGG 


GCTGTTTCAC 


ATGAGTCTTG 


GATGACACCA 


ATCACAAAAC 


CAACCCCAAC 


29520 


AATTTGTATG 


GCAATATCGT 


TAGAAATACC 


GAAAAGGCTA 


CAAGCAACTG 


GGATAAGAAG 


29580 


GAGGGAACCT 


CCGGCAATAC 


CTGAAGCATC 


ACAGGATGAG 


ATAGCTGCTA 


CCACACTGAG 


29640 


GACAAAGGCT 


GTGGCAAAGT 


CAACAGGAAT 


TCCAAGAGTG 


TTAACTGCAG 


CAAGGGTCAA 


29700 


AAGGTTAATG 


GTAATCGCTA 


CTCCAGCCAT 


ATTGATAGTA 


GAACCGAGTG 


GGATAGAAAC 


29760 


AGAATAGGTA 


TCTGGGTTGA 


GTCCAAGGTC 


ATGGCAGAGT 


TTCATGTTGA 


CAGGAATGTT 


29820 


AGTCGCAGAA 


CTACGAGTGA 


AAAAGGCTGT 


CACACCGCTG 


ACACGGAGGC 


AGTTCCAAAC 


29880 


TAGAGGGTAA 


GGATTGCGTC 


TCATAAAGAA 


GAAGGCAATC 


AAAGGGTTGA 


CCACAGGGGC 


29940 


AACAAAAAGC 


ATAGTCGTTA 


CTAATAGAAC 


CAATAAAATA 


CCGTAGTTGG 


CAAGGCTTCC 


30000 


GACTCCCTTG 


TCAGAAATGG 


TTTTAAAAAC 


AAGACCAAGG 


ATTCCAAATG 


GAGCCAGATT 


30060 
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GATGATCCAT 


TCGACAATTT 


TAGAAGTCAC 


GTCAGCGATA 


GTTTTTAGCA 


ATTCTTGACT 


30120 


ATTTTTACTG 


GCTTCTCTCA 


TAGCGATTCC 


AAAAATGACT 


GCCCAAGATA 


AGATTCTAAT 


30180 


ATAGTTAGCA 


GTAAGCAGGG 


CGTTGACTGG 


GTTGTCAACC 


AGTTTGAGCA 


AGAGGTTGCT 


30240 


GAGAACCTGC 


CCAATCCCAT 


CTGGTGGTGC 


AATTTCAGTA 


TTGGCACTAT 


TTGGGGTAAT 


30300 


TTCAATAGGG 


ACGATGAAAT 


TTGCTAGTAC 


AGCTACAAGA 


GCAGCGGCGA 


AAGTCCCTAT 


30360 


CATAGGATAT 


ACAAGAAAAC 


AACAGTTTTC 


ATATTGCTAT 


CTTGTCCCTT 


TTGATGTTGG 


30420 


GAAAGGGCAT TGGCAACGAG AGCAAAGACT AGGATAGGAG CAACAGCTTT TAGACCTCCA 


30480 


ACGAATAAAT 


CCTCGAGTAG 




GAGAGATTAfi 




tppt & ft " i 1 
lUplnuunl 1 


30540 


CCCCACAAAG 


CATACCAATC 


AAGATACGCT 


TGACAAGGrT 






30600 


TGATTCTTTT 


CATAATAATC 


TCCTTTTTGT 


GTAGTGATTA 


TGATTATAGT 


AT AA ATR AT A 


30660 


GACAAAATCA 


AGAATTTTCT 


GTCTATTTTT 


TGAATATTTA 


TGGAGAATfiA 


UnL X via X linM 


30720 


AATATGGTAT 


AATGAAATAA 


AGGAGTTTTA 


TATGCAAAAA 


TTT A TT f* A Ci 


CTT AT ATTfl A 


30780 


AAAGCTAGAT 


GTGACAACCA 


TTATCGAGAA 


TATTCTAACC 


AAGGTCATTT 


CTCTTTTACT 


30840 


GCTTTTAATT 


GTATTTTATA 


TTGCTAAAAA 


AATGCTTCAT 


ACCATGGTGC 


AGAGAATTGT 


30900 


CAAACCTTCT 


CTAAAAATGT 


CTCGTCATGA 


TGTTGGACGC 


CAAAAAACCA 


TCTCACGTTT 


30960 


AC T AG AAAAT 


GTGTTTAATT 


ATACGCTATA 


TTTCTTTTTA 


CTCTACTGCA 


TTTTGTCGAT 


31020 


TTTAGGTTTG 


CCAGTTTCTA 


GTTTGCTGGC 


TGGAGCTGGT 


ATTGCTGGGG 


TAGCGATTGG 


31080 


TATGGGAGCC 


CAAGGCTTTC 


TGTCTGATGT 


CATCAATGGC 


TTTTTCATCC 


TCTTTGAACG 


31140 


TCAACTGGAT 


GTGGGAGATG 


AGGTCGTTCT 


GACAAATGGA 


CCGATTACTG 


TATCGGGTAA 


31200 


GGTTGTCAGT 


GTGGGAATTC 


GTACGACACA 


GCTTCGTAGC 


GAGGAGCAAG 


CCCTTCACTT 


31260 


TGTCCCTAAC 


CGAAATATCA 


CAGTTGTTAG 


CAATTTCTCA 


CGCACAGACT 


AGACCTGTTA 


31320 


TTTTAAGTAA 


TTTGTGGTAC 


AATAGAGGGA 


GTTTAATAAG 


GAGAAAAGAT 


GGTTTTAGAA 


31380 


AAGCAGTTGG 


GCAATGGTTG 


TACCTGGATA 


GACCTAGACC 


TAGGAAAGTT 


GAATAAACTA 


31440 


GAAGACCTTT 


CTGAAATTTA 


CGGTTTGGAC 


AAGGAAACCA 


TTGAATACGC 


ACTGGATAGA 


31500 


AACGAGCGCG 


CCCACATGGA 


CTACCACCGT 


GAAAGTGAGA 


CGGTTACCTT 


TATCTATAAT 


31560 


GTCTTAGACG 


TAAAAAAGGA CAAGGCCTAC 


TATGAGACTT 


TTCCCATGAC 


CTTTATTGTC 


31620 


GAGCATCGTC 


GCCTGATTAC 


CATTAGTAAT 


ACCAAGAACG 


CCTATGTCAT 


TGAACAGATG 


31680 


ACTCGTTATC 


TGGAGAACCA 


TGACACGCTT 


TCGATTTATA 


AGTTTCTCTT 


TGCCAGTCTG 


31740 


GAAATCATCA 


GCAATGCCTA 


CTATCCTGTC 


ATTGAGCAGA 


TGGACAAGAG 


TAGGGATGAG 


31800 


GTCAATGACC 


TCTTGCGCCA 


GCGAACTACC 


AAGAAAAACC 


TCTTTGTCCT 


GTCTGATTTG 


31860 
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GAGACTGGTA TGGTTTATCT GACGGCAGCT GCCAAACAAA ATCGGATTTT GTTAGAGCAT 31920 

ATTCAAGGTC ATGCCTTGTA TCGTAGTTTT GATGAGATTG AGAGAGAACA GTTTGATGAT 31980 

GCCATGATTG AGGCTCATCA GCTGGTATCC ATGACAGACC TAATCTCTCA GATTTTACAG 32040 

CAGCTTTCAG CCTCTTACAA CAATATTCTA AACAATAATC TGAATGACAA TTTGACAACC 32100 

TTGACTATCA TTTCAGTCTT GCTAGCTGTT TTGGCAGTCG TGACAGGCTT TTTCGGAATG 32160 

AATGTTCCCT TACCTTTAAC AGATGAGCCC CATGCTTGGC TCTATATCAG TTTGGCTAGT 32220 

GCAGGTTTGT GGATTGTTTT ATCCTTGTTA CTAAGGAAAA TTGCGAAAAA AAGTTAAGAA 32280 

AAGGAGCCAG AATGGCGATT GAAAATTATA TACCAGATTT TGCTGTGGAA GCAGTCTATG 32340 

ATCTGACAGT CCCAAGCCTG CAGGCGCAGG GAATAAAGGC TGTTTTGGTC GATTTGGATA 32400 

ATACCCTCAT TGCTTGGAAC AACCCTGATG GAACGCCAGA GATGAAGCAA TGGCTACATG 32460 

ACCTTCGGGA CGCGGGTATT GGCATTATCG TAGTGTCAAA TAACACCAAA AAACGCGTTC 32520 

AACGAGCAGT TGAGAAATTT GGGATTGATT ACGTTTACTG GGCCTTGAAG CCCTTCACAT 32580 

TTGGTATTGA CCGTGCTATG AAGGAATTCC ACTATGACAA AAAGGAAGTG GTCATGGTTG 32640 

GTGACCAACT CATGACAGAT ATACGAGCAG CCCACCGTGC AGGGATTCGG TCAATTTTAG 32700 

TCAAACCCTT GGTCCAACAT GACTCAATCA AAACGCAGAT TAACCGAACT CGTGAGCGTC 32760 

GTGTTATG 32768 



(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14872 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

CCAGTCACAA AGAAATTGAG CGCGTTCAGc TGAGGATGCA CTATGATGCA AGCTACATTT 60 

CATTTGATGG GATATTAAGA AAGGAGATTT TCATGACACT TTTAGATGTA AAACACGTTC 120 

AAAAAATTTA TAAAACACGT TTTCAGGGCA ACCAAGTAGA AGCCCTCAAG GATATTCACT 180 

TTACCGTAGA AAAGGGTGAC TACGTTGCCA TCATGGGTGA GTCTGGTTCT GGTAAATCAA 240 

CTCTTCTCAA TATTCTAGCT ATGTTGGATA AACCAAGTCG TGGTCAGGTT TACTTGAATG 300 

GAACTGACAC CGCAACTATT AAAAATTCAC AGGCTTCTAG TTTCCGGCGT GAAAAGCTAG 360 

GATTTGTCTT CCAAGACTTT AACTTGCTAG ATACTCTGTC TGTTAAGGAC AATATCTTGC 420 
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TTCCGCTTGT 
CTGAGAATCT 
AGAAACAGCG 
ACGAGCCAAC 
AAATCAATGA 
GGGCCAAGCG 
AGAAGACAGA 
AGGTGAATTA 
CGCAAACTCT 
TTTTACTCCC 
GCAACACTTG 
CCAATAGTTT 
TGGAGAAGCG 
CTGTTGGAGC 
TCAAACTAAT 
CAGTACTTGT 
TCGCCCGTAT 
GCTTCCTACC 
CCCTTACGGT 
TTATCTTTGG 
AGAAAAACAA 
TCCGTATGAA 
TGGTAACCAT 
TAAATCCTCA 
TCTTGAGCCA 
ACAGTAACTT 
AAAACCGTGT 
TGACTGGTCA 
GACTGAAAGG 
TTAATAAAGA 



CTTGTCAAGA 
GGGTATTAAC 
TGTAGCAGTA 
AGGAGCCCTT 
GCGTGGGCAA 
TGTTCTCTTT 
GCGTCAGATG 
GTATGTTTCG 
ACTATCCCTT 
TAACCTTCAA 
GATTTGGTAT 
TGTCATGAAA 
CCATCTAATC 
GGGTATCGGT 
GAAACTGAAG 
TGTCTTTGGA 
GAATGCCCTC 
TCTCCAAACG 
AACCGATCCT 
TACTTATCTA 
GAAATACTAT 
GAAAAATGCG 
GTCAGCAGCG 
TGATTTTGGG 
GTTTGCAAGT 
TGGTATTGCA 
CCAACCCACA 
AAAACTGTCT 
ACAGAAAGCT 
TTTCATTGTG 



AGACCTATAA 
CAATTGCAAG 
GCCCGCGCCA 
GATTCCAAGT 
ACCATCCTCA 
ATCAAAGACG 
TTCCAAGAAA 
ATTAACCAAT 
TGCACTGGCT 
TCCAAAGATT 
GTTTGTCGTT 
AACCGTTCCA 
AGTATGACCT 
ATTGGAGCCT 
GTTGAGCTGG 
TTGATTTTCC 
CAGCTCTCGC 
ATTCTTGGTT 
CTTACAGCCC 
TTGTTTAATG 
TACCAACCTA 
GTTGGACTAG 
ACAAGCATTT 
GTTTCAGGGC 
GACAAAGGTT 
AATCAAGAAG 
ACAGTTTTCA 
CTATCAGGAA 
CTAACTCTAA 
AACCATGTTC 



594 
CGGAGATGAT 



AGAAGTACCC 
TCATCACAGA 
CATCTGCAGC 
TGGTAACCCA 
GCATTCTTTA 
TCTCTGATAC 
AAGTTAGCGG 
GTTCTCTTGG 
GCGGAAATCC 
ACCCTTGCGT 
AGGAACTGGG 
TTAAGGAGTT 
TGTTTGACAA 
TTGCTACCTT 
TAGGCCTCAT 
GTGAGAAAGC 
CCATAAGTTT 
TAACAACTTT 
CAGGGATTAC 
ATAACCTCAT 
CAACCATCGC 
TCAATTCCGC 
AAAATGTTGA 
ATAGTGTCAA 
GAACCAAGTT 
TGGTATTTGA 
ATGAGGTCGG 
ATGATCATCA 
CAAATAAGTT 



GAAGAAATTG 
TTACGAGATT 
ACCTGAAATT 
CTTACTTGAT 
CTCAACAGCA 
CAACCAAATC 
CTTGACTGTC 
TATCGAACTT 
CAGTCACCAT 
GTGGAGGAAC 
CAcCATTATC 
TATATATGGC 
AGTGGTATTT 
GTTAATTTTC 
CCAAATGAAT 
GTTCCTGAAT 
AAGCGGAGAG 
AGGGATTGGC 
CTTCCTAGCT 
AGTCTTCCTA 
ATCTGTTTCC 
TATTTTGTCA 
AGAAAGCTTT 
AAAAGAAGAT 
AGAGAAAGAA 
AACTATTTTT 
CCAAAAAGAT 
TCTCTTTGCC 
ATTTTCTGTC 
TAATATCTTG 



GTGGTGACAG 
TCTGGTGGTC 
CTCCTTGCGG 
GTCTTTAATG 
GCTGCTAGCA 
TACCGTGGAG 
ATGGCAAGCG 
GATTAAAAAC 
CACCTATCTC 
CACCATTCAA 
GTCCTCTATG 
ATGTTAGGCT 
GGGATTCTAA 
GCTTTCCTGC 
GTTGTCATTG 
GCTCTTCGAA 
AAAAGAGGTC 
TATTATCTTG 
GTTTTGCTGG 
CAAATCTTAA 
AACTTGATTT 
ACAATGGTTT 
AAAAAAGTTC 
TTGGACAAAC 
GTACTTCGTT 
GAAAAAGGAC 
TATGAAAATA 
AAAAATGACG 
AAAGAAGAAT 
ACTACTGATT 



480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
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ACAATTACCT 


TGTTGTTCCT 


GATTTACAAG 


CCTTTTTGGA 


TCAATTCCCA 


GATTCGGCTA 


2280 


TCTATAATCA 


GTTTTACGGT 


GGTATGAATG 


TAAATGTCAG 


TGAAGAAGAA 


CAACTCAAGG 


2340 


TCGCTGAGGA 


GTATGAAAAC 


TACCTCAATC 


AATTTAATGC 


TCAATTAGAC 


ACAGAAGGTA 


2400 


GCTATGTTTA 


TGGTAGCAAT 


CTAGCAGATG 


CTAGTTCTCA 


GATGAGTGCC 


CTCTTTGGTG 


2460 


GTGTCTTCTT 


TATCGGTATT 


TTCCTATCCA 


TTATCTTTAT 


GGTCGGAACT 


GTTCTGGTCA 


2520 


- TCTACTACAA 


ACAAATTTCT 


GAAGGCTACG 


AAGACCGTGA 


ACGCTTTATT 


ATCTTGCAGA 


2580 


AAGTCGGTTT 


GGACCAAAAG 


CAAATCAAGC 


AAACCATCAA 


CAAACAGGTT 


TTAACTGTTT 


2640 


TCTTCCTTCC 


TTTGCTCTTT 


GCCTTCATAC 


ATCTCGCCTT 


TGCCTACCAT 


ATGCTTAGCC 


2700 


TGATTTTAAA 


AGTGATTGGT 


GTACTGGATA 


CGACTATGAT 


GTTGATTGTG 


ACCTTGTCTA 


2760 


TCTGCGCTAT 


CTTCCTCATC 


GCCTATGTGC 


TGATTTTCAT 


GATTACTTCA 


AGAAGTTATC 


2820 


GCAAGATTGT 


GCAAATGTAA 


AAAAGATACC 


TCGACTTCAA 


AATCGAGGTA 


TTTCTTGTAT 


2680 


TCTAAATGCT 


GAAAAGTTGT 


CCGAGCAGGA 


AGGTAACTCC 


CATGGTCAAG 


AGACCAATAG 


2940 


CAAGGTTCCG 


AATCATAGCT 


GTTTTGGTTG 


GGGCTTTTCC 


AAGTCTAGCA 


CTTGTGTAAC 


3000 


CAGTGAGAAG 


AAGGGCCACA 


CCGACAATAA 


GGACGGTAGC 


AGGGATGCGG 


TAATCACTTG 


3060 


GAAAAATGGT 


CACTGACAGC 


ATTGGAGGCA 


AACTTCTAAG 


GAAAAAGGCA 


ACGAAGCTAG 


3120 


AAATGGCAGC 


GTGCCAAGGA 


TTGGTAAATT 


CTTCATACTC 


AATCCCATAT 


TTTTCCTCTA 


3180 


CCAGAGCCTT 


GAGTGGATTT 


TTAAGAAAGA 


TCTTATTGGT 


CAAGAGTTGG 


GCAGAAGTTT 


3240 


TGAATTCTCC 


ATTTTGGATA 


TAAGCAGCAT 


AGAGGGATTT 


TTTGGCTAGT 


TCCCTATCTT 


3300 


GGTCTAGCAA GAGTTTTTCT 


CGCGAAACGG 


CAGCTTCCTC 


GGTATCTTTT 


GGAGTTGAAA 


3360 


CGGATACATA 


TTCTCCACCA 


GCCATTGAAA 


AGGCACCAGC 


TAAGATAGCC 


GTAAAACCTG 


3420 


ATAAAAAGAT 


AATCCAGATA 


TTGGTCGTGG 


CACTGGCAAC 


TCCGATAACC 


ACACCAGCAA 


3480 


TGGAAATAAT 


TCCATCGTTA 


GCATCAAGAA 


CACCCGCACG 


CAGGATATTT 


AAACGACCTG 


3540 


CAAAATTTGA 


ATCAATTTCG 


TGATTTGTTT 


CTGACGCTAA 


ATTTCAAGTT 


CAAGTTAGCC 


3600 


ATCAAGAAGT 


CTTCTCTGGG 


TGACTTGTAG 


TCCAAGCATT 


TTTTAGGATA 


GTTGTTAATC 


3660 


CACTTTTCGA 


TGAATGCGAC 


TTCTTTGGGA 


GTCATTTTCT 


TGGTTCCCTT 


AGGTAACCAT 


3720 


CTACGAATGA 


GCCTGTTGTG 


ATTCTCATTA 


GTTCCCCTTT 


CCCAAGAGGC 


ATAGGGATGT 


3780 


GCATAATAAA 


TGTGCTCCTC 


AGAAAATACA 


TTAGACAAGC 


GATTGAATTC 


CGTTCCATTA 


3840 


TCTGCCGTGA 


TGGAAAGAAT 


CTTGTGTTGT 


TTTAAGATGA 


GTTTTAGAGC 


CTGATTGACC 


3900 


ACATCAGCAC 


TTTTATTTGG 


AATCAATCGG 


ATGATCTGAT 


GTCTACTTTT 


TCGATCCGTC 


3960 
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AAGACAAGCA 


AGCAGTAGTT 


TTTPPPTPTP 


o 1 AAV* 1 nuAA 




Q. 1 {.ATAATGC 


4020 


CCATTCTCCA 


AGCGAAAATT 


1 GATAflPTTPA 


i PPPPPTPTT 


en a tot* a tt , »' > 


ALv-AGCAGGT 


4080 


TTAAAGTTGG 


TGCTGGPPTY5 




GCTTTTCCTT 


I 1 L. TAGGGTA 


AAGCAGATCC 


4140 


TGTTTGCTTA 


APPPPAATTT 


tpp ii TP a it a 


A rv.(_AA i ACj T 


AAATGGTTGA 


AATTCCCACG 


4200 


TTA APPPPTT 


1 AlaCv A 1 vJAL. 


CAT CATTTC A 


GGCGAAAATT 


TTTGGTTATG 


ATAGTGGAGA 


4260 


A TPTTTTP PT 
niv,i ill V_v_ i 




GG rCAAGCTT 


GATTTCTTGA 


CCGAGCGCTT 


GCGATTGTTT 


4320 


TP AT A & fZ A PT 


ial xViAGv-AiA 


GTLGGUAGAA 


TAAACCTCTT 


TGAAGCGCCC 


TTTTCCAAGA 


4380 


P A TTY2TPPP ft 


t- i vj 1 v-LuALG 


CTTGATTTCA 


GTGTGGATAG 


TTTGAGGAAC 


TTTTCCAAGC 


4440 


AGAGAGGCAA 


TTTCTCTATT 


TGATTTCCCT 


TCTTTTTTCC 


ATCTTTCGAT 


TAAGCGACGG 


4500 


CTATCGATTG 


TCAAATGTTC 


GCCTTTTGTA 


GTATAATGGT 


TTTGCATCTC 


TGTGCCTTTC 


4560 


TTGTGTTTGT 


GGTTGAACAA 


CAAGTATAAC 


ACAGAGGTGT 


TTTCTTATGC 


CTACAAGAGC 


4620 


TATCGGCTAG 


TTGAACCATC 


TAATTTTTAG 


GAGGGCTGGG 


TGGCTAACTT 


CATTATAGAA 


4680 


CTTTCATTTA 


CGAACATATA 


GTAAAATGAA 


ACAAGAACAG 


AACAAATCGA 


TCAGGACAGT 


4740 


AAAATCTATT 


TCTAACAATG 


TTTTAGAAGC 


AGAGGTGTAC 


TATTCTAGTT 


TCAATCTATT 


4800 


ATATTTTTGT 


TTTTTATCAA 


AAAATACTTT 


ACAAGTTCTT 


AAAAACATGA 


TATAGTAATA 


4860 


AAGCTTAGAA AATGAGATGA TGTTTTCTAG CAAATATAAA CCCGAGTAAA AAATGCCTAC 


4920 


GGACAGGCAG 


GGTTGAATGC 


CGAAGCGTGG 


TTGAAAAGCC 


ACATTATTGA 


TAGGGTTAAA 


4980 


AGCCTACTTT 


TATAAGTTGA 


TGTTAGGACA 


CTTGTCCTAA 


TTCATAAATT 


TTTAGTGTGG 


5040 


TGAAAGCACA 


CGTCATCTTG 


TGAAACGATC 


AATAAAGTAC 


GTAATATTTG 


CTACTAGAGA 


5100 


GTTAGGAAAC 


ATCGGGAACA 


GACATACTCA ACAGAAACCA 


AAATAAACAC 


GTCAGAAGAT 


5160 


TGCAGAGCAG 


GTGAAAACCT 


GCTCTTTTTT 


CATGAGTCAA 


CCTTTAGTTC 


CTTAGTTTTC 


5220 


ATAAGGTCCT 


AAAAATATTG 


AAAGGAGTAT 


GTTTTGAAAG 


AGTTAGATCA 


AAACCAAGCC 


5280 


CCAATTTATG 


AGGCCTTGGT 


GAAGTTACGC 


AAGAAAAGGA 


TTGTTCCCTT 


TGATGTTCCA 


5340 


GGTCACAAGC 


GTGGACGGGG 


AAATCCAGAA 


CTTGTCGAAC 


TCTTAGGAGA 


AAAATGTGTA 


5400 


GGCATTGATG 


TCAATTCGAT 


GAAACCTTTG 


GATAATTTAG 


GCCATCCTAT 


TTCGATTATT 


5460 


CGTGATGCAG 


AGGAGCTGGC 


TGCAGATGCT 


TTTGGAGCTA 


GCCATGCCTT 


TCTAATGATT 


5520 


GGTGGAACAA 


CTTCATCGGT 


GCAGACTATG 


ATTCTGGCAA 


CCTGCAAGGC 


AGGAGATAAG 


5580 


ATTATTCTGC 


CACGAAATGT 


CCATAAATCT 


GCTATCAATG 


CGTTGGTTCT 


ATGTGGTGCC 


5640 


ATTCCCATCT 


ATATCGAGAT 


GAGTGTAGAT 


CCTAAGATTG 


GTATCGCTTT 


AGGTCTTGAA 


5700 


AATGACCGAG 


TAGCACAGGC 


CATAAAGGAC 


CATCCAGATG 


CTAAGGCTAT 


CCTAATCAAC 


5760 
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AATCCTACTT ACTACGGCAT CTGTTCAGAC CTAAAGGGGT TGACAGAAAT GGCTCATGAA 5820 

GCTGGCATGA TGGTTTTAGT AGATGAAGCC CACGGAGCGC ATTTGCATTT CACTGATAAA 5880 

CTTCCAATTT CTGCTATGGA TGCAGGGGCT GATATGGCAG CAGTTTCCAT GCATAAGTCT 5940 

GGTGGGAGTT TGACCCAAAG CTCCATTTTA CTTATCGGGG AGCAGATGAA TTCTGAATAC 6000 

GTTCGTCAGA TAATTAACCT GACCCAGTCT ACATCTGCCT CTTACTTGTT GATGGCTAGT 6060 

J TTGGATATTT CACGTCGCAA CTTGGCCCTT CGTGGTAAAG AGTCGTTTGA GAAAGTCATT 6120 

GAGCTATCTG AGTATGCCCG CCGTGAAATC AATGCTATCG GTGGCTACTA TGCCTACTCA 6180 

AAAGAGTTAA TAGACGGTGT TTCGGTTTGC GATTTTGACG TAACTAAGCT GTCAGTTTAC 6240 

ACTCAGGGTA TTGGCTTAAC AGGTATCGAG GTTTATGACC TCTTGCGAGA CGAATACGAC 6300 

ATTCAGATCG AGTTTGGTGA TATCGGCAAT ATCTTGGCCT ATATTTCCAT CGGCGACCGC 6360 

ATCCAAGACA TCGAGCGCTT GGTTGGTGCT CTGGCTGATA TTAAGAGACT CTATTCAAGA 6420 

GATGGAAAAG ATTTGATAGC AGGAGAATAT ATTCAGCCCG AGTTAGTGCT GTCTCCGCAA 6480 

GAAGCCTTCT ATTCAGAAAG AAAAAGTTTA ACTTTGGATG ATTCTGTTGG ACAGGTCTGT 6540 

GGAGAATTTG TTATGTGTTA CCCTCCAGGT ATTCCTATCT TGGCTCCTGG TGAACGCATT 6600 

ACACGAGAAA TTGTCGACTA TATCCAATTC GCCAAGGAAC GTGGTTGCTC CCTCCAAGGG 6660 

ACGGAAGATC CAGAGGTCAA TCATATCAAC GTTATTAAGA GAAAGACAAA CTATAAGAAA 6720 

AGTCAATAGT TTTATCTAAA CTATTTCTTA TTTCAATTTG ATGATTTGGC GATGATTTTA 6780 

GAGCACGGCA AAAAGCCCTT GAATTAGAAG CGGTCAATCG CTTAATTTCT ATCAGCTTAT 6840 

CAAATCCTGC CTCAAGCCTT TTCTGAGGAT TAGGGTAGCG TGTCAAGAGT TGGTAGGTAT 6900 

ATTCTGAATG CTTTCCAACG ATTTTATCCA ACTCAGGAAA GATGATATCA AGACAACGAG 6960 

TGTATTGTAC TTTCCAATCA GACTGTTTTT TCTTGAGACG ATGAATATGT CTAGCCAGTA 7020 

TTTTTAGTTC TACTTGCCGA TTATCGTGTT GAAATTGTTC ACGATTGGGG TCAGAAAGAA 7080 

GTTTAAGAGC GATGCCATGA GCGTCTTTCT TATCCGTTTT AGTTTTGCGA AGTGATAATG 7140 

ATTTGGCAAA TTTCTTGATG AGCAAAGGAT TGTAGGTGTA AACTTTATAT CCTTGTTCAT 7200 

GCAGGAAGTT CAGTAGATTA AAGGCATAAT GTCCGGTATT TTCAAGAGCG ATGAGACAGT 7260 

CTTGGTTGAG CTGTCGAAGA GACAGATCTA AGAGTTCAAA ACCAGCTTTA TTATTTGAAA 7320 

AAGTGAGTGG TTTAAGAACA GTTTTTCCTG GAACATTCAA GGCTGTAACA TCGTGTTTAT 7380 

TTTTAGCGAC ATCAATGCCC ACATAAAGCA TGGGAGTATC TCCAGATATA GTATTTCAAG 7440 

TCTACTGGGT TATCCACGAA CTTTTTGCCT TGTTACCTTA GACGAGATAA AACGTCTATG 7500 
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CGTTATCAAA CTCATTACCA ATTGAAACAA AAAACTGTGG TTAGAGCCTT TCGGAAATCG 7560 

TCAAGCGATT GGAGGAAATG AACTAATCCA CAGTGGCTTA TTCCAAGTAT ACCACTTGGG 7620 

CTTTGGCAGT AGCTAACTGC GCTAAATATA ATATAAGGAG AAATAGATGG ATTTATGGTT 7680 

TTCTGAAGTT CATACTCCAG ATGTCAAATT GTCTCTGAGA ACAGCCAAGC AACTTTACGC 7740 

TGGAAAAAGT GAATGGCAGG ATATCGAAGT CTTGGATACG CCAGCTTTTG GGAAAATACT 7800 

GATTTTAAAT GGCCATGTCT TGTTCTCAGA TGCGGATGAT TTCGTCTACA ATGAAATGAC ,7860 

CGTTCACGTT CCCATGGCTG TCCACCCAAA TCCAAAGAAA GTATTGGTTA TTGGGGGTGG 7920 

TGACGGCGGT GTTGCCCAAG TATTAACCCT CTATCCTGAA CTGGAGCAAA TTGATATTGT 7980 

GGAACCGGAT GAGATGTTGG TCGAGGTCTG TCGTGAGTAT TTCCCAGACT TTGCTGCAGG 8040 

GCTAGATGAT CCTCGTGTTA CCATTTACTA CCAAAATGGG CTACGCTTTT TGCGAAACTG 8100 

CGAAGATGAT TACGATATTA TCATCAACGA TGCGACAGAT CCATTTGGCC ATACGGAAGG 8160 

ACTCTTTACC AAGGAATTCT ACGGCAATAG TTATCGAGCT CTGAAGGAAG ACGGCATCAT 8220 

GATTTACCAG CATGGGAGTC CCTTCTTTGA CGAGGATGAG TCGGCCTGCC GAAGCATGCA 8280 

CCGCAAGGTC AATCAAGCCT TTCCAATCAG TCGGGTTTAT CAGGCCCATA TTCCAACTAG 8340 

CCCAGCTGGC TATTGGTTGT TTGGATTTGC ATCGAAAAAA TACCACCCTG TCAAAGATTT 8400 

TGACAAGGAA GGCTGGAAAA AACGCCAGCT TTTCACAGAA TACTACACTG CAAACTTACA 8460 

CGTGGGAGCC TTTATGTTGC CCAAGTATGT TGAGGACATT TTAGAAGAAG AGGAAGGAAA 8520 

AAAATGAGTC GTTTACTAGT TATTGGTTGT GGGGGCGTTG CCCAAGTTGC TATTTCAAAG 8580 

ATTTGTCAAG ATAGCGAAAC ATTTACAGAG ATTATGATTG CTAGCCGTAC CAAGTCAAAA 8640 

TGCGATGACT TGAAAGCGAA GCTAGAAGGC AAAACAAGTA CTAAAATTGA AACTGCAGCA 8700 

CTTGATGCTG ACAAGGTTGA AGAAGTGATT GCCCTGATTG AAAGCTACAA ACCAGAAGCT 8760 

GTTTTGAATG TAGCTCTGCC TTATCAAGAT TTAACCATTA TGGATGCTTG TTTGGCAACA 8820 

GGTGTTCACT ATATCGATAC AGCCAACTAC GAAGCAGAAG ACACAGAAGA CCCTGAGTGG 8880 

CGTGCTATCT ACGAAAAACG TTGTAAGGAA CTTGGTTTTA CAGCCTACTT TGACTACTCA 8940 

TGGCAGTGGG CTTATCAAGA GAAATTCAAA GAAGCAGGCT TGACTGCTCT TCTTGGTTCT 9000 

GGTTTTGACC CAGGTGTAAC TAGTGTCTTT TCAGCTTATG CCCTCAAACA CTATTTTGAT 9060 

GAAATCCATT ATATCGACAT TTTAGACTGT AATGGCGGTG ACCACGGTTA TCCATTTGCA 9120 

ACCAACTTTA ATCCAGAAAT TAATCTCCGT GAGGTTTCTG CGCCAGGTTC TTACTGGGAA 9180 

GATGGGAAAT GGGTCGAAGT CGAAGCTATG TCTATCAAGC GTGAGTATGA TTTCCCTCAA 9240 

GTTGGACAAA AAGATATGTA TCTCCTTCAC CATGAAGAAA TCGAATCATT GGCCAAGAAC 9300 
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ATTCCAGGTG 


TCAAACGCAT 


TCGTTTCTTT 


ATGACTTTTG 


GTCAATCTTA 


CTTGACGCAC 


9360 


ATGAAATGTC 


TTGAAAATGT 


TGGACTCCTT 


CGTACGGATA 


CCATTAACTT 


TAACGGCCAA 


9420 


GAAATTGTTC 


CAATTCAATT 


TTTGAAAGCC 


TTGCTTCCAG 


ATCCTGCCAG 


TCTTGGGCCA 


9480 


CGTACAGTCG 


GAAAAACCAA 


TATTGGATGT 


ATCTTTACAG 


GTGTCAAAGA 


CGGTGTCAAA 


9540 


AAGACTATCT 


ATATCTACAA 


TGTCTGCGAC 


CATCAGGAAT 


GTTACGCAGA 


GGTTGGTTCG 


9600 


-CAAGCTATTT 


CTTATACGAC 


AGGAGTTCCA 


GCCATGATTG 


GGACAAAATT 


AGTCATGAAC 


9660 


GGAACTTGGA 


AACAAGCTGG 


AGTGTATAAC 


CTTGAGGAGT 


TAGATCCAGA 


TCCATTCATG 


9720 


GAAGCTTTGA 


ATGAGTATGG 


TTTGCCATGG 


GTTGTGGTTG 


AAAATCCACA 


AATGGTGGAC 


9780 


TAATGAAGTT 


AGAACAAGTA 


CCAACACCAG 


CCTATGTTAT 


TGACTTGGCC 


AAGTTAGAAG 


9840 


CTAATTGCCG 


CATTCTACAA 


TATGTACAAG 


AAGAGGCCGG 


TTGCAAGGTC 


TTGCTTGCCC 


9900 


AGAAGGCATA 


TTCCCTCTAC 


AAAACTTATC 


CCTTGATTAG 


CCAGTATCTA 


TCAGGTACGA 


9960 


CAGCTAGTGG 


ACTCTATGAG 


GCCAAATTGG 


CAAGGGAAGA 


ATTTCCTGGT 


GAAGTCCATG 


10020 


TATTTGCGCC 


TGCTTTCAAG 


GATGCAGACT 


TGGAGGAATT 


GCTAGAGATA 


ATGGACCATA 


10080 


TAGTCTTTAA 


CTCAGAGAGA 


CAGTTGCGTA 


AACACGGTCC 


GCGTTGTCGA 


GAGGCTGGTG 


10140 


TCAGTGTTGG 


TTTGCGCCTC 


AACCCTCAGT 


GTTCAACTCA 


AGGcAGATCA 


CGCGCTCTAT 


10200 


GACCCTTGTG 


CACCAGGTTC 


TCGCTTTGGA 


GTTACTATAG 


ACAAGATTCC 


GAGTGATTTG 


10260 


CTAGATTTGG 


TTGACGGACT 


TCATTTTCAT 


ACCCTTTGCG 


AGCAGGGAGC 


AGATGATTTA 


10320 


CAAACAACTT 


TGAAAGCAGT 


AGAAGAACAG 


TTTGGTCCCT 


ACTTACATGA 


GGTAAAATGG 


10380 


CTCAATATGG 


GTGGTGGTCA 


TCATATTACA 


AGAGAAGGTT 


ACGATGTGGA 


TTTGCTGATT 


10440 


TCAGAAATCA 


AGCGTATCCG 


AAAAACTTAC 


AATCTTGAAA 


TCTATATCGA 


GCCTGGTGAA 


10500 


GCCATTGCGC 


TTAATGCGGG 


TTATTTAGCA 


ACTGAGGTAT 


TAGATATTGT 


AGAAAACGGT 


10560 


ATGGAAATCT 


TGGTTTTAGA 


CGCCTCTGCG 


ACCTGCCATA 


TGCCTGATGT 


ACTTGAGATG 


10620 


CCCTATCGTC 


CACCTTTGAG 


AAATGGCTTT 


GAGTCACAGG 


AAAAAGCCCA 


TACCTACAGA 


10680 


CTTTCTTCTA 


ATACCTGTCT 


GACGGGCGAT 


GTGATTGGTG 


ATTATAGTTT 


TGAAAATCCA 


10740 


GTCCAAATCG 


GAGACAGACT 


TTATTTTCAA 


GACATGGCCA 


TTTATTCTTT 


TGTCAAAAAT 


10800 


AATACCTTTA 


ATGGTATTGG 


ATTGCCAAGT 


CTCTATCTCA 


TGGACGAACA 


GGGAGACTGT 


10860 


AGCTTACTCA 


AAGCTTTTGG 


CTATCAAGAC 


TTTAAAGGGA 


GATTATCATG 


ATGGACAGTC 


10920 


CAAAAAAATT 


AGGCTATCAC 


ATGCCAGCAG 


AGTACGAACC 


CCATCATGGT 


ACCCTCATGA 


10980 


TATGGCCGAC 


TCGACCAGGA 


TCATGGCCTT 


TTCAAGGAAA 


GGCTGCTAAA 


AGAGCATTTA 


11040 
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CTCAGATTAT CGAGACCATA GCAGAAGGGG AAAGAGTCTA TCTTTTGGTG GAGCAGGCCT 11100 

ATCTATCTGA AGCCCAATCC TATCTTGGAG ACAAGGTTGT TTATTTAGAC ATTCCCACCA 11160 

ATGATGCCTG GGCGCGTGAT ACTGGCCCAA CCATTCTCGT CAATGATAAA GGTAAGAAAT 11220 

TAGCCGTGGA TTGGGCCTTC AATGCTTGGG GAGGCACCTA TGATGGTCTT TATCAAGATT 11280 

ATGAAGAGGA TGACCAAGTA GCCAGTCGTT TTGCTGAGGC CTTGGAAAGG CCTGTCTATG 11340 

ATGCTAAACC TTTTGTACTG GAAGGAGGCG CAATCCATAG CGATGGTCAA GGAACTATTC 11400 

TCGTAACTGA AAGTTGCTTG CTTAGTCCTG GTCGCAATCC TAACTTGACT AAAGAGGAGA 114 60 

TTGAAAACAC ATTATTAGAA AGTCTTGGTG CTGAAAAAGT TATTTGGCTT CCTTATGGTA 11520 

TtTATCAGGA TGAAACCAAT GAACACGTCG ATAATGTTGC TGCCTTTGTT GGTCCTGCTG 11580 

AGCTTGTTTT GGCTTGGACA GATGACGAAA ATGATCCCCA GTATGCCATG TCAAAAGCAG 11640 

ATCTCGAACT CTTAGAACAG GAAACAGATG CAAAAGGTTG TCACTTCACC ATTCATAAAT 11700 

TGCCTATCCC TGCAGTTCGA CAAGTTGTGA CAGAAGAAGA TTTGCCAGGC TACATCTATG 11760 

AAGAAGGAGA AGAAAAGCGA TACGCAGGTG AACGACTAGC AGCTTCCTAC GTAAACTTTT 11820 

ATATCGCCAA CAAGGCTGTC TTGGTTCCAC AGTTTGAGGA TGTAAACGAC CAAGTGGCCT 11880 

TAGATATCCT CAGCAAGTGT TTCCCAGACC GTAAAGTTGT CGGAATACCA GCCAGAGATA 11940 

TTCTCTTAGG TGGTGGCAAT ATCCACTGTA TCACCCAACA AATTCCAGAA TAGGAGAAAA 12000 

AGATGAGAAA TGTAAGAGTT GCAACCATTC AGATGCAATG CGCTAAGGAT GTGGCAACAA 12060 

ATATCCAAAC CGCAGAGCGT TTAGTACGTC AGGCTGCTGA GCAAGGAGCC CAAATTATTC 12120 

TCTTGCCCGA GTTGTTTGAA CATCCCTATT TCTGTCAGGA ACGTCAGTAT GACTACTACC 12180 

AGTATGCCCA ATCTGTAGCG GAAAATACTG CCATTCAGCA TTTTAAGGTG ATTGCTAAGG 12240 

AACTACAAGT TGTTTTACCA ATCAGTTTCT ATGAAAAAGA TGGTAATGTC TTGTATAACT 12300 

CTATTGCCGT CATTGATGCA GATGGGGAAG TGCTGGGCGT TTATCGAAAG ACCCATATAC 12360 

CAGATGACCA TTATTATCAA GAAAAATTCT ATTTCACGCC TGGTAACACT GGTTTCAAGG 12420 

TCTGGAATAC TCGCTATGCT AAGATTGGTA TCGGTATCTG TTGGGATCAA TGGTTCCCTG 12480 

AAACAGCGCG CTGTCTTGCA TTGAATGGTG CTGAATTGCT CTTTTATCCT ACAGCTATCG 12540 

GTTCAGAGCC AATTTTGGAT ACAGATAGTT GTGGTCACTG GCAACGTACT ATGCAAGGGC 12600 

ACGCAGCAGC GAATATTGTT CCAGTCATCG CAGCCAATCG TTATGGTTTA GAGGAGGTTA 12660 

CTCCTAGTGA GGAAAATGGC GGACAGAGCT CCAGTCTTGA CTTCTACGGT TCCTCCTTTA 12720 

TGACGGATGA AACAGGAGCT ATTCTAGAAC GAGCTGAAAG ACAAGAAGAA GCTGTTCTGT 12780 

TAGCTACTTA TGACCTAGAC AAGGGAGCAA GTGAACGCCT AAACTGGGGC TTGTTTCGAG 12840 
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ATAGAAGACC AGAAATGTAT AGACAAATTA CAGATTAGTG TGGGAGAAAT GAGAGATTCA 12900 

TTCTGCTAGA CTAACTTCTT ATTAGTAACT ATAAGATACT ATGGCATCTA GTAAATCGAT 12960 

TTTTATGATT CGCTATTCTT GTCTATTGAT TAGTCCGTAT TTTAAAATAT TAGCAAAAAA 13020 

GCAAATAGCA GTAACTTCTG TCTATTTGCT TTTCTTTTTT ATAGAATATA TTTCTCAATA 13080 

GCACGCGCAA CGCCGTCTTC TTCGTTGCTT GAGGTAACGG CATCCGCAAG AGATTTGATA 13140 

TAATCGCTGG CATTTCCCAT TGCAATCCCA AGCCCTGCAA ACTGGAGCAT TTCGATATCG 13200 

TTATTAGCAT CGCCCATGGC CATAATCTCT GAGGAATCAA TCTTCAAAAT CTCAGCTAGT 13260 

CGTGAAAGAG CAGTAGCCTT TGTCGTTCCA AGCGGCATTG CTTCATAAAT GACAGGCTGC 13320 

GAACGAACTC CACTGAATCG TTGGCAAAGC TCTTCAGCAA AACGCTGCTC AAAATCGTCT 13380 

GTTTGTTCTT TTGTTCCTAA ACACATACCT TGGAACATCC GGAACTTTCC ACTAGTCGCT 13440 

TCTTCAAGAG AAATTTCAGT CAGGTCTGAA AATACTAGTT TAGCATCATT TTCAATAACT 13500 

TGATTGGGCT TGTCACCGAG AACAAAATAA TGTGACTCGT CAAAAAGTGT CAACTGAACA 13560 

TCACTCTTTT CAGCAAGGTC ATAGAGGTAT TCGATGTCAG CTGGACTCAG TTCTTTCCAG 13620 

TCAACTAGAC TCCAATCACT GGTCTGGTGA GTTGAACAAC CGTTGTTAAC AATAATATAT 13 680 

TCGTTCTGGA GGTCAAGCTC CAGTTTTTTG TAGTAGGGGA GGACACCGAA AAGGGGGCGA 1374 0 

CCCGTACAGA GAACCAGTTT GACACCTTTT TCAATGGCTT TGTGAATAGC AGTAATGTGT 13800 

GCTTGTGGGA TTTCCTTGGC TTCATTGAGG AGGGTGCCGT CCATATCCAA GGCTAGTAGT 13860 

TTAATCATAG GTCTTCCTCT TTATCTTTGC TATTATTATA GCATATTTTG GAGAAGAAAT 13920 

TGATAGAAAG CTTGAGACTA ATTGATTTTA TAGTTTAAGA TGTTTTGATG ACAATTCATG 13980 

ATTTGAAGAG GATATTTCGC AAAGATATGC TATACTATGT TTGTCAATGT TGCAACTAGA 14040 

CAAATTAAAA AACCAACTTA ATATAATAGT TTTTTTGTAA GTAGGTATGA GTAGCAGATT 14100 

ACTCAACTAA TCTGAAGAAT AATGGAGGAA ATATATCATG ATTTTAATGA CAAAAAATAT 14160 

AAATCTAACA AATGAAGAAT TAGAGCTGAT ACAAGGTGGA GCAGATCCAT ATGGTAAAAA 14220 

TCCTAATGGT AGGTACGATT GGGAAATAGA ACCAGTATTA ACTCTGCTGG TTCATGGATT 14280 

TTGTCCCAGA GGCACCTATG ATTCAGGATA TATTGGAGGA GGTAATCATC TTTGCAAAGG 14340 

AAGTGCTGCG AGATTTTAAG TAAAATTTAT TAGGAATATG AAGAAACAAG GGGAGAAAAC 14400 

AGAGGATTTA ATATGAAAAA ACGAGCTATT CAAATTTTAC TAGCATTGTC CTTAATTTTT 14460 

TACAAATCAA CTTGGTTTTG GAGGCTTTTC AATTATCTCG CAAAGCCCTA TCTACCAGCA 14520 

AGTCGTGAAT TTTTTCAGAT TCTGCTTTTG ATGGAGAGCG GAGTTCTTTT CTTAGCGGTC 14580 
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ATCTATCTAC TGGTTTTTGC AGGAAAGAAA ATTTTTCATT TCAAGTGGCA GCTGAGGTAC 14640 

TTCATCTACC TTTTACTGGG CTACATCATT TCATATATGT CTGACTTCCT CTTTTCGTAT 14700 

TTCATATCCC TGTCTTCAAA TCAGATTTCT TTGAATGAAA CGGTAGAAAT GATGGGGAGA 14760 

CAGGAGTTCC CTTATGTCTT GCTCATCGTT TGCTTCATCG CCCCTATTGC TGAGGAATTG 14820 

ATTTATCGAG GtGTGCTTAT GACAACCTGT TGCAAAAACT CACCTTGGTA CG 14872 



(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10223 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

CGTGCTATCG GTCTCAAAAC CAATCTGGTC GCTATGGTCA AATCCAGTTG GAAAATCCAT 60 

TCTTCTTGGA GCCATCTGCT GGATTGCCAT CATCCTCACC ACTCTTGGTA TGCAGACCCT 120 

TATCGGCATT TTCTAATACT CTTCGAAAAT CTCTTCAAAC CACGTCAACG TCGCCTTGCC 180 

GTAGGTATAT GTTACTGACT TCGTCAGTTC TATCTGCAAC CTCAAAACGG TGTTTGAGCT 240 

GACTTCGTCA GTTCTATCTG CAACCTCAAA ACGGTGTTTT GAGCTGACTT CGTCAGTCGT 300 

ATCTACAACC TCAAAACAGT GTTTTGAGCT GACTTCGTCA GTTCTATCTG CAACCTCAAA 360 

ACAGTGTTTT GAGCAGCCCG TGGCTAGTTT CCTAGTTTGC TCTTTGATTT TCATTGAGTA 420 

TAACACAAAA GGTAGCCCAT CAGCTACCTT TTTCTTATGC TTCCTCAATC AAGCGAGTAT 480 

GTTCTCTCTT GATACAGCGA TTCATCACGA TATCATCACA TCCACCATCA CGCAAAATCT 540 

CTTTCGCTTC TAAACTTTCA AGTCCTAGCT GTGCCCAAAA AATCTTGGCA TCAGCTTTGA 600 

GAAAATCACG CGCCACATCG GGCAGAAATT CACTGCGACG ATAAACATTG ACAATATCTA 660 

CAGGAAAAGG AATTTCAGCG AGGCTAGCAT AAGCCTTTTC ACCCAAGATT TCGCCACCTG 720 

CCGCCTTGGG ATTGACTGGG ATGATTTTAT AGCCCCGAGC CTGCATTTCC TTTGTTACTC 780 

GATTGCTGGT TGTTTCTTCA CGGTCAGACA AACCCACCAC AGCAAGGGTT TTACTCGTTG 840 

CGAGATACTG ACGAATCACG CCATCACTTG GATTGATAAA TTCTTGACTC ATAGAAATCC 900 

TCCTTTTTCA TCAGTATAGC ACATTTTGAA AAGGTTTGCA GAATTATACT ACAAAAAAGG 960 

AGGACTAGCC CCCTTTTTAT TTAGCCTCGT ACCAGGTTGC CCCTTCATTC TCATCTGCGA 1020 

TAAGAGGAAC ACTGAGTTGA ATGGCTTCTT CCATGGTTTG TTTCACCAAT TTTTTCATCT 1080 

CTACCAATTC AGATTTAGGC ACTTCAAGGA CGATTTCATC GTGCACTTGT AACAGCATCT 1140 
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TAGTCTGATA ACCACCTGCA ACCAAGGCTT 
TATCTGCTGC CGAACCCTGG ATAGGTGAGT 
TATTGAAGTT GCGCGAATTG ATATCTGGCA 
CATAGCCCTT ATCACGCGCC TCCCGCACCA 
AACGTTCAAA GTAGGTATCA ATGTAGGCTT 
- TAGACAAGCC AAAGTCTGAA ATCCCATAAA 
GACGGTCGTT TGCAGTCACA TCATCAGGAC 
AAGTATGGAT ATCTGCCCCC TCTTGGAAGG 
GCGCCAAAAC GCGCAATTCA ATCTGTGAAT 
ACTCTGGCAC AAAAGCCTTC CGAATCAAGC 
GCAAGTTTGG ATCCACACTA GACAAACGCC 
TATGAATCTT TCCATCAGCC AAAATCCAGT 
TCTTAGCAAT TTGACGGTAA TCCAGGATTT 
GCTCTAAAAC ATCCACTGCT GTCGAATAAC 
GAAGTCCCAA TTTCTCAAAG AGAAGCACGC 
CCTCACCAGC CAGCTCGTAA ATCTCTTGAG 
CCTGCATCTC AAGCAAGGTC TCTTTCTTGA 
GGACAAAAGC CAGAGGTTGC TCCATATCAT 
GTTTTTCAAG TAAAATAGGC TCTGTTTCTA 
CCAAGAATTT CTCACGTTCA GGAATGGCCT 
CATCAACCAA GTAAGTCTGA CCATAAAGAC 
CAGTCGAAAG GAGGTATTTA GCCAAACGGA 
CAAAACGTTG CAAAAGAACT TTAACCTTCT 
CTAAGAAATC CTTGAAAATC GGGTCTTGCA 
TATCCCCACA AGACCAGACA AATCCAACCA 
GCTCAAAGTG GAAGATAGAC TCTTCACTCA 
TAAAATCCAA ACTCTCAGAC ACATCAGCTG 
TGAAGCCCAT CTCATCGTAG AATTTCCCAA 
AGTCCTCTAA ACCAATCGCA ATCGGTGCCT 



603 
TATCCAGCTG 
TGATAGCAGT 
ACTCACGGCG 
CTTCATCCAT 
TGGCTTCCTT 
CCACTCCAAA 
GCTCAATGCC 
CCTTAATCAA 
AGTCAGAGCT 
GCCCCTGTTC 
CGGTCTGGGT 
CCTGCAAGCC 
TCTTAACAAT 
CTGTCTTGGT 
CCAACTGCTT 
TCAGTTTTTC 
CCATAATCCC 
AAAGAAGCTC 
CCAAAACAGC 
TTTTAACACC 
TAGCGATGGT 
TGTCAAAAGC 
TAAAGTCATA 
ACAGCTCAAG 
AATTATCCGT 
GCATATCTTG 
ACGACACATT 
GATTTTCAAC 
TGGTATCAAT 



AATCATGGCA 
TCGCTCCGCA 
ACGCTTAAAG 
GTAGTTTTTA 
ACGACTAATT 
GTTAACTGCC 
AAAGACCCGC 
GTGCTCATCC 
GAGTAGCACA 
CAATCGGGCA 
CAAATCCTGC 
AATTACATAA 
CGGAGCAATA 
TTTCTTAGTG 
AGGCGAGTTG 
AATGACAAGC 
AGCAATTTCC 
TAATTGCCCA 
AAGTTTACAA 
CTTACCGTAG 
CGCAATTTCA 
AGGCGCCTGC 
AACTCTCAGA 
CTTGTCTGTG 
ATGGTAATTC 
ACTGATTTGG 
TAAAGCCTGC 
ATCTGGACCA 
GGTCGCTAGT 



ATCTTGAGAA 
AAACCACGAA 
AGGGTCTCTA 
ATACCTGGAA 
CCCAAATTAT 
TTGGCATTGC 
ATGGCTGTCG 
TTAGAAATAT 
CTATCCTCCC 
GGAATATTTT 
ACATAGCGAG 
GTAGATTGAA 
GGAGCGAGAC 
TATTCTAGAG 
ACATTAAACT 
TCATTTTCAG 
ATCTTGGCAA 
TTTTCGCTGA 
GCTAAGTGTT 
AAAGTTTCAT 
TTGTCCTCCA 
AAATCCACAC 
GATGTTTTTT 
GCATAGAGCT 
TCACCAAAAA 
TCAACAATAG 
TTTAGCTGTT 
CTATAGACCA 
GTTTTAGACA 



1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 
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AAAAGGCCTG TTCCTTGTCA TTGATGAGAT TTTCCTTCAT CTTAGAAGTC TTCATTCCAT 
CAATATTTTC ATAAATCCCC TCAAGCGAAC CATGCTCCAG CAAGAGCTTA ATACCCGTCT 
TTTCACCGAC TTTGGTCACC CCAGGGATAT TATCCGACTT ATCACCCATG AGCGCCTTGA 
GATCGATAAA CTGAGCTGGT GTGAGGCCCA TTTCTTCCAT GAGGTAATCT GGCGTAAAGG 
CCTCAAACTC AGCCACACCT TTCTTGGAAA TTTCAACCAC CGTATGCTCA TCCGTCAGCT 
GAATCAAATC CTTGTCCCCA CTGACAATAG TAATATCAAA ACCATCCTGC TCTGCTAGCT 
TATCCAGCGT CCCAATGATG TCATCCGCCT CATACTGAGC CAGATCATAG TGACGAATCC 
CCATATGATC CAGCAACTCA CGAATGAAAG GAAATTGCTC ACGAAACTCA TCAGGAGTCT 
TGGCCCGACC ACCCTTATAG TCCGCATACA TCTCTGTCCG GAAGGTCGTC TTTCCCGCAT 
CAAAAGCCAC CAAAATATGA CTCGGCTCAA CCCGCTCCAA TAAATGACTC AACATCAACT 
GAAAACCATA AATCGCATTG GTATGCAAAC CAGCCACATT CTTAAAACGG TCCAACTGCT 
C3ATACAGCGC AAAAAACGCC CGAAAAGCTA CAGAAGACCC ATCAATCAAT AATAATTTTT 
TCTTATCCAT ACACCCATTA TAAAGGAAAG AATCAAAAAA TACCATTGGG AAGAGCTAGA 
GCAAGTATTT TTCAAACTTT TTCCGAATAA ATAGATAGAG CCAGAGAATT TAGTAAACCT 
AGATTTAAAA ATGTGCTATA ATATAGTATA TTGAATCTAT AATAGTACAC CTTGACTGCT 
AAAATATTTC TATAAATTAA TTTGACTTTC CTGATAGAGT TATTCACATC TTATTTCAAC 
TCACTATAGA AGGAGGAATA GGAGGATTCT CAGACATCCG GGCATCAGCC CAACTAATGA 
TTTGATTGCT AAGAAAATAT TCAGCAATCC AGAAATCACT TGTCAATTTA TTCGCGATAT 
GCTGGACTTG CCAGCAAAAA ATGTGACCAT TTTGGAGGGA AGCGATATTC ACGTATTACT 
CTCCA TCC CT TACTCGGTGC AGGATTTTTA TACCAGTATA GACGTCTTGG CGGAGTTGGA 
TAACGGTACT CAAGTAATTA TTGAGATTCA AGTCCATCAT CAGAATTTTT TCATCAATCA 
CTTGTGGGCT TACCTGTGCA GTCAGGTTAA TCAAAATCTT GAAAAAATTC GTCAGCGAGA 
AGGTGATACT CACTAGAGCT ACAAACACAT CGCTCCTGTT TACGCCATTG CTATCGTGGA 
TAGTAATTAT TTCTCAGATG ACCTGGCTTT TCATAGCTTT AGTATGCGCG AAGACACAAC 
AGGTGAGGTA TTGGCGATTA CCAACAATGG ACAGGAAAAC CATCTGGTTA AGATGGCATT 
CTTGGAATTA AAAAATACAG AGAAACCAGC AAAGACAAGG TTCGCAAGCC ATGGTTGGAG 
™CGGCA ACAAGCCCTT TACCCAGCAA CCGCAACGAG CCATTACCCA AGCAAATCAA 
CTGCTGGACT ACAAGAGCTG GTCCGAGGAG GACAGGAAAA TGTTTAGTCA ACTACATATG 
CGAGAAGAAC AAGTCTTGTT AGCACAGGAC TATGCCTTGG AAACTGCTAG GGCTGAAGGC 
CTTGAACAAG GACTAGAGCG TGGGAAAGTT GAAGGAAGGG CAGAAAGGAA ACTTTTTGCC 
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605 



TTCCTAGACA TAGTACGCCA AGGTCTTCTG AC^AGG ^GCCAGCCA GCAATTAGGT 
ATGTCAGTAT C^AA^ ^ 
AAATCCCCTA ^ 

TAATGAAAAG CGACAAAACA ACTCATTAGA AAGAATCATA TGGAACAATT ACATTTTATC 
ACAAAATTAC TAGACATTAA AGACCCTAAT GTCCAGATTT TAAACATCAT CAATAAGGAT 
. ACACACAAGG AAATCATCGC CAAACTGGAC T ACGACGCCC CA^GCCC TGAGTGCGGA 
AACCAATTGA AGAAATATGA CCTTCTAAAA TTCCTTATCT TGAAACGACT 

-A^CCA CAAGAATTCT CCTTAGAAAG CGXCGA^A AGTGCTATCA C***^ 
ATGATGGTCG CTGAAACTTC TGATGACGTA CAGTCATATT TCTTCTCTTT TTATTATATC 
ACAGTTTTAA ATCTAGCTTT ACTAGATTCA CCGCTACTAT CTATTTATTC GGAAAAAAGA 
CGAAAAAACC TGAGAATCAT CTCAGGCTTG GTCATTAAAT TTTTTTCTCA ATATCGAAAA 
GTGGAGAAAG TGGTCGTTTT TCA^AC GTACGATAGC ATCCCCTAGG AGATGAGCGA 
TTGAAATCTG CTCAATCTTA TCAATCAAAC GCTCTTCTGG CAGATAGATG GTATCCAAAA 
CAACCAATTT CTTAATAGCT GATTTTTGGA TATTGTCCGT AGCAGGACCA GAAAGAACTG 
GGTGCGTACA GCTTGCATAG ACTTCAACAG CACCAGCC CGCAAGAGCA TCTGCCGCAT 
GACAAATCGT TCCAGCGGTA TCAATCATAT CATCAATCAA GATACAAGTC TTGCCTTCAA 
CCTTACCGAT GATATTCATA ACTTCACTAG TATTCATCTT ATCAACGCTA CGACGTTTAT 
CAATAATAGC GATAGATGTT TTCAAAAATT CTGCCAACTT ACGAGCACGA GTCACCCCTC 
CATGGTCCGG GCTCACAACC ACATAGTCAG AACCAACCAT ACCACGACGC TCAAAATAAT 
C^CAATCAG AGGAGCACCC ATCAAATGAT CCACAGGAAT ATCAAAGAAT CCTTGAATTT 
GCGCAGCATG CAAGTCGATG GTCAATAAAC GATCCACTCC AGCTACTTCA AGCATATTTG 
CGACAAGTTT TGAAGTGATT GOCTCACGCG CCTCCCCTT TCTATCCTGA CG^CATACC 
CATAGTAAGG CATGACAACA TTGACAGATT CW*** ACGCTTCAAA GCATCTACCA 
TAATCAAAAT ^ CAGGCGAACT AGTTGATTGT AAGATAAAGA 

CCTG^CCC ACGGATTGAT TCTTCAATGT TGACCTGAAT CTCTCCATCT GAAAATTCGC 
GAACACTTGA TTTC CCCAAC TCTATCCCAA T^CGC CACACG^ GCCAATTCTT 
TATTAGAAGA AAGGGCAAAC AGCTTTAAAT CAGAAAAAGA CATGATTTCC TCCGGTATAT 
ATGTATAACT TGTGCTTTTC ACAAGATTTT CCA T C T ACCA TTGTAGCGCT 
TTTTTCAATC AAAAATAAAA GAAGGGCACC ATATTTGTAC CC^CA^ 
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AAATATTCTA GGTCATCAAC TCATTGTGTT TCTCAa'caAA GCAATAAGCA TGATAAAAAC 
CATAGAGAGC AATAGCCGTA ACCACTGGAA TCGCTAAAGG CAACTCTGTT ' TCCAACTCCA 
CAAAAGGAGA GTTAAACAAG AAGTGAGTTC CCAAGGCTAA ACCTAGAAAA ATAAGGCCCT 

G-m-crrccc aaccttc™ cctttatagg ctc^taaag caagtaaaca cctactacag 

CTAGACCTGA AAAAGTCCAG TGAGAGGCAA TTCCTGAGAT GATACGCTCT AAAATTCGCG 

aaatagtaaa gtcaaagccc tctggcaaat ccgtacgaat ataaccaata tccttaatca 

TTTGGAATCC CAAACCGGAA GCAATTCCAA GTAAAAACAA AGATTTTAAT TTTCGCACAG 
GAATCAAAGC CAAAACAAAA ACAAGTGACA ATAATTTCAA GGGTTCTTCT ACCAAAGGAG 
CCGCAATAGC ACTTTCAAAG GCATTTAAAA ATGGACTATC TGGGAAAAGA ACCCCCAGTA 
AATCATGGAT ATAAGTATTA GCAAAACTAG ACAACCAGCC TGAAAGGAAC ATCCCTCCCA 
ATAAAGACAG AATCAAAACC TTCTTTGGCA ATTCCCATTT TTCCCAATAC GGAAGAGAAA 
ATAAAGAGCC GGAATCATGT AAAAGAGAGC TAGAAAGATA GAAACTCCCA TTAGTCCATA 
TTCCGCACCT GACCTCGAAC CGTCCGTATA GTAGATGGTT TCATACTGTA AACCAATACA 
TAGCAATAAA ataaaaataa ATAAAATATT GCTTTTCTTC ATACACTTTC ITTCTAAATG 
AAGTATTTAT AATTCTACGA CTGTCATACT TCCTGTATCA ACATTGTAAA TGGCACCAGA 
GATAATGACA TCGTCTGGTA TTAGGGGAGA CTCGATAAGC AGTTGCATAT CCTCGCGTAC 
ACTCTCTTCT ATATCTTGGA AGGGCAAGAA GTCCTGGTCT GACACATCGA CACCCAATTC 
TTCCTTCAAA TACTCCTGAA AAGGTTCATT TTCAAAGGTC TGAGCACCAC AGTCTGTATG 
ATGCAATACC ACAATTTCTC TTGTCCCCAT TTGTTGCTGG GAAATAACTA GAG AACG AA T 
CATATCCTCA GTCACTCGAC CACCTGCATT CCGCAAAATA TGAGCATCCC CAAGTGCCAA 
ACCTAGAGCT TGCGCAACGT GCAAACGTGA GTCCATACAG GTCACAATGG CTACTCTGGT 
TTTAGGTTTA AGTGGCAGAT TTAACTGCCC ATGTAGGGCA ACATAAGCCT GATTGGCTTG 
CATAAACTGT TCAAAATACG ACACGATTCC CTCCTTGAAA ATTTGATAGT CAAATATTTC 
TCCTATCTTA TCATTTTTAA GAGAATTTGT CACGGATTAT GCAAAGACCT TTTTCAAGAC 
TTCCTGAATC GTTGTCACGC CAATGACCTG AATTTCCTTA GGCAGAGTGA TTCCTGTCAA 
GGAATTCTTA GGTACATAAA TCTTAGTAAA GCCCAGTTTA GCAGCTTCGT TGATGCGTTG 
CTCAATACGA TTCACGCGCC GAATCTCTCC TGTCAAGCCC AGTTCTCCGA CAAAACATTC 
CTGAGGATTA GTTGGCTTGT CTTTGTAGCT CGAAGCAATA GCAACTGCAA CAGCCAAGTC 
AATCGCAGGT TCATCCAATT TAACACCACC AGCAGATTTG AGATAGGCAT CCTGATTTTG 
CAAGAGAAGC CCTGCCCGTT TTTCCAAAAC AGCCATAATC AAGCTAGCAC GGTTAAAATC 
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AAGTCCTGTC 
CTCCGCCAAA 
ATCCAAACGC 
CTGCATCTCA 
AATACGAAAG 
CAACATACGA 
AATGTTATTG 
AGACCCCTGC 
AAAGTCTGGC 
GAGATAAAAC 
AGACTCCTCC 
TAGGAGAAGA 
TACCACTCCG 
ATTGATGGAA 
ACGCGCATTC 
GTTGGGGCAA 
CGCTTTTTTC 
GGCAAAAATC 
CTACCACAGT 
CCGTAGCTGT 
ACAGTCTTCT 
ATAAAGGCTC 
AAGCTCTTGT 
TCAATTTCTG 
CTATATAAGA 
TCTTCATTCT 
CCATCTGCAA 
ACAACCTCTT 
TTTCCATAAT 



GTAGTACGCT 
ATCGGACGCG 
TCTTCTAGGA 
AAAATCCCAA 
GTGTGGTGAC 
GGCCCAGCCA 
GTCTTGGCCA 
ACCCCTGAAA 
TGGATACGCT 
TCACTATCAA 
CCACTGACAT 
GTTGATTTCC 
CCTCCAAGCA 
GTCACCTCAG 
TTAACTTCGG 
CGTCCCAGAT 
TTTGCGATGA 
AATCTTCTCA 
CTGATGTTCT 
CTCATATTGA 
AGCTGTTTCA 
TACTCTTAAA 
GACTGCAGAA 
GACTTGCTGC 
ACGCTCCTCT 
AGTGCCCTGT 
TTAAGAAAGT 
GGTCTGTGAT 
AATCCCCATC 



TGGCATTTCC 
TCCCTTCCAT 
AAACT-TGACT 
TCTCATTAGT 
GCTCCCCTTC 
AGGTTCCTTC 
ACTGCATGAG 
TCTCAGGAGA 
CCACTTCTGC 
TATCACCTAA 
AGAGAACTGT 
CAATCCCAGG 
CACGGTTGAA 
CTAGTTTCAT 
CAACCTCAAC 
ATTTAGGGGA 
CAAACCTCTT 
TTTGGCACAA 
CGATACTTAG 
ATAGGACTAT 
AAGTTTTCTA 
GGAAGACCCG 
GATACGATTA 
CGTCCCATCT 
AACTCACGGT 
CGATCCAAAT 
AGCAAAAACA 
ATTCTTCATC 
AATGACTCCA 



607 
AAACATGGTC 
GGTTACAACG 
CGGATTGAGT 
GGAACCAAAA 
AAAGTAAAGC 
TTTGGTCACA 
TTCAGCGGTC 
CATGATGGTC 
ACGAACACTC 
GCGCTCTGCA 
CCCCACTTGG 
ATCCCCACCG 
TTCCTCCATC 
GGGCTTGGTT 
CTCTTCCACA 
ATTATACCCA 
TCTATATCTC 
ACTGGCGCAT 
ACATACATTC 
TAGGAAGCAA 
TTCCTGTTTT 
TAGCACAGAC 
TTTCAGCTGA 
CAGACAAGGG 
AATCTGGCTC 
CCACCAGTTC 
GCCTGGACAA 
TGCGCAAAAA 
ACTGAGTTAA 



GGTGTTACCA 
ATGGAGGAAC 
ACCTCAACCA 
CGATTTTTGA 
ACCGTATCCA 
TGACCTACGA 
ACTTCACGCA 
TGGATGGAAT 
TGCATATTGG 
CGTAGTTTAA 
GACAACTGGG 
ATAAGGACGA 
TCCGTCTTGG 
TTCTCACCTG 
AAAGAAGACC 
CAATTTTGAC 
TAACTCACAC 
GAGCATTCGA 
TAGAAACCGA 
CTCCCCCTTG 
ATAGACCTGC 
CACATACGAA 
CGAGAGTAAA 
TGCCAAATCT 
CCCATGACGT 
GAACGCCATC 
TACGCTCCCC 
TATGCCCTTC 
TTAAAACCAA 



AAGCCTGAAC 
CAGTCGCCCC 
AGCCGCCCGA 
CCGCTCTCAA 
CCATATGCTC 
TAAAGATGGC 
CCTGAGAAAC 
CAATAATGAG 
TCTCTGCATA 
TCTGCTGGGC 
TTGAGACTTG 
GACTTCCTGG 
TTCGATTGAC 
TCAAGGACAC 
AAGACCCACA 
ATACAAATGT 
TCAATCACTT 
TGAGCAACAA 
GACTTCATTT 
TTTTCTAAAA 
CATTCATGTA 
GCCGTTTCTA 
GGATTTTTGC 
ATCCCAAATC 
ACAAAGATAA 
AGCTGCATCT 
AACTTCAAGA 
ATTTCCAGGA 
GCCCTTCTTA 
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608 

CGAGGATTTG AAGAACGATC ATAGAGGTAG AGAACCTCAG TCGGCTGCAT ATAAGCCTTA 10020 

ACCCCTGTCG GAACCAAGAC AATCTCTCCT GGCGCAACAA CTGTACGCAC AGCAACCTTT 10080 

AAGTCGTAAC CAGTCGCATG CGCTGTCTCA CGCTTGGGCA ATAAATTTTC ATCTGTAAAA 10140 

CTCGAAACCA ATTCAAAACC ACGAATTTTC ATAATTTTCT CTTTTCTATT ATCATTTATT 10200 

CTAGATTATT CTATACTTAT TTA 10223 
(2) INFORMATION FOR SEQ ID NO: 74: 

(ii SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 16535 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

TGGTTCTGTC CTTATCGGCG CCTTGTCTTG CTTGCCATGG CTACACCAAC TATCTCATCC 60 

GACGAAAGTA CACCAACCAC TAACGAACCC AACAACAGAA ATACAACCAC CCTTGCCCAA 120 

CCTCTTACTG ATACAGCAGC TGGCTCTGGT AAGAACGAAA GTGATATTTC TTCACCTGGA 180 

AATGCAAACG CTTCCCTAGA GAAAACAGAA GAAAAACCTG CTGCAAGCCC AGCCGATCCA 240 

GCACCACAAA CTGGACAAGA TCGTTCAAGT GAGCCAACTA CTTCTACTAG TCCAGTAACA 300 

ACTGAAACTA AGGCAGAAGA GCCCATCGAA GATAACTACT TCCGTATCCA TGTCAAAAAA 360 

CTTCCTGAAG AAAACAAGGA TGCTCAAGGA CTATGGACTT GGGACGATGT TGAAAAACCA 420 

TCTGAAAACT GGCCAAACGG AGCTTTGTCC TTCAAGGATG CCAAGAAAGA TGACTACGGC 480 

TATTACCTAG ATGTCAAATT AAAGGGAGAA CAAGCCAAGA AAATTAGCTT CCTCATCAAC 540 

AATACAGCTG GAAAAAATCT AACCGGCGAT AAATCTGTAG AAAAACTAGT TCCAAAAATG 600 

AACGAAGCTT GGTTAGACCA AGATTACAAG GTTTTCTCTT ACGAGCCACA GCCTGCAGGA 660 

ACTGTTCGCG TCAACTACTA CCGCACAGAT GGCAACTATG ACAAGAAATC TCTCTGGTAC 720 

TGGGGAGATG TGAAAAATCC AAGTAGCGCT CAATGGCCTG ACGGAACAGA CTTTACGGCT 780 

ACAGGCAAAT ATGGCCGCTA TATCGACATT CCTCTTAATG AAGCCGCAAG AGAATTTGGA 840 

TTTTTATTAC TAGATGAGAG CAAACAAGGA GACGACGTGA AAATCCGTAA AGAAAATTAT 900 

AAGTTCACAG ATTTGAAAAA TCATAGCCAA ATTTTCCTAA AAGACGATGA TGAATCGATT 960 

TACACAAATC CATACTATGT CCATGATATC CGTATGACAG GAGCCCAACA CGTAGGCACT 1020 

TCTAGCATTG AAAGTAGCTT TTCAACACTT GTCGGTGCTA AAAAAGAAGA TATCCTCAAA 1080 

CACTCCAACA TCACTAATCA CCTAGGAAAC AAGGTAACTA TTACCGATGT TGCAATCGAT 1140 
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GAAGCTGGTA AGAAAGTGAC CTACAGCGGA GATTTCTCTG ACACAAAACA TCCTTATACT 
GTTAGCTACA ATTCCGACCA ATTCACTACC AAAACAAGCT GGCGCCTGAA AGATGAGACA 
TACAGCTATG ATGGCAAACT GGGAGCTGAC CTAAAAGAAG AAGGAAAACA AGTTGATTTG 

ACCCrTTGGT caccaagtgc tgataaggtt tctgttcttg tctacgacaa gaatgaccct 

GACAAAGTAG TTGGAACTGT CGCTCTTGAA AAAGGGGAAA GAGGAACTTG GAAACAAACT 
. CTAGACAGCA CAAACAAACT CGGAATCACA GATTTCACTG GCTACTATTA TCAATACCAA 
ATCGAGCGTC AAGGTAAAAC TGTTCTTGCA CTCGATCCTT ACGCTAAATC TCTTGCTGCT 
TGGAATAGCG ACGATTCCAA GATTGACGAT GCCCATAAAG TGGCTAAAGC CGCCTTTGTA 
GATCCAGCTA AACTCGGACC TCAAGACTTG ACTTATGGTA AGATTCACAA TTTCAAGACT 
CGTGAAGACG CCGTTATCTA CGAAGCTCAT GTGCGTGATT TCACTTCAGA TCCTGCCATT 
GCAAAAGACT TGACCAAACC ATTTGGGACT TTTGAAGCCT TCATTGAAAA ACTAGACTAT 
CTCAAAGACT TGGGTGTAAC CCATATCCAG CTCCTTCCAG TCTTGTCTTA CTACTTTGTC 
AATGAATTGA AAAACCATGA ACGCTTGTCT GACTACGCTT CAAGCAACAG CAACTACAAC 
TGGGGATATG ACCCTCAAAA CTACTTCTCC TTGACTGGTA TGTACTCAAG CGATCCTAAG 
AATCCAGAAA AACGAATCGC AGAATTTAAA AACCTCATCA ACGAAATCCA CAAACGTGGT 
ATGGGAGCTA TCCTAGATGT CGTTTATAAC CACACAGCCA AAGTCGATCT CTTTGAAGAT 
TTGGAACCAA ACTACTACCA CTTTATGGAT GCCGATGGCA CACCTCGAAC TAGCTTTGGT 
GGTGGACGCT TGGGGACAAC CCACCATATG ACCAAACGGC TCCTAATTGA CTCTATCAAA 
TACCTAGTTG ATACCTACAA AGTGGATGGC TTCCGTTTCG ATAT GATGGG AGACCATGAC 
GCCGCTTCTA TCGAAGAAGC TTACAAGGCT GCACGCGCCC TCAATCCAAA CCTCATCATG 
CTTGGTGAAG GTTGGAGAAC CTATGCCGGT GATGAAAACA TGCCTACTAA AGCTGCTGAC 
CAAGATTGGA TGAAACATAC CGATACTGTC GCTGTCTTTT CAGATGACAT CCGTAACAAC 
CTCAAATCTG GTTATCCAAA CGAAGGTCAA CCTGCCTTTA TCACAGGTGG CAAGCGTGAT 
GTCAACACCA TCTTTAAAAA TCTCATTGCT CAACCAACTA ACTTTGAAGC TGACAGCCCT 
GGAGATGTCA TCCAATACAT CGCAGCCCAT GATAACTTGA CCCTCTTTGA CATCATTGCC 
CAGTCTATCA AAAAAGACCC AAGCAAGGCT GAGAACTATG CTGAAATCCA CCGTCGTTTA 
CGACTTGGAA ATCTCATGGT CTTGACAGCT CAAGGAACTC CATTTATCCA CTCCGGTCAG 
GAATATGGAC GTACTAAACA ATTCCGTGAC CCAGCCTACA AGACTCCAGT AGCAGAGGAT 
AAGGTTCCAA ACAAATCTCA CTTGTTGCGT GATAAGGACG GCAACCCATT TGACTATCCT 
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TACTTC ATCC ATGACTCTTA 
GCTACAGATG GTAAAGCTTA 
ATTGCCCTTC GTCAATCTAC 
CGTGTCCACC TCATCACTGT 
GGCTACCAAA TCACTGCTCC 
AAAGCTCGCG AATTTAATTT 
GCAGATGAAA ACCAAGCAGG 
GAAAAAGGCT TGAAATTGAA 
ACTAGCCATG AGTCAACTGC 
CAAAATGAAG CTTCTCACCC 
ACTAAACCAG ATGCCAAAGT 
TCACAAGCTG AACAACCAGC 
AACGAATCGG TAGAAAACTC 
GAACTTCCAA ATACAGGAAT 
CTTGCGCTCC TTGGTCTCGG 
CCTATAGAAA AATCCCCCAA 
TGTCCTAATA AACTTGATTA 
TACTTTGGGT GCAACTTGTG 
CATAGAACCA AGCGGTAGAT 
AATCAATTTT TCGGCCACCT 
CTGCTCCAAA TATTGCTCAT 
ATCCACCACT TGCTTAGTCG 
AATCCACCCC CAAGAATGGG 
CCAATCTCAC GATAAGCCTG 
GCATATACAA TCGGTAGACC 
GTAGCTATCC ACAAGGGCAA 
ATCGTTTGAG TCAATCGACC 
AAGTCTAATT TCTCATCAAA 
AAAGATTCCG TGAAAGAGCC 
TCGATAGTGG CATACTGTTG 



610 

CGATTCTAGT GATGCAGTCA 



TCCTGAAAAT GTCAAGAGCC 
AGATGCCTTC CGACTTAAGA 
CCCAGGCCAA AATGGTGTGG 
AAACGGCGAT ATCTACGCAG 
GGGAACTGCC TTTGCACATC 
ACCAGTCGGA ATTGCCAACC 
TGCCCTTACA GCTACTGTTC 
AGAAGAGAAA CCAGACTCAA 
TGCACATCAA GACCCAGCTC 
AGCTGATGCG GAAAATAAAC 
ACAAGAAGCA CAAGCATCAT 
TAGCAAGGAA AATATACCTG 
CAAAAACGAA AACAAACTCC 
TTTCTTACTA AAAAATAAAA 
GCATTATAGC TCGGGGGATT 
GGATTTTTTA TTAAGCCTCT 
TTCCGAAGAG TTCAATAGCT 
GAAGCATGAA GCGGTCCAAT 
GATCTGGATT GCCAACAAAC 
AACGCAATTC CTGCCAGTGC 
GATGGAAATA ATCTTTCACC 
CTCCCACTTT CAAGTCTTTG 
AATCAACTTT TTAAAATAAC 
AGCCTGAGCA ATCTTCACTG 
TTTGTCCTGA ACTGGACGAG 
TTGCCAGTCT AACTTGGTCT 
AAGAGAGTCG TAGTCTTTCA 
CCTTCCAGCC ATAATCTCCG 
GAACAAACGA ATCGGGTCCA 



ACAAGTTTGA CTGGACTAAG 
GTGACTATAT GAAAGGTTTG 
GTCTTCAAGA TATCAAAGAC 
AAAAAGAGGA TGTAGTGATT 
TCTTTGTCAA TGCGGATGAA 
TAAGAAATGC GGAAGTTTTG 
CGAAAGGACT TGAATGGACT 
TTCGAGTCTC TCAAAATGGA 
CCCCTTCCAA GCCTGAACAT 
CAGAAGCTAG ACCTGATTCT 
CTAGCCAAGC TACAGCTGAT 
CTGTAAAAGA AGCGGTTCGA 
CAACCCCAGA TAAACAAGCT 
TATTTGCAGG AATCAGCCTC 
AAGAGAACTA AACTAGCCCT 
AATTTTTGTA CAATATTTGT 
TTCATAGCAA AATAAGCTCG 
CTCAGAACCT GGTCATGAGG 
CCTAAATCCT CTATCATGCG 
ATGGCGCCAT TTGGCCCTAC 
GGACGGTCTT TGGAAATAGC 
GCCTGCTCAC CATCTTCCGC 
TCAGCATGGC CCCTTCGCTT 
GTGGATTACC ACCAATAATA 
TTGATTCGAC ATGACCACCT 
GATAAACTTC TTTACCAGCA 
TTTCATTGAC TAACTGAAGC 
AGTCATAACC AAACAGAGGG 
ATCGTCCATT TGACAAAGCA 
TGCTTGACAG AATGCTGACT 



2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 



WO 98/18931 



PCTAJS97/19588 



611 

AACGGATTTT CTTGGTATTG ACTGCCCCAG CGGCCAGAAC AATCTCTGGG 
GCTGATACTG CAAAATCCGC CCGATGCTCC TCACCAATCC CATATACATC CAAACCAACC 
™CAGCCA CCTCAAO^C TGCCACCAAC TGGCGAATGC GTTCAGCATG ACTGTAAGTT 
TGTCCAGTCC CTTCAAGCTC CGTTA^CC CCAAATGTTG AAATTCCCAA TTCTACCATT 
GTGATTCTCC TTATCTATCT CTGTACTTCA ATTTGAAAAA TTATTCTAAC ACGAATCTTG 
. AGTACAAGCA ACCGA^C TCATTAGAAA AAGCCTAGAT AACTAGACTT TTTTAGCTTA 
-CTACCGO. AC^ACrrOC CAAGGrTACG TCG ™ CC ACATCGAGGC CACGGTGGAG 
GGTTGCAAAG TAAGCGACTA A,™* TACGACCATT GAAATTGGTG AGAGGTATGG 
ATGTACGGTC GTAAGGACGA TATCGTCGGT ATC^GCT ACATTCTCTT CTGCGATAGT 
GAGGACTTTG GCACCACGGG C^CGACCTC CCACGAGTAT GAT^CCAAG 

AACTGGATCT GACAAGAGAG CCAAAACAGG CGTTCCTTCT TCAATCAAGG CAA^rTCC 
GTGCTTGAGT TCTCC^CAG CAAAGCCTTC ACACTGGATA TAAGAAATCT CTTTGAGTTT 
GAGACTTGCT TCCATGGCTA CGTAGTAATC ^ACCACGT CCGA^AAA AGGCGTTACG 
AGTTGTTTCA AGAAGTTCAC GAACCTTGAC rronan, TCTTTCTCTG AAAGAGTTGA 
TTCGATAGAC TGAGCTACGA TTGACAATTC ATGAACCAGG TCAAAGGCTT GCGCTTTAGC 
A^ACCA™ GCTTCT CCGA C^CTT^c mggaaggca AGGGCTGCGA 
ATAGGCTTTA GTTGATGCCA CGGCAATTTC AGGACCTGCG TGAAGGAGCA TGGTATAGTT 
GGCTTCACGT GAGAGGGTO AACCTGGAAC GTTTGTCACT GTTAAGC™ OAATTcccat 

^AGCC ttgaccaaaa cttgacgact atccgctgtt tcaccaga™ GGCTGA.AAA 

GATGAAGAGT GG™CT TCC TGAGAAGTGG CATACCGTAG CCCCACTCAG ATGAGATTCC 
AAGTTCAACT GGTGTATCTG TCAATTCTTC CAACATTTTC TTAGAAGCAA ATCCTGCATG 
GTAAGATGTT CCAGCTGCAA GGATGTAGAT GCGGTCTGCG TCTTGAACAG CCTTAATGAT 
ATCTGGGTCT ACGACAACTT GACCAGCCTC ATCTGTGTAG GCTTGGATGA GTTTCCGCAT 
AACAGTTGGT TGCTCGTCAA TTTCCTTGAG CATGTAGTAA GGGTAAGTTC CC^ACCGAT 
ATCTGACAAG TCAAGTTCAG CAGTGTAGCT AGCACGCTCA COACGA^ CA^T^ 
TTGAACTTCC ACACTATCAG CCTTGACGAT TACCAACTCT TGGTCATGGA TTTCCATCTA 
TTGGTTAGTT TCACGAATCA TAGCCATGGC GTCTGAGCAG ACCATGTTAT AGCC^TCC 
AAGACCAATC AAAAGTGGTG ATTTATTTTT AGCTACGTAG ATGACTTCAG 
GTCAACCAAG GCAAAGGCAT AAGAACCACG GATGATGTGA AGGGCTTTTT TGAAGGCTTC 
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AAGAACTGAG 
ATCTGTCTGC 
CTCAATCACC 
ATTGTCCTCA 
CTCAACACCA 
GTTATCAGCA 
TTCAAGCCCT 
ACACATAGTA 
TTCATCTTTT 
AAAAATTGGT 
AATCAGTCAG 
TGATTTCTTC 
TTTATAGCTG 
GTTGGCACCA 
ACGGTAATCA 
ACCATTCTCA 
GTCATAAATC 
CATCACATAA 
TCGAGGAGCC 
ATCACGAATG 
GATTTCTACC 
CTGAAAAAGG 
AGCCGGTGTC 
CTTAACAGCC 
ATCCACCTTA 
CTCGTTAAAG 
ATAGTCTTCA 
GGCAAAAGGT 
AGCCTCAAAG 
TGGAAAAATC 



AGCCCTTCTT 
CCCTTGAAGT 
CCATTATGCA 
GTTGGTTTTC 
GCTGTCTTGG 
CCATCTAGGA 
TGAATCAAAA 
TATACGACAC 
ATAGAATCAG 
ATAGTTCAAA 
AGTCCTTTTT 
AGACGACGTT 
TTGAGCCATG 
TCTTCAATGG 
TCTTGAATCA 
GTCAACATCC 
CCTTGCTCAT 
GGCTCGTAAA 
ATAACACGCA 
AGTTCCAACT 
AACTCCTGTG 
GCCGCAATAC 
AAGTTGAGGA 
CGGCTGCTGG 
TGTGGATAAT 
GTAATCCATT 
TAGGCTGAGA 
AAATCAAAAT 
ACCTTACGAT 
CGTGACCACT 



CTTCCGCAAA 
GGTGACCTGC 
CCAAGACAAA 
CGTGAGTAGC 
CAGACAATTC 
CAAAAATTCC 
TATCAGTTGC 
AGGCAAGCTG 
CAAAAACAGT 
TTAAGCTCCT 
AAAATCCATT 
CTTGCGTTTC 
ACCAGCAGTC 
CACGGTGAAG 
TTCCATCTTG 
ACTCAATATT 
AAATCTCCCA 
AATGTTCTGG 
AAGGTTGATA 
CTTCCTCTGT 
GATAAGTCCC 
GAGCTGCCTT 
CAATCCCAAT 
CCAATTGTGT 
GGGCATCATA 
GATCCACTAA 
CTGTCGCCTT 
GATAGAGATT 
AAAAATCCAC 
GAATAGAAGT 



612 
TTTTCCAATC 



AAGGTATTCT 
ACGTTCCGTC 
CCAACGAGTA 
TGCAATACGA 
CGCAGAATCA 
ATTTGTGTTT 
TGCTTTCTCC 
ATATACTTGT 
GTAAGCATAA 
ATTATCGCTT 
CAAGTCTAAT 
AATAAAGGTC 
TTCACGAAGA 
ACGGAATTTT 
GCCATAATTT 
ACCACGGTGA 
TAAGAGTGGA 
GTAGTTCACA 
AGCATCAGGT 
CAAGACAGAT 
GACATCAGCA 
CTTGGAATCA 
ATGATAGGCT 
AAAATAACCA 
ATCTCCATAA 
ATTTTCCCAA 
GACTAACAGA 
ACCTTGAGTG 
CCGAAAGGCT 



AAATGAACGG 
TCCTTGATTT 
TCAGAGCGGT 
TGTCCGATAC 
CCAACCGCCT 
TAGCCACGGT 
CCAACAACAC 
TTAAAATTGG 
TTCTTTCACT 
AAACTCTGAC 
AATTCTTTGA 
TCGACCAAAC 
CAAATCAAGT 
TGACCTTTTA 
TCTTCCCCTT 
TCCTTGATAT 
GAATTGATTT 
CTCTCTGGAT 
CCAAGGAAGT 
AAAAGACCGT 
GGATCTAAGA 
GGATGCTGGC 
GGCAAAAGTT 
ACCTTAACAG 
AATTCTACAG 
GTCTCAAAAC 
CCATCACCAT 
CGAATTCCTT 
TTGACTTTTC 
GTGTGACCAG 



CTATTTCAGT 
CAAGATAGTT 
GTGGGTGAGC 
CAGTTGTTCC 
TCACCAAATG 
ATTCAAGCTT 
CAACAATTCC 
TATAGTCTAA 
TGTCAAGAGT 
CGATTGGGAT 
ACCAGTGGCC 
CATAGCGATT 
AGCCCTTACA 
CAAAGTCAAT 
CAACACCCAT 
TTTGGGCGAT 
TACGTCCAGG 
GCTTAGCAAA 
CCACCGTATT 
GTTCATGCAA 
AAGATTGGGC 
TACGTGGATA 
CATGGCAAGC 
CTGCCTCTGC 
GAACGATGGG 
AAAAACGAGC 
CCTCTTGAAG 
TAGCCTTAAT 
CACAGCCTTG 
TCTCTAACAA 
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AAGCTCAATA 


TCCCGCTCCC 


AATTTTCATA 


AAAAGTCGAT 


GTCTTATCTG 


AACCAATCCC 


8280 


ATTATAGTAA 


CGATTTGGCT 


CCACTTGGAA 


CCAGTAATCC 


CAGAGATTGT 


CTCCCTTACC 


8340 


GTCACCAGCT 


ACACGTCCTT 


CTGTCTGCGG 


TCCAGAAGTA 


GAGGATCCCC 


AGACAAAATC 


8400 


CTTTGGAAAT 


CTTAGCATAC 


ATTTACCTCT 


TTATCTACTC 


ATTTCTCCCA 


TTATACAGAA 


8460 


AAAACAAGGT 


AAAAACTAGT 


TACATTTTTT 


CCTTGTTTTT 


CTTCTGATTA 


TAGTTTTTAT 


8520 


TTCTTGCTTA 


GGATTTCAAG 


CGTTTCAAGC 


ACGTTATCTG 


CATGAACCTC 


AATGGTGTCA 


8580 


CCAGTTGCCT 


TGATCTTAAC 


TTCTACAATG 


CCATCGGCCG 


CTTTTTTACC 


AACAGTGATA 


8640 


CGGATTGGAA 


GACCAATCAA 


GTCACTATCG 


CTAAATTTAA 


CACCGACACG 


TTCGTTACGG 


8700 


TCATCTGTCA 


AGACTTCATA 


ACCAGCTCCC 


ATCAAGCTTG 


CTTCAAGTTT 


TTCTGTCAAG 


8760 


GCTTGCGCTT 


CTTCATCCTT 


GACATTGACA 


GTAATCAAAT 


GCACATCAAA 


TGGTGCCAAT 


8820 


TCTTTAGGGA 


AATTGATTCC 


CCAAGCGTAA 


CGGTATTCAC 


CTTTTGGCGT 


TTTGTTAACA 


8880 


AAGAGGCGAG 


CGTGTTGCTC 


CATCACTGCT 


GAAAGAAGAC 


GGCTGACACC 


GATACCGTAA 


8940 


CATCCCATGA 


TGATTGGCAC 


AGCACGACCA 


TTTTCATCCA 


AGACATCTGC 


TCCCATGCTT 


9000 


GCTGAATAGC 


GAGTTCCGAG 


TTTGAAAATA 


TGACCGATCT 


CAATACCACG 


CGCAAAGTTA 


9060 


AGGACACCTT 


GTCCATCTGG 


GGAAATTTCA 


CCCTCACGAA 


CTTCACGGAT 


ATCCACATAT 


9120 


TCTGCAGTAA 


AATCACGGCC 


TGGGTTCACA 


CCAGTCAAGT 


GGTAGTCATC 


TTCGTTAGCA 


9180 


CCGACAACTG 


CATTGCGAAC 


ATCTTGTACC 


TTACGATCTG 


CAATAATTTT 


AATATTCTCT 


9240 


GGCAAACCAA 


CTGGTCCAAG 


TGAACCAAAT 


CCTGCTTGAA 


CAACATTCGC 


CACTTCTTCT 


9300 


TCGCTAGCAA 


CGTCAAAGAA 


ATCTGCTCCC 


AAGTGATTTT 


TCAACTTGAC 


TTCGTTGAGT 


9360 


TGGTCATTTC 


CAACTAGAAG 


GGCTGCAACA 


AGCTCACCAT 


CTGCAATGTA 


GAAGAGGGTT 


9420 


TTAATCGTTT 


GTTCTTCTGG 


AACATTGAGG 


AAGGCTGCAA 


CTTCATCAAT 


TGATTTAACA 


9480 


TCTGGCGTTG 


CAACACGAGT 


AACTTCTTCT 


TCAGCGACAA 


CACGGTTGCT 


TGGTTTGTAC 


9540 


TCGTTTGTTG 


CCATTTCTAA 


GTTAGCTGCA 


TAGCTAGACT 


CACTTGAGTA 


AGCAATGGTA 


9600 


TCTTCACCAG 


AGACTATCCA 


TTTGAGCAAT 


TCTGCCTTGA 


TTTCTTCTTG 


CACTTCTGCA 


9660 


GGAATTTCGT 


CAAATGAGGC 


AACTGACTTG 


TCCAAGACAA 


CCCAGCGGTC 


AAGGTCTGTA 


9720 


CGAGCAGATG 


TAATGGCCAT 


AAATTCTTGG 


CTATCCTTAC 


CACCCATGGC 


TCCACCGTCA 


9780 


CCAATAATAG 


CCTTGAAGTC 


TAAACCACTA 


CGAGTGAAAA 


TACGCTCATA 


GGCTGCTTTG 


9840 


TACTCATCAT 


AAACACTATC 


CAAACTATCA 


TAGTTAGCGT 


GGAAACTATA 


AGCATCCTTC 


9900 


ATGATAAACT 


CACGTGTACG 


AAGAAGTCCA 


TTACGCGGGC 


GTTTTTCATC 


ACGATACTTG 


9960 
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GGCTGAATTT 


GATAAAGGTT 


GAGTGGCAAT 


TGCTTGTAAG 


ATTTAACAGA 


ATCACGGACA 


10020 


ATAGCTGTAA 


AGGTTTCTTC 


GTGAGTTGGA 


CCTAAGATAA 


AGTCTGATTT 


TTCACGGTTT 


10080 


TTTAGTTTGT 


AAAGGTCTTC 


ACCATAGGTT 


TCGTAACGAC 


CTGATTCACG 


CCACAATTCT 


10140 


GCACTAAGAA 


GGGCTGGAGC 


CAACATCTCA 


ACAGCACCAA 


TCTTTTCGAA 


TTCTTGGCGC 


10200 


ATGATGTTTT 


TAGCTTTTTC 


AATCACACGG 


TTGGCAAGTG 


GTAGATAAGA 


ATAAACACCT 


10260 


GCTGAAACTT 


GGCGAACATA 


ACCAGCACGC 


AACATAAGAG 


CATGGCTGAT 


AACTTGAGCA 


10320 


" TCGCTTGGCA 


TTTCGCGAAG 


CGTTGGGATA 


GGCATTTTAC 


TTTGTTTCAT 


AATATTCCTC 


10380 


GATTATCTAA 


AAAAGAGTCG 


CATAATGTCA 


TTCCAAGTCA 


CAGCAATCAT 


CAAGACAACC 


10440 


ATGATGACCA 


CTCCGGCCAA 


GGTGACATAG 


GTTTCAATTT 


CTTGTTTCAA TGGTTTGCGG 


10500 


CGGATGGCTT 


CTAGGATATT 


GAGCACAATC 


TTACCACCAT 


CCAAGGCTGG 


AATCGGAATA 


10560 


AGATTAAAAA 


TCCCAATATT 


GATGGAAATC 


ATTGCCAAGA 


AGTACAAGAT 


ATTTTCAATT 


10620 


CCATTTTTAG 


CAGCATCACT 


ACTTGCCTTA 


AAGATAGCAA 


CAGGTCCACC 


CAACTTGTTC 


10680 


AAATCTGGTT 


GGAAAATCAG 


ATTTTTCAGA 


GCTGAGAGAA 


TTCGGAGAGC 


TGAGTCAGCA 


10740 


GCAGTTGTAA 


AACCACCTAC 


AAACATGGAT 


AGAAAATCTG 


ACTTAACCCC 


CGGTTGAACA 


10800 


CCTAGAAGGT 


AACGACCTTG 


ACTATCTTTG 


GGTGTAACAG 


TGACTTGTTT 


GTCACTCCCC 


10860 


TTTTCAGAAA 


TAGTCACATC 


CAAAGTCGGT 


GCCGTCTTAT 


CTTTGGTTTC 


TGTTTCCACA 


10920 


GCTTGGATCA 


AGCTTTCCCA 


GTTGCTAACC 


TCATGTGAGC 


CAATCTTGGT 


AATTTGTGCC 


10980 


ATTTCTGGTA 


CTCCTACCTT 


GGCCAAGGCA 


CCTTGGGGCA 


TGATATGGAA 


CTGATTGGTA 


11040 


TCAACATCTC 


TGACACCACC 


CTGCATAAAG 


ATTAAAACCC 


AAAAAACAAC 


GACACCTAAG 


11100 


ATAAAATTGT 


TCATAGGACC 


TGCAAAATTG 


GTAATCAGTT 


TGCCCCAGAT 


AGTCGCATTT 


11160 


TGATATTGAA 


CATCTAAAGG 


TGCAATCCGA ACCTCAGTAC 


CATCTGCTTC 


CACAACCGTT 


11220 


GCATCGTGAT 


CCACTGCAAA 


TGTTTTTTCT 


TCTTCCAGAA 


CCAATCCTTT 


GATAAAGAGC 


11280 


TTGTCTTCAA AATCAAACTG 


GGTCACCTGC 


ATAGGGAGGG 


CTGTTTGATC 


CAATTTTTTA 


11340 


CCTGAGAGAT 


TGATGCGTTT AACCTTACCA TCATCAGCAA 


GTGTCAAACT 


AACAGGCGTT 


U400 


CCTGTCTTGA TTTCAGTTGT 


ATCATCACCC 


CAACCGGCCA 


TGCGGACATA 


GCCACCCAGA 


11460 


GGCAAGATTC 


GAATGGTATA 


GGCCGTTCCA 


TCCTTGCCAA 


TGTGAGCAAA 


AATTTTAGGT 


11520 


CCCATACCGA TGGCAAATTC ACGTACTAAA ATCCCTGATT TCTTGGCAAA GTAGAAGTGA 


11580 


CCGAACTCGT 


GCACCACTAC 


AATAATCCCG 


AAAACCAGAA 


TAAAGGTTAA 


AATTCCGAGC 


11640 


ATAGCGTTTC 


CTCCGTCTTT 


TGATTAAAAG 


AGTCCAAATA 


AGTGCATGAT 


TGGAAATACA 


11700 


AGCAACATAC 


TATCGAAACG 


ATCCAAAACA 


CCACCATGTC 


CAGGGATAAA 


TTTCCCAGAA 


11760 
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TCCTTAACAC 


CAAAATGACG 


TTTGATCGAA 


CTTTCTAGTA 


AATCACCAAA 


TTGTCCAGCA 


11820 


ATGCTAAAGA 


AAATAGCAAA 


GACTGACATC 


TTGTAAATTC 


CATATGGAAG 


AGCAACTGTA 


11880 


CTGTCAACTA 


TCATAAGGAT 


AATGGTTACT 


AAAATTGCTC 


CTAAAATACC 


ACCCAAGGCA 


11940 


CCCTCAAGGG 


TTTTATTAGG 


CGATACCCTT 


GGTGCTAACT 


TTCGTTTCCC 


ATAGTTCATC 


12000 


CCAACAAGAT 


AGGCACCACT 


GTCTGTCGCC 


CAGACGATAC 


ACAAGGCTAA 


GAGAGCCTTG 


12060 


TCCAAACCTG 


CAACACGAGC ATCTAGTAAA GCATTAAATC 


CAAAGCCCAC 


GTAGAAGCTC 


12120 


ATAGCAAGAG 


GGAAAACCGC 


ATCCTCAATC 


GTATAAGACT 


TGCTAAAAAC 


GGTCGTTCCT 


12180 


AACATGATTG 


AAATCAAAAC 


ACTATAGGCA ACCACATTCC 


CATCAACTGG 


CAAAAAAGTC 


12240 


AGGTAATTCT 


CCAAGGGAAT 


GGTCAATGCA 


AAGGTTGCAA 


AGAGGGTCAA 


GAGGCCCTCC 


12300 


ATCGTCATGG TCTCTAGACC 


TCTCATCTTC 


AAAAGTTCAT 


GCATGGCTAG 


CATGGCTATG 


12360 


ATTCCGATTG 


CTATCTGAAG 


CAAGAGGCCC 


CCAATCATTA 


AAATTGGTAG 


GAAAATAGCC 


12420 


AGGGCAATCC 


CTGCAAACAA 


GGTTCTTTTC 


TGTAAATCCT 


GGGTCATATT 


TCCTCCTAAA 


12480 


CTCCTCCAAA 


TCGGCGATGA 


CGACGATTAT 


AGGCAAGAAT 


AGCTTCCTGC 


AAGGCCGCTT 


12540 


CGTCAAAATC 


AGGCCATAAG 


GTGTCCGTAA 


AATAAAGCTC 


ACTATAGGCT 


CCCTGCCATG 


12600 


GAAGGAAATT 


GCTCAAACGT 


AATTCTCCAC 


TAGTACGGAT 


AATCAAGTCT 


GGGTCTCGTA 


12660 


AGTCCTTAGG 


CAAATGCTGA 


GTAAAGAGAT 


AGTTACCAAT 


CAATTCCTCT 


GTGATGTCAC 


12720 


CTGGGTTGAT 


TTTGGCATCT 


AAAACATCCT 


GGGAAATCAA 


CTTAAGCGCC 


TGTGTAATCT 


12780 


CAGCACGTCC 


ACCATAGTTA 


AGAGCAAAAT 


TAAGAATCAA 


TCCTGTGTTG 


TTCTTAGTCA 


12840 


ATTCCTCAGC 


CTTGGTTAAA 


GCTTCAAAGG 


TTTGCTTAGG 


CAGGCGGTCT 


GTCTCCCCAA 


12900 


TCATTTGAAT 


CTTAACATTA 


TTCGCATGTA 


GTTCCGGGAC 


ATAATTATCA 


TAAAACTCTA 


12960 


CTGGCAAGTT 


CATGATAAAC 


TTGACTTCCT 


GATCTGGACG 


GGTCCAGTTT 


TCCGTAGAAA 


13020 


AAGCATAGAC 


CGTAATAACC 


TTGACGCCCA 


GTTTGTTGGC 


TGCCTTGGTC 


ACGGTTTGCA 


13080 


ATGCTTCCAT 


GCCCGCCTTA 


TGTCCAAAAA 


CTCGCGGTTG 


CATACGTTTT 


TTAGCCCAAC 


13140 


GGCCATTGCC 


ATCCATGATG 


ATGCCGATAT 


GAGCAGGAAC 


CTGTGTCGGA 


ACCTCTACTT 


13200 


CCACAGCCTT 


ATCTTTCTTA 


AAAAATCCAA 


ACATGATCTT 


ATTCCTATTC 


AAAAATCTAT 


13260 


CGTTTCATTA 


TACCATATTT 


CCCCATTTTC 


TTCTATCACT 


AAGCTATTTA 


TTCTCAGGCA 


13320 


CCAAGCCCAT 


TTTTCAAAAA 


AATAAGCCGC 


CTGATTGGGC 


GACTTTATTT 


TTATAGGGAG 


13380 


ATTATTATGA 


AAAAGTTTTA 


GGAGTTTAAG 


TTAAGGTCTT 


CTTAACTTAT 


GAACTTAGTG 


13440 


TACACTCCCT 


AGCTTAAAGT 


TTCCTTAAGT 


ATTTTTAAAA 


ATCAAATTTT 


TCCATTTCTC 


13500 
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CTGCCAATTT 


TTCTTGGATA 


AACGTGTTTG ATAGAGTTCC 


ATTCGGTCTT 


CATTTTCTAA 


13560 


GAAATGAGGA 


GTTGGACGAA 


CTTGAAAATT CAAAATATCC 


TCCAAACCAT 


AAGGTACATA 


13620 


GAGTTCAAAA 


TCTAATTCTT 


CATTCAAGCG CAGTCCAACT 


GCCGTACACC 


GTTCTGGATA 


13680 


CTTACTCATA 


GCATCACGAG 


AACTGGTATA GGAAGCAGTG 


TGAGGACTGT 


GCTGATGCAT 


13740 


ATAGACCTGA 


TTTTTCAATT 


CCCACTGGTA CTGAGGAAAA 


TCCTCTCTCA 


GCTTTTTCTC 


13600 


CAGTAATAAG 


GTTTCCTCAT 


AAGAAAAATC TGGATCAAAG 


AAAATCACAT 


CTATATCTGT 


13860 


TTCATGATCA AAAGGGGATT TGTCTGACAA AAGATTCCAG ATGAAATTTC 


TGACAGAACC 


13920 


TGCTGCCAAC 


CACGAGTCTT 


TCAAACCAAG GTCTCGGATG 


ATCGTCAGAA 


TGGCCATCAT 


13980 


ATCTGGACTT 


TCTCTAAAAG 


CCTCTAAGAT TTCTTGCTTA 


TTTTTCACTG 


TATTCATAAC 


14040 


CTAAGTGCTC 


ATATGCCTTA 


GCAGTCGCCA CCCGTCCAGA 


CCGTGTCCGC 


ATGATAAAAC 


14100 


CTTTTTGAAT 


CAAGTAAGGC 


TCATACATGT CTTCAACTGT 


CTCACGCTCT 


TCGGCGATAT 


14160 


TCACAGAAAG 


AGTTCCTAGA 


CCAACAGGTC CTCCACTGTA 


CATCTCAATC 


ATGGTGCGAA 


14220 


GGATTTTTTG 


ATCCACATAG 


TCCAAACCTT CATGGTCAAC 


ATCCAGCATA GTCAAAGCCT 


14280 


TATCGGTAAT AACATCATCG ATAACCCCAT TCCCCATTAT CTGGGCAAAA TCGCGCACGC 


14340 


GCTTGAGGAG 


ACGATTGGCA 


ATACGAGGGG TTCCACGACT 


ACGTAGGGCC 


AACTCAGATG 


14400 


CTGCCTCATG GGTGATTTCC ATCTCAAAAA TATCTGCCGT CCGCTCGACA ATTTCTGTCA 


14460 


AGTCAGCATG 


AGCATAATAC 


TCCATATGAC CTGTAATCCC 


AAAACGTGCC 


CGTAGTGGAT 


14520 


TTGAGAGCAT 


ACCAGCCCGA 


GTCGTCGCAC CAATCAAGGT 


AAAAGGAGGC 


AACTCCAAAT 


14580 


GAACACTGCG 


ACTGCCTTCA 


CCAGCCCCAA TCATAATATC GATGTAGAAG 


TCCTCCATGG 


14640 


CACTATAAAG 


CACTTCTTCC 


ACTGACATGG GTAAGCGATG 


AATCTCGTCA 


ATAAAGAGGA 


14700 


CATCTCCAGG 


CTCTAAATCA 


TTCAAAATCG CTACCAAATC 


ACCCGCTTTT 


TCGATAACAG 


14760 


GACCAGACGT 


TTGCTTGAGA 


TTGACTCCCA GTTCATTGGC 


AATGACAAAA 


GCCATGGTTG 


14820 


TTTTCCCAAG 


CCCTGGAGGG 


CCAAATAAGA GCACATGATC 


CAGCGCTTCA 


TCCCGCATTT 


14880 


TAGCGGCTTC 


GATAAAGATC 


TGAAGTTGAT CCTTAACCTT 


ATCCTGACCA 


ATATATTCAC 


14940 


GTAAATACTG 


AGGACGGAGC 


GTGCGTTCTA CTAACTCCTC 


ATCACCCATC 


ATCTCATTAT 


15000 


CTAAAATTCT 


ACTCATGGCT 


CTATTATATC AAAAAAAACA 


AGCCACAAAC 


AAAAAAGCCA 


15060 


CCTGATTGGG 


TGACTCCTAA 


GTTTAGCACT TATGTGGTAT 


AATATTATAC 


GGCACTTCTA 


15120 


CACCGCCTAC 


GAAAGGAGGT 


GAGATAGCCC ATGATGGAAT 


TAGTACTCAA 


AACTATTATC 


15180 


GGACCAATTG 


TGGTCGGTGT 


CGTTCTTCGT ATAGTCGATA 


AATGGCTAAA 


CAAGGACAAA 


15240 


TAGTGTCAAA 


AAAGACCTCA 


AGCTTATTTG GTCGTGAGCT 


TGGGGTCTTT 


TCTAGCCTAT 


15300 



WO 98/18931 



PCTYUS97/19588 



617 

GATATAGAAC TAGTACTCAA TTCCTTTTTA TTATCCCATA 
CTTTACATTT TCTTCAACCG CTGTACGACA AGACGGTTAA 
TTCTATCAAT TTCATAGAAA TTTTGATTTC GTAAACGAAG 
TCTCATTTAA TACGCCACTA CTAGACAAGC AAAATCATTA 
CAATTAACAG TCACTTACAA TCAAATTGAG TTTGAACTAG 
' TTTCTTAGTC ATATTCGCTA AAAAAATCCC CGCCAAAATC 
CCCCGACCAA AATCCGAAAA ATACCGAAAA ATATCGAAAA 
AAAAATCCTG AAATAGAGCT AAAAAACTCC ACCTGATTCG 
TATGAAAAAG AAAAGTTTAG GATTTTATTA AATAAAGTTA 
TACATGATAC AAGACGAAAC TTAAAACTAG CTTAACTTTT 
AAAAAATTTC TATCACCAGC ACCTCACCAA TCGAGTAGGG 
ACACCACCGT ACGTGCCGTT TGGCATACGG CGGTTCAACT 
CAAGGTAATA ATCCAAACAC GAAACCAGTC CACGTTTTTC 
CACGTTTAAG TACCGACTTC TGAGCTACTA ATTGATAATG 
TATCTGCTAT CCATTTAGGA ACTCCTAACT TAAGCAATCC 
TCTTCCATTG CTTCCAGATA ATCACTCGTA GGCGAGTACG 
CGACTATACT TTTCATATTT CCCAATGAGC AATAGTTTAT 
TCAGTTGCTC AATACGTCTT GTTAGGTCTA TACTCCATTT 
ATTTAAACTT AAATCTCCGA ACACTATCTT GATGTGGACG 
ATTTCCAGAA CCCAAAACCT AGATATTTCA ACTCTCTTGG 
AGCCGTTTCT CAATAAACGA CTGACTGAAT ACATC 
(2) INFORMATION FOR SEQ ID NO: 75: 

fi> SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 8136 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



GTTCACGAAT TTTGTCAAAA 
GATTAAGAGA ACGTTAGGGA 
AGACAATCTT ACATGTCACT 
TTACAGTAGT TCCAGTCCTT 
CTGAAGCGAC CACAGACCTA 
TCAAAAAGTC CCCGCCAATT 
ATTATTTTTA GAATAGTCCC 
GTGGAGTTAA GGGAGATTAT 
GGAGGTCTTT ATTTAATAAC 
CTAAAATTTT ACTATTTTGC 
GATAATCTCT AGCCCCTCTC 
AACTTTTAAC GCATGTCGTT 
CAGGACTGGT TTTGATATAG 
GTCGCCCCAG CCAGATACCT 
CCATAATCGT CTCGATTTCT 
CAAGCGCTCA TCTATGCTGG 
CCATCCTCGA ATAGACAAAT 
CCTCTGTGTT AGTTTCTTCA 
GCTTTTCCAA CCATCTGATA 
TCATGTTTAC TTTCAAACCT 



15360 

15420 

15480 

15540 

15600 

15660 

15720 

15780 

15840 

15900 

15960 

16020 

16080 

16140 

16200 

16260 

16320 

16380 

16440 

16500 

16535 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
CCAGAGCGTT GCGTCCGAAA GTCTATCCAG ACACGGCTCT TTAAAAACAA AAGGAGAAAT 
OATGCATACT TATTTGCAAA AGAAAATTGA AAATATCAAA ACAACCCTAG GTGAAATGTC 



60 
120 
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AGGTGGTTAC 
GGCTATCTGG 
TTTAGGAAGT 
TGGGATGATT 
AAGTAATTAT 
AGGCTTTTAT 
TCTAGTTTGG 
ACTGGACGGC 
TGGCTTCATT 
AACCAATGGG 
GGCGGCTACC 
GAAATATCTA 
TAAGCAGAAT 
TCGATTAAAA 
GGTCCTCTTT 
GCTTTACTCT 
TGGTCTGTGA 
ATGACCAAGT 
GCTAGCACGG 
TTTTTGGAAA 
TGGGCGAGGT 
TAGCGGCAGA 
CGTCCCACAT 
TTCAAAAGTT 
ATGTTAGTCA 
CTTCTATCCT 
ATTATAACAA 
TAAATTTGAC 
TTCTTGCTGA 
TAGAAACAAT 



CGTCGTATGG 
GATGACCTCT 
TTTCCTCTCT 
TGTAGCTTGA 
CTTTTTGGCT 
GGTGAGGTGC 
ATTTATCAGG 
AAGGGCTGGA 
TATCAGTCTA 
GTAGGGCAAA 
AATGTCTTTT 
ATTTATCTCA 
ACTGATTTAC 
CAGATATAGT 
TGTTGTTGAA 
ATATTCAATT 
AATCTTGACC 
GTTGGAGGTA 
AATAGACATG 
ATAGCAGACT 
AGTCCTCGGT 
GCTGGTCCAG 
CTGCGAGATC 
GTATAAAAGC 
TATGGACACT 
TTCTTTGAGT 
AGGAATGAGA 
TTATGCCATT 
CTCTGTGCAT 
CTCCAATCGT 



TTGCGGCTAT 
TTGCCCATCG 
GGCTGGAGTT 
CAGGGATTAT 
TGATTAACTC 
TGACGACACT 
CACAGTTTAA 
CAAAGTATCT 
TTGGTGCCAA 
TCCTCATGAC 
CAATCTATCT 
TTAACAGTCT 
TTAACTAGGA 
TGATAATCAA 
AAGATAAAAA 
TTTAGGAATG 
TTTTTTGAGC 
AGAAGTAGGC 
GTGTTCTAGC 
GGTGATATGG 
TGAAATGGCT 
AAGAAGCTCC 
AATGATATCC 
TTGATAGATG 
TAAGGCAAAT 
TTTAGTGGAT 
AATATGAAGG 
GTTGAGTTTA 
GACTTGGGAG 
GAAGAAGACA 



618 
GGCTGATTTA 



TAGTTTTGCC 
GGTTTACGAA 
CTGTGTAATC 
TGTTATTTAC 
TTACTTCACA 
GAAGGAAAAG 
TTCCATTAGT 
TCGTCCCTAT 
AGCTGTTTAC 
CTGGTGGGGA 
AGTTGGTTGG 
AAAGATGTTT 
GGATTTATAG 
ACTCAGTAAC 
AGAAGGTCTA 
CACCAGGTCA 
AGATTAGGGT 
TCTTTATGGA 
TCTTGGTTTT 
TGCTCTCTTT 
TTACTCTCAT 
TGAACAGTAG 
GCTTTTTTGG 
TGTTCAGAAC 
AATGATAATG 
CAAAATATGC 
TTGCAGGTGG 
ATGCGATTGC 
ATCAGTACAC 



GGATTTTCAG 
CAGTGGATTT 
CATCGTATTG 
TTTGTATCGG 
CTTATTTTGG 
GTCATGCAGC 
CAGGAGTTTG 
GTGCTTTGGT 
CGTGATTCAA 
CGTGAACAGT 
GAAAGCCTGC 
TATCAATGGA 
GAAAGTGCTG 
TATGAAAAAG 
CTAGAAATAA 
GATAAAATTG 
ATGTCTCGAT 
GGGCTTCTTT 
GTTGACGGAG 
TATGAAAATG 
CAAAAAGATG 
AGTGACAGTA 
TGGCCTCGTA 
TTTTGCTGAT 
TGAATAAAGC 
AACAAGGTGT 
TGTTTGGGTG 
AGTATTTGGT 
AATTGGAATA 
CTTGGGCTAT 



GAACTATGAA 
ATTTGCTGGT 
TTGACTGGAT 
AAGGTCGAGC 
CCCTACAGAA 
CAATTGGACT 
TCGCGCGTAA 
GGTTGGCCTT 
TCACAGATGC 
GGATATTCTG 
AAATTCAAGG 
GCAAGGCAGC 
TTTTGAGATT 
AGGATCGGCG 
GACAACTGAA 
GACAACTTCC 
AAAGTTGGAC 
TAAATTATCA 
GAAGTAGTCA 
GAGAAAGAGG 
ATGGAAGAGG 
AAAGGTGGAT 
GCCCTTAGCA 
ACGGCGGTCA 
TGACGTTTTG 
TCATAAATCT 
GCTTTTTTCT 
TCTAGCGCTG 
TCAGCTTTTC 
AAGCGGTTTA 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
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GCCTGCTAGG AGCCTTGGTA ACAGCTGTGA TTCTCGTAAC GGGCTCTGTT CTAGTCATTT 1980 

TGGAAAATGT CACGAAGATT TTGCATCCGC AACCAGTCAA TGATGAGGGG ATTCTCTGGT 2040 

TAGGAATTAT TGCGATTACT ATCAATCTGT TAGCGAGTCT GGTGGTTGGT AAGGGAAAGA 2100 

CAAAGAATGA GTCTATTCTG AGTCTGCATT TTCTGGAAGA TACGCTAGGG TGGGTAGCTG 2160 

TTATCCTGAT GGCGATTGTT CTTCGATTTA CGGACTGGTA TATCCTAGAT CCTCTTTTGT 2220 

CCCTTGTCAT TTCTTTCTTT ATTCTTTCAA AAGCCCTTCC ACGTTTTTGG TCTACACTCA 2280 

AGATTTTCTT GGATGCTGTG CCAGAAGGTC TTGATATCAA GCAAGTAAAG AGTGGCCTGG 2340 

AGCGATTGGA CAATGTGGCC AGCCTTAATC AGCTTAATCT CTGGACTATG GATGCTTTGG 2400 

AAAAAAATGC CATTGTCCAT GTTTGTCTAA AAGAAATGGA ACATATGGAA ACTTGTAAAG 24 60 

AGTCTATTCG AATTTTCCTA AAAGATTGTG GTTTTCAAAA TATTACCATT GAAATTGATG 2520 

CTGACCTAGA AACTCACCAA ACCCATAAGC GAAAGGTGTG TGACTTGGAA CGGAGTTATG 2580 

AGCATCAACA TTAGAAAAAA GTGAAAAATA CTTGGGTACT ATCTTATTTG GAATAGAGTA 2640 

ATTTCTTTAT TATTTAAATA TTTCAAAAAT TGGTAAGAGA AGAGCATTGT ATAAACTCCA 2700 

GATATATGAT TGTTAATGAT AAAAATTTTT CGATTAGATA CAAAATGCTT GACTTGGAGT 2760 

CAACTCAAAG TTATATAATA AGATAAGTGA GTTAGAATAG CGTGAATTCA GTGAATGAAA 2820 

TGAGAGGAGG TTAGCGTGTG AATATTAAAT CTGCCAGTGA TTTGTTGGGA ATTTCAGCGG 2880 

ATACGATTCG GTATTATGAA CGGGTTGGTC TTGTGCCACC GATTACTCGT ACTGCTACTG 2940 

GGATTCGTGA TTTTCAAGAT CAGGATATCG AAGCGCTGGA ATTTATTAAG TGTTTTCGTT 3000 

CGGCGGGTGT CTCTGTAGAT AGTTTAGTTG ACTATATGTC GCTCTACCAA AAGGGAGATG 3060 

AAACGAGAGA GGAGAGGCTT GGTATTTTAG AAGAGGAAAA GCAAAAATTA GAGGAGCGCT 3120 

TGTCTCAGCT ACAGACAGCT TTAAATCGTT TAAATCTCAA AATTAAACTT TATAAGGAAG 3180 

GAAAATTTTA AATGAAATCA GCAGTATATA CAAAGGCAGG TCAGGTTGGA CTTGCTAGCA 324 0 

TTGAACGTCC GCAAATAATA GAAGCGGATG ATGTGATTAT TCGTGTGGTT CGTGCGTGCG 3300 

TTTGTGGTTC AGATTTATGG AGGTACCGTA ATCCAGAAAC GAAAGCTGGA CACAAAAATA 3360 

GTGGACACGA AGCGATTGGG ATTGTTGAAG AAGCTGGGGA AGCCATTACG ACGGTGAAAG 3420 
CAGGTGATTT TGTGATTGTC CCTTTTACAC ATGGATGTGG TGAGTGTGAT GCCTGTCTTG . 3480 

CTGGATTTGA CGGTTCTTGC GACAATCATA TTGGCAATAA TTTGGGGGGT GATTTTCAGG 3540 

CAGAATATAT TCGCTTCCAC TATGCAAACT GGGCGCTGGT TAAAATCCCT GGTCAACCTT 3600 

CTGACTATAC AGAAGGGATG CTCAAGTCCC TTTTGACTCT TGCAGATGTC ATGCCGACAG 3660 
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GCTATCATGC GGCGCGTGTT GCAAATGTTC AAAAAGGGGA CAAGGTTGTT GTTATCGGTG 3720 

ATGGGGCTGT TGGTCAATGT GCTGTCATCG CGGCTAAGAT GCGTGGAGCA TCACAAATTA 3780 

TCCTTATGAG CCGTCATGAA GACCGTCAAA AGATGGCTAT GGAGTCAGGT GCGACAgcTG 3840 

TTGTTGCAGA ACGTGGTCAA GAAGGAATTA CCAAGGTGCG TGAAATCCTC GGTGGAGGAG 3900 

CAGATGCAGC ACTTGAATGT GTTGGTACGG AGGCTGCTAT AGAACAGGCG CTAGGTGTTC 3960 

TTCATAATGG AGGGCGTATG GGCTTTGTAG GAGTCCCACA CTATAATAAT CGTGCTCTTG 4020 

GTTCGACATT TATGCAAAAT ATCTCTGTAG CAGGTGGGGC AGCTTCTGCT ACAACATACG 4080 

ATAAGCAATT TTTACTAAAA GCCGTCCTTG ATGGTGATAT CAATCCAGGT CGCGTCTTTA 4140 

CTTCAAGTTA TAAACTGGAA GATATCGACC AAGCCTATAA AGATATGGAT GAACGTAAGA 4200 

CAATTAAGTC TATGATTGTA ATCGAATAAA AAACGAATAG GAGTTTTAGA ACTCTATTCG 4260 

TTTTTTATGT TATCCTATTC TTGATTTAGG GTACTTTCTC TTAATGTCAG TCTGGTTCCC 4320 

AGCATGGTCA GGCTAGGGAT TTTCCGACCG TGGAGGACTT CCTTGTTAAG AATATCCATA 4380 

CCTGCTCGGC CCATTTCTTC AGTATAAACT GTAATACTAG AGAGGGGAGG ATAGACCTGT 4440 

TTGGTCAGAC TAGTGTCGTT AAAGGAAATG AGGCTGACGC GATCTGGCAG GCTGATTCCA 4500 

GCTTCTTGGA GGGCACGGAG GGCACCGATA GCTAAACTAT CGCTGGCTGC GAAAAATGCT 4560 

GGCGGAAGTT GGTCTCCCAA GCTCTGAATG GCCTCCTTCA TTAAGTCATA GCCAGACTGG 4620 

GCAGTAAATC TTCCTTGAAA GACCAGTTCA TCATGATAGA TTCCCCTCGC TTGACTATAG 4680 

TTTTTGAAGT TTTCTAGACG CTTGTCCTGA ATGATTTCTT CTTGGTCTGT TGTTTCTTCA 4740 

AGGCCTGTTA GAATCCCGAT ACGGTCCATT CCTTGACTGA GGAAATAATC GACAACCTGT 4800 

TTCATAGCAG TGTAAAAATC CGTGATAATA CAGGTATGTC CCAGGGAAAG TGTATCGCTG 4860 

TCTAGAAATA CAAGAGGCTT TTGGTATTCT TCAAAGGCAG AAATCTGAGC TCGACTAAAC 4920 

TTTCCGATGC AGAGAATCCC AATCACTTCC TCGCTTAGGG TAAAAGGGTG GTCATTAAAA 4980 

TAGCGCAAGA TATCATAGTC CAACTCTTGG GCTCTTTTTT CTATTCCTAG GCGAATCTGG 5040 

TAGTAGTAGA GGTCGTCCAG CTCCCCTTGT TCGCTGACCC ATTGGATAAT GGCAATCTTT 5100 

TGCTTGGGTT TGTGGGACTC GCCTGTCTTG AGGTGCTTGG TGTAGCCCAG CTCTTCAGCA 5160 

ACGGTTAAAA TACGGTGTCT GGTTTCTTCT GTAACAGATA GGCTCTGGTC GCGGTTGAGG 5220 

ACGCGGGATA CGGTCGCGAT AGAGACAGAG GCTAGCTGTG CAATGTCTTT TAAGGTAGCC 5280 

ATAAATCCTC CTTGATTAGG TTAGTATATC ATGTTTTTCT TCTTTTTACT GATATTTTAC 5340 

TAAAATTTTA GTAAAAAGGA TTGACCTTGG AAAATTCCTT GGATATAATA GAAAGAAAAC 5400 

GATTACACGT TAAGATGGCT TAACGGACAG TCAAAGGAGA ATTCATATGG CACAACATCT 5460 
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TACTACTGAA GCCCTTCGCA AAGACTTTCT TGCTGTTTTT GGTCAAGAAG CAGATCAAAC 5520 

CTTCTTTTCA CCAGGCCGCA TTAATTTGAT TGGTGAACAC ACAGACTACA ACGGTGGGCA 5580 

CGTTTTTCCT GCTGCTATTT CCTTGGGAAC TTACGGTGCA GCTCGTAAGC GTGACGACCA 5640 

AGTCTTGCGT TTCTACTCAG CTAACTTTGA GGACAAGGGC ATTATCGAAG TGCCTCTCGC 5700 

TGACCTCAAG TTTGAAAAAG AGCACAACTG GACCAATTAT CCAAAAGGTG TCCTTCATTT 5760 

CTTGCAAGAA GCTGGGCACG TGATTGACAA AGGTTTTGAT TTTTATGTTT ATGGAAATAT 5820 

TCCAAATGGT GCTGGCTTGT CTTCTTCTGC ATCCTTGGAA CTCTTGACAG GAGTCGTGGC 5880 

TGAGCATCTC TTTGATTTAA AATTAGAGCG TCTCGATTTG GTTAAAATCG GCAAACAAAC 5940 

AGAAAACAAC TTTATCGGAG TAAACTCTGG CATTATGGAC CAGTTTGCTA TTGGTATGGG 6000 

GGCAGACCAA CGTGCTATTT ACCTAGATAC TAATACTTTA GAATACGACT TGGTGCCACT 6060 

TGATTTGAAG GACAATGTCG TTGTTATCAT GAACACCAAC AAACGCCGTG AATTGGCGGA 6120 

CTCTAAATAC AATGAACGTC GTGCTGAGTG TGAAAAAGCA GTGGAAGAAT TGCAAGTTTC 6180 

CTTGGATATT CAGACTCTGG GTGAATTGGA CGAGTGGGCC GTTGACCAAT ATAGCTATCT 6240 

GATTAAAGAT GAAAATCGTT TGAAACGTGC TCGCCATGCT GTGCTTGAAA ACCAACGTAC 6300 

CCTCAAAGCT CAAGTAGCAC TCCAAGCAGG AGATTTGGAA ACATTTGGAC GCTTGATGAA 6360 

TGCGTCACAC GTTTCTCTGG AGCATGATTA TGAAGTAACT GGTTTGGAAT TGGATACCCT 6420 

TGTTCACACA GCTTGGGCAC AAGAAGGAGT TCTCGGTGCT CGTATGACAG GGGCTGGTTT 6480 

TGGTGGCTGT GCcATTGCCT TGGTTCAAAA AGATACTGTT GAGGCCTTTA AGGAAGCTGT 6540 

AGGCAAACAC TACGAGGAAG TAGTTGGATA CGCTCCAAGC TTCTATATCG CTGAAGTTGC 6600 

AGGTGGCACT CGCGTCCTTG ACTAGTCAAA AGGAGGCTCT ATAGTGACCT TAGTAAATAA 6660 

ATTTGTAACA CATGTCATTT CTGAAAGCTC ATTTGAGGAA ATGGATCGAA TCTATCTGAC 6720 

CAATCGTGTT TTGGCACGAG TGGGAGAAGG TGTTTTGGAA GTTGAGACCA ATCTGGATAA 6780 

ATTGATTGAC CTCAAGGACC AGCTGGTTGA AGAAGCCGTT CGATTAGAGA CGATTGAGGA 6840 

TAGTCAGACT GCGCGTGAAA TCCTTGGTGC TGAACTGATG GATTTGGTGA CTCCTTGTCC 6900 

AAGTCAGGTC AATCGTGATT TTTGGGCAAC CTACGCCCAC TCTCCAGAAC AAGCGATAGA 6960 

GGATTTTTAC CAACTCAGTC AGAAAAATGA CTACATCAAA CTCAAGGCCA TTGCTAGAAA. 7020 

TATCGCTTAT CGTGTTCCAT CTGACTACGG AGAACTTGAA ATTACCATCA ATCTCTCTAA 7080 

GCCTGAAAAA GATCCCAAAG AGATTGTGGC AGCCAAGTTG GTGCAAGCTA GTAATTATCC 7140 

TCAGTGTCAG CTTTGTCTAG AGAATGAGGG CTACCATGGT CGAGTTAACC ACCCAGCTCG 7200 
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TAGCAATCAC CGTATTATCC GTTTTGAAAT GGTTGGTCAG 
GCCCTATGCT TACTTTAATG AGCATTGTAT CTTTTTAGAT 
CATTAGTCGT CAGAGTTTTG AACGTCTGTT GGCTATCGTA 
TGCTGGATCT AATGCCGACC TGCCGATTGT GGGGGGCTCT 
TCAGGGAGGC CGTCACGTAT TTCCTATGGA ATTGGCTCCC 
TGCTGGTTTT GAGCAGGTCA AGGCTGGAAT TGTCAAGTGG 
GACTTCGGAT TCCAAAGAGG ATTTGATCAA TTTGGCTGAT 
CCAGTATTCA GATCCTGCAG TGCAGATTTT GGCAGAGACA 
TATCACACCC ATTGCCCGCA AACGCGATGG ACAGTTTGAG 
CAATCAGACT TCAGCAGAGT ATCCTGATGG TATCTATCAT 
TATCAAGAAG GAAAATATCG GCTTGATTGA GGTCATGGGC 
TCTGAAAGAA GAAGTGGAGC AAGTCGCTAG CTATCTTGTA 
CGATTATCAT CAGGAGTGGG CAGACCAACT CAAATCCCAA 
GAAAAAGCCC TTGCAATCGT CAAGGACTCT GTGGGTGCTA 
GATGCAGGAG TCTACAAGCA GACAGAACAA GGGCAGACAG 
CAGGTCGGAA TTTTACTAGA CTAGGAGCTT TCTCGG 
(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10011 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



GAATGGGGTT 
GGCCAGCATC 
GACCAGTTTC 
ATTCTAACTC 
TTGCAAAAGG 
CCCATGTCTG 
AAGATTTTGC 
GACAGGACAC 
TTGGACTTGG 
CCCCACAAGG 
TTGGCAATCT 
GGAGAAGCTG 
CATCCAGACT 
TCTTTGCGCG 
CCTTTATGCG 



TCCAGTATTC 
GTCCCATGGC 
CAGGATATTT 
ATGATCATTA 
CCTTCCGATT 
TCCTACGTTT 
AGGAATGGCG 
CGCATCACAC 
TCTTGCGAGA 
ATGTCCAACA 
TGCCACCACG 
TTACAGTTGC 
AACGGATAAA 
TGTACTTGAG 
CTTTGTGGAA 



7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8136 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 76: 
CCCATAGTGA AGAGTGGCCA TAAGAAGGTC TTCTAGGCTT AATTTAGGTT 
TTTTGCGTGT TTAAGTTGAT AAGCTGTTTT TAACACAGCT GAACATCTCT 
TGCGCTGAAC ACCAACAAGA CATTTAAATC GTGTATCAGT TAGTTGTTTA 
CATTCATAGA ACTACTATAC CATGTTTTGT TTCGCAGGAA GTCTAATATT 
GGAACGCTCA TTGCTGGGAT ACGGAATAAG ATTGGCCCAG CTTCGATAAC 
GGTTCAAAAC CAAGGTCTGT TGCAGCGATT GGTGTAAAGA TATCGTAACC 
TCTTCGTTTA CATCTTTCAC CATAACTGCA TCACAGTGAA CATCGTAACC 
AGTTCTTCTT CTAGAGCACT TTTAATTTGG TGACTTGAGT TAACACCTGC 



TTCGTCCACC 
TCAAAAGTCG 
CTTGCTTCAT 
GTCAAATACT 
TGGGATACCT 
TTTCATAAGG 
ACGGTTTGAA 
ACCGCAGGCA 



60 
120 
180 
240 
300 
360 
420 
480 
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GCAAGAATTT TAATCATTTG GATTTCCTCC GATTTTATTT TTTAATAGAC AAGATTAAGC 540 

GGTTGCTTCA GCAATGTAAG CATAAAGGGC TTCTGGTTCA GAAATTTTTG ATAGGTCTTC 600 

AAGATGACCA TTTCCTGTGA AGAAGTCCAT TAACTGAGCA AGAATGTTCG TTTGACTTGA 660 

ACTTGAATTA TTGATGATAA AGAAGAGCAA GGATACTTCT ACTTCCTTAC CTGGCGCAAT 720 

CATATTATGG AAAGTCACCG GTTTCTCTAA TCGAACAACC ACCACTTTCT CAGCTAGATT 780 

ATGAACAATA TCTGTGTGAG GAATCATTAC ATTTGCAAGT CCTTTCCTAG AAATTCCATA 840 

TATAAACCAG TTGGAAATGA CTTTTCACGC GTGATCAAGG CTTCACGATA AGTTGGAGTG 900 

ACAATTTCTC GTTCTTCCAA CAAGCTTGCT ACCTGATCAA AAAGTTATTC TTGATTATCC 960 

GCTTCTAAGC AAAACACAAG GTTTTTGTCA AAGAAATAAT CTAATACCAT AAGGTTTTCC 1020 

CTTCTTTCCA TTAACTTTAT GCTATAAGTA TAACACTATA TGAAATCGTT GTTAATTACT 1080 

TTCTATTCTT TTTTGTCTCT TTTTTTATAT TTTTGTTTTG TTTATAGTTT GTTATATAAA 1140 

AATAAACACA CAAACAAATA CTCCAAGCAT TTTTCTGTTC TAATACTCAA TGAAAATCAA 1200 

AGAGCAAACT AGGAAGCTAG CCGCAGTTGT TCAAAACACA GTTTTGAGGT TGTAGATGAA 1260 

ACTGACGAAG TCACTCAAAA CATGGTTTTG AGGTTGTAGA TGAAACTGAC GAAGCAACAg 1320 

CCATACATAC GGTAAGGCGA CGCTGACGTG GTTTGAAGAG ATTTTCGAAG AGTATAAAAA 1380 

CTAAAAAAGC AGACCATCTA AGCCTGCTTT ACTATTGATT CTTATATAAA TTTCCTGTGA 1440 

ACAAGGAAAG GCATTTCTGA TAACTTATTC TTCATCCATA CTCAAGACGC TGAGGAAGGC 1500 

TTCTTGCGGA ACTTCAACTG ATCCGATGGA TTTCATGCGT TTCTTACCAG CTTTTTGTTT 1560 

TTCAAGGAGT TTACGCTTAC GAGAAACGTC ACCACCATAA CATTTAGCAA GTACGTTCTT 1620 

ACGAAGGGCC TTGATATCAG TACGAGCGAC AATCTTGTGT CCAATAGCCG CTTGGATTGG 1680 

AACTTCAAAT TGTTGGCGAG GGATGATTTT CTTGAGTTTA TCAACGATGA GTTTCCCACG 1740 

TTCGTAGGCA AAGTCCTTGT GAACGATAAA GCTGAGGGCA TCCACCTTAT CTCCATTGAG 1800 

AAGAATATCC ATTTTCACCA GCTTAGATGG GCGATATTCT GACAATTCGT AGTCAAAGCT 1860 

TGCATAACCA CGTGTCGAAG ACTTAAGTTT ATCAAAGAAG TCAAAGACAA TTTCAGCAAG 1920 

AGGAATTTGA TAGATAACAT TGACACGGTT ATCATCAATA TAGTCCATAG TCACAAAGTC 1980 

CCCACGCTTA CGCTGAGCTA GCTCCATTAC TGCTCCGACG AACTCCTGTG GTACCATGAT - 2040 

TTGCGCCTTG ACATAAGGCT CTTCAATGGT CGCAATCTTA GTTGGGTCTG GAAACTCAGA 2100 

TGGGTTAGAC ACATCCATAG ACTCACCGTC GGTCAAATTA ACTTTGTAAA TAACAGACGG 2160 

AGCTGTCATG ATGAGGTCAA TATTGAACTC ACGCTCTAAA CGTTCCTGGA TAACATCCAT 2220 
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ATGGAGAAGT CCAAGAAATC CACAACGGAA ACCAAATCCA AGTGCCTGAG ATGTTTCTGG 2280 

TTCAAACTGA AGACTAGCAT CATTCAGTTG CAATTTTTCA AGcGCTTCAC GCAGGTCATT 2340 

GTACTTGTTT GATTCGATTG GGTAGAGACC CGCAAAGACC ATAGGATTCA TCTGCTTATA 2400 

ACCATGTAAT GGTTCTGCCG CAGGATTGGT TGCCAAGGTA ACGGTATCAC CCACACGAGT 2460 

ATCCTGAACC GTCTTGATAG ACGCCGCAAT GTAACCAACA TCACCAGTCG CAAGGAAATC 2520 

ACGACCAACC GCTTTTGGTG TAAAAATACC GACTTCGGCC ACATCAAAGG TCTTACTATT 2580 

GCTCATGAGC TGAATCTTAT CACCAGGTTT GACCACTCCG TCCATGACAC GCACTTGGAG 2640 

GATAACCCCA CGGTAAGCAT CGTAAACAGA GTCGAAAATC AAGGCCTTAA GTGGCGCCGT 2700 

CACATCACCC GTTGGTGCTG GTACTTTTTC TACAATTTGC TCGAGGATTT CTTCAATCCC 27 60 

AATACCAGCC TTGGCAGAAG CCAAAACTGC TTCACTGGCA TCCAAACCAA TCACATCTTC 2820 

AATCTCTGTA CGCACGCGCT CCGGATCTGC AGCCGGCAGG TCAATTTTAT TAATGATAGG 2880 

CATGATTTCC AAATCATTAT CCAAAGCCAG ATAAACGTTG GCAAGAGTTT GAGCCTCAAT 2940 

TCCTTGAGCC GCATCGACCA CCAAAATAGC ACCCTCACAG GCAGCTAGCG AACGTGAAAC 3000 

TTCATAGGTA AAGTCAACGT GCCCTGGTGT GTCAATCAAG TGGAAAATAT AAGTTTCCCC 3060 

ATCTTTTGCA GTGTAATTCA ACTCGATGGC ATTCAACTTA ATAGTAATTC CACGTTCCCG 3120 

CTCTAGCTCC ATGCTATCCA AAAGCTGGGC CTGCATTTCA CGACTTGAAA CCGTCTCTGT 3180 

TTTTTCCAAA ATGCGGTCTG CTAGAGTTGA TTTTCCGTGG TCAATATGGG CGATAATAGA 3240 

GAAGTTACGG ATCTTCTCCT GTCGTTTTTT CAATTCTTCT AAGTTCATGA TTCTCTTCCT 3300 

TTCAGGGTAT CTATTTATTA TAAATTGTTT TTGATATTTT GACAAGACCA TACCCTGCTA 3360 

GGAGTACTAA TCTTCAGCGA CAAAGCCGTC ATTTTCGATA AAGTGGTGTT CTGTCATTCC 3420 

TTGGTCTGTA AAGACAATCC CGTGAAGGAC ACCACCATAA ACAGCTCCTC CATCCATTCC 3480 

AATCTTGCCA TCTTCTGTAG TCCAAAGCTC AGATGTACCG CGTTCTTGCT GTAACAAACC 3540 

ATAGACCGGT GTATGACCGA AGACAATGGT TTTTCCAGTA TGATTTTCAG CTCCGTGGAA 3600 

TGGTTTTCTA AGCCATACTT TTTTATAATC TGTTGTTTCA TGCCAGTCGT CCAAGGTCAA 3660 

ATCAATACCT GCGTGAACAA AGATATACTT GTCTGTCTCT ACTACAAATG GCATTTGACG 3720 

AATGAATTCG ACCAAGTCTG CCGCTTCAGC GgCAACCCGC TTGGCATCTT CTACTCCATC 3780 

AACTGGTGCA TCCAAGGGAC GACCTAGGAT AGAGTTAATG GTTGTATCTC CACCATTGCG 3840 

ACTATAATGG TCATAACTTT CTTCTGGGTC ATCTAGCCAA GTCAAAAACA TATACTCGTG 3900 

GTTTCCGGAC AAACAGATAG CCCCTTGATT GTCCACCAAG TCCTTGACCA TTTCAAGAAC 3960 

ACGGTGACTA TCCTCACCTC TGTCAATCAA ATCACCTAGA AAGAGCAACT GGGGCTGACC 4020 
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ATCCCAGGTT TTGAGAAGGT CTTCCAGCAT CCCAGCTTTT CCGTGAACAT CTCCAATTAC 4080 

ATAATAATCT GTCATCTTAT TTCTCCCTGT TTCTCAACAA TTCTCTTGCT TGCGTCAGGG 4140 

CTGCTTCTGT CACATCATCA CCTGCCAACA TCTTGGCAAC TTCCTCCACT CGCTCTTCGA 4200 

CCGTCAAGAG ACGAACAGTC GAAACCGTTG AATGGTCATT ACTAATCTTC TCAATAAAGA 4260 

ATTGATAATC TGCAATCGCA ATTACTTGTG GCAAATGGGA GATAGCCAAA ACCTGACCAT 4320 

GCTGACCAAT TTTATGAATT TTCTGAGCAA TAGCTTGAGC AACACGACCT GAAACTCCCG 4380 

TATCCACCTC ATCAAAGACA ATGCTAGTCT TGCCTTCTTT ACGTGAAAAG GCAGACTTAA 4440 

TGGCTAACAT GAGACGAGAT AATTCCCCTC CAGAAGCAAC CTTAACCAAG GGTTTAAAGT 4500 

CTTCTCCAGG GTTGGTTGAA ATATAAAACT CAACCATTTT ATTTCCCTCA CGACTGAATT 4560 

TTCCCTTACT AAAACGAACC TGAAACTGGG CTTTTTCCAT ATAAAGATCT TGCAGTTCTT 4620 

GTTTAATCTC AGCTTCGAGT TGCTGAGCCA AATTATGACG AGCAGAAGCA AGTTGACCTG 4680 

CCAAATTGAC AAGATTGACT TCCAACTTCT TAAGCTCTGC TTCCATGTCC TCAGACGAAA 4740 

GATTATTGCC TGTCAAGAGA TTGTATTCTT CCGTAATCTT GGCAAAATAA AGCAAAACAT 4800 

CATCAACAGT CCCACCATAC TTACGAGTAA TAGTATGAAG GAGGTCCAAA CGATTCTCAA 4860 

CCTGCATCAG GCGATTGCCA TCAAAATCAA GGTCCTCAAT GATAGCTTCC AAACGTTTGC 4920 

TAATGTCTTC TAAAACATAG TAGGTCTCAG ACAGATAGCT TGAAATTTCA CGGTATTCAG 4980 

GATCATACTC TTCGACACTT TCCATGTCAT TCATAGCTGA ACGAACATTG GCCAGACTTG 5040 

AAAAATCTTC ATTGTCCAAC ATACTGTAGG CATTGGTCAG TGTATCCGCA ATATTTTTGT 5100 

GGTTGAGGAG TTTATCTCGC TCTTGATTGA GAGCCAAGTC TTCTCCAGCC TGCAAGTTTG 5160 

CTGCCTCAAT CTCTGCCATT TGAAATTCCA ACATTTCGAT ACGTGCCTTG TGTTCCTGTT 5220 

GGTTTTTCTT GACTTCCAGA ACCTGCTTGC GCATTTTCCG ATAGGCATCA AAACTCGTTT 5280 

GATAGGTTTC TTTCAAGTCC CAAAAAGCGG CATCACCAAA TTCATCCAAC ATCTGGATAT 5340 

GCAGTTGGGG ACGCATTAAC TCCTCATGGT CATGCTGACC ATGAATATCT ACAAGATGTT 5400 

GCCCAATAGC TCGCAAAACA GACAGATTAA CCATCTGACC ATTTACACGG CTGATACTAC 5460 

GACCATTTTG CAAGATTTCC CGACGGATGA TAATTTCATC ACCTAATTCT AAACCTTGCT 5520 
CATCAAAAAT TTCCTGTAAA AGACGACTAT TCTCAACTGA GAAAAGCCCC TCAATCTCTG. - 5580 

CCTTTGGTGC ACCATGACGA ATAACATCTG TCGTCGCACG AGCTCCCAAC ATCATATTCA 5640 

TGGCATCAAT GATAATCGAC TTCCCTGCAC CCGTTTCACC AGTCAGGACA GTCATCCCCT 5700 

TTTCAAAATT GAGGGAAATA GCCTCAATAA TGGCAAAGTT TTTTATCGAA ATTTCAAGTA 5760 
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ACATATAGAC CTACCAATTT TTTACTTGTT CAAAGATTTC CTCTGCTAGA CTTCCACTTC 5820 

TGGCAATGAC TAAAATCGAG CTATCATCAG TCAAACAGCT AAAAATCTTG TCTGCAAAAG 5880 

TCTCGATTAA CTGAGCTTTT ACAAAAGCCG TATTTCCTGG AATAACTTGG AGATTGATCA 5940 

TCTTATCCAT CAATTCAGCC GATTCGATAT TGTCTTCAGC CAGTTGCAGA CTTTTTACGA 60O0 

TTGATTTTGG CAATTCGTAG ACATAGGTGT TGTCTCTCAA AGGAATTTTG ACAATACCTA 6060 

ACTCTTTGAT ATCTCGGGAT ACCGTCGCCT GAGTGGCAGT GATACCTGCT TCTTTCAAAT 6120 

GTTCTACAAT TTCTTCTTGC GTGCCGATTT GATAATCTGT CACCAATCTT CTAATTTTTT 6180 

CAAGTCTCTC TTTTTTATTC ATTTTTAAAT TGACTATGCG CCCTCTCTAC TGCTTCTTTA 6240 

ATCTCAGCAA GAATCTGATT GCTTGCTGAC TTTTCTTTTT TCAAATACGC TAAAAATTCA 6300 

ATATTTCCAT GTCCACCTTG GATGGGAGAA AAGTCCAAGC CAAGGACTGA AAAACCTACC 6360 

TCTACTGCCA TAGCTGTTAC AGATTCAAGG ACATTCTGAT GAACCTTAGC ATCTCGAATA 6420 

ATTCCATTTT TCCCAATCTG CTCACGTCCT GCCTCAAACT GAGGTTTGAC AAGTGCTACC 6480 

ACCTGACCTT GATCAGCCAA GACACGGTGC AAGGCTGGCA AAATCAGACT AAGGGAAATG 6540 

AAACTCACAT CAATACTGGC AAAGCTCGGC TCCTGCTCGA AATCAGTCTT TTCAGCATAG 6600 

CGGAAATTGA ACTGCTCCAT GCTGACAACT CGTGGGTCTT GGCGTAATTT CCAAGCCAAC 6660 

TGATTGGTAC CAACATCGAC TGCAAAGACC AACTTGGCAC TATTCTGTAG CATGACATCG 6720 

GTAAAACCTC CAGTAGAGGC CCCGATATCA ATCGTAGTCG CGCCATCCAC CGACAAATCA 6780 

AAGACCTGCA AGGCCTTTTC CAGTTTCAAA CCACCACGGC TGACATACTT GAGTTTCTCC 6840 

CCCTTGAGTT TTAATTCGGT GTCATCTGGA ATTTTCTCTC CTGGCTTGTC AAACCGTTCT 6900 

CCATTAAGGA CTGCTACGAC TAGGCCAGCC ATCACACCTC GCTTGGCCTG CTCTCTCGTT 6960 

TCAAACAACC CCTGTTTATA AGCTAGTACA TCCACTCTTT CCTTAGCCAT TGATTCTCAA 7020 

ACTTTCTACT ACACTTACAA TCGATTCTGT TTCAAAGGGA AGCTGCTGGG CAATTTCTTC 7080 

TAATTTTTCA TTAGCTTGAT CCAGGGTTTG GTTACAAAAG GCAATGGACT CTTCCAAGCC 7140 

CAACAGGGCA GGATAGGTTG ATTTTTCTGC CTGCAGATCC TTTTGAGGTG TCTTGCCGAT 7200 

TTCCTCAAAA CTAGCTGTCA CATCCAGTAC ATCATCTCTG ACTTGAAAAG CAAGTCCAAT 7260 

CAATTCACCC ACAGTTTTCA GCTTCACCTG CATTTCAGGT GACAATTCAG CTATAATAGC 7320 

TGCCGCTTGG AAGGGATAGG CTAGTAACTT CCCAGTCTTA TTGGCATGAA TAGTCTGAAG 7380 

TTCTTCCAAA GACAAGTGCT GGTGTTCGCC CTCCATATCC AAAACTTGCC CTGCTACCAT 7440 

ACCCAGACTA CCTGAAGCAA GGGATAAGTT GGCAATCAAG TCCACCTTAA TCTGACTTGG 7500 

CAAATCTGCC TGCGCAATCA AGGCATATGA GTCTAAGAAT AAGGCATCTC CAGCCAAAAT 7560 
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GGCCATAGCT 


TCACCGAATT 


TCTTGTGATT 


GGTTAACCGC 


CCTCTTCGAT 


AATCGTCATC 


7620 


ATCCATAGCA 


GGAAGGTCAT 


CGTGAATCAA 


GCTCCCTGTA 


TGAATCATCT 


CTAAGGCAGT 


7680 


AGCTACCTGC 


GCGTGAGCAG 


GTTTGATGGT 


AACCTGCAAG 


GCTTCCAGAA 


CTTCTAACAA 


7740 


GAGAAAAGGC 


CGAATACGCT 


TGCCACCAGC 


ATGAATAGAA 


TAGAGAACAG 


ACTCCCGTAA 


7800 


ACTAGAGGCA 


AACTGCTGGT 


CTCCATAAAA 


ATCTTCCAAA 


GCCGACTCGA 


CAAGAGCTAA 


7860 


TTTTTCTTGC 


TTTTTCATTC 


AAAATCACTT 


TCTGTTCCGT 


CTTCTTGCAT 


GACCTTGACC 


7920 


AAGGTCTTTT 


CAGCCTTGTC 


CAGCGTAGCT 


TGGAGCTCTT 


TTGACAAGAC 


CATGCCCTTT 


7980 


TGAAAGGCAG 


TAATCGCATC 


TTCCAGAGCA 


ATTTCACCAT 


TTTCCAAACT 


TTGGACAATG 


8040 


GTTTCCAGTT 


CTGCTAGATT 


TTCCTCAAAT 


TTCTTTTGTT 


TTGACATCTT 


TAACCTCTAA 


8100 


TTCTACTTGA 


CCATCTCGCA 


TCAAAAGCGT 


TACTTGGTCT 


TTTTTCTTCA 


AACTCTCAAC 


8160 


CGAATCTACA 


ACGGACTCTT 


CTTTTTTGAC 


AATAGCATAA 


CCACGCGCCA 


CGATTCGGCT 


8220 


AGTATCCAAC 


ATGAGCAAAG 


CTTCCGAAAG 


TCGCTTGGCC 


TCAGCAACCT 


TGGCGTCATA 


8280 


AACTAACGCC 


ATTTGGCTAC 


CTAAGAGCTT 


GTCCAACTGT 


CCTAAACGGT 


CTTGATAGCG 


8340 


TTGGATTTTG 


GTAACAGGTG 


ATAATTGTAC 


TAATTGATGA 


GTTCTTGCTT 


GAACTAATTG 


8400 


TTTGTTATCA 


GAAATCCGAG 


TTCGCAAACT 


TTGTTTCAAA 


CGCAGTTGCA 


GTTGGTCCAA 


8460 


GCGTTGCAAA 


TAACCGTCAT 


ACAAGCGCTC 


AGGTTGTCTA 


AAGATAACAG 


ACTGACTGCA 


8520 


TTTTTTCAAA 


GCCTCTTGTT 


TCTTAGATAG 


AACATTTCGG 


ACTGCCGTTA 


CCATCCGTTT 


8580 


TTCCTGATTT 


TGCAAATGAG 


CTAATACATC 


CAACTTGGTC 


ACAGGTGTTG 


CCAGTTCAGC 


8640 


CGCCGCTGTT 


GGCGTTGCAG 


CGCGTCGATC 


TGCCACAAAA 


TCTGCCAAGG 


TCACATCCGT 


8700 


CTCATGCCCC 


ACACTAGAGA 


TAACTGGCAA 


ACGAGATTCA 


AAAATAGCTC 


GTACCACAAT 


8760 


TTCTTCGTTA 


AAGGCCCAGA 


GATCCTCAAT 


AGAACCACCT 


CCACGACCAA 


TAATGAGCAA 


8820 


ATCCAAATCG 


TCCCGTTGAT 


TAGCACGCGC 


AATATTTCTA 


GCAATTTCCT 


CCGCAGCCCC 


8880 


TTCACCTTGA 


ACCTTGGTCG 


GATAAAGAAG 


GATGTCAACA 


CCTGGGAATC 


GCCTGCTGAC 


8940 


GGTCGTGATA 


ATATCTCGAA 


TAACGGCTCC 


ACTACGGCTG 


GTTACTACAC 


CAATTCTCTT 


9000 


AGAAAATTGG 


GGCAGAGCTT 


GCTTGAAGCG 


TTCTTGAAAC 


AGGCCTTCTT 


CTGTCAATTT 


9060 


TTTCTTAAGT 


TGTTCAAACT 


GAATCGCAAG 


CGCCCCAACC 


CCATCAGGCT 


CAGCTTTTTG - 


9120 


AATGATGATG 


GAGTAGCTAC 


CACTTGGTTC 


ATAGACCTGT 


ACACGCCCAA 


TCACATTGAT 


9180 


CTTCATTCCT 


TCTTCCAGGT 


CAAACCCTAA 


TTTCTGATAA 


ATCCCAGACC 


AGATGGTCGC 


9240 


TTGAATAACT 


GCATGGTCAT 


CCTTTAGGGA 


GAAATATTGG 


TGAGTAGGTC 


GTTTACGAAA 


9300 
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GTTGGAAACT 


TGACCAGTTA 


AATAGACCCG 


TTCCAAGTAT 


GGGTCTTTAT 


CGAATTTCAT 


9360 


TTTCAGATAC 


TTGGTCAAAG 


TTGTTACCGA 


TAAATACTTT 


TCCATCTCCA 


CCTACTATTC 


9420 


ATTTACTTGC 


TCTTTCATGG 


GTATTATTAT 


ACCAAAAATA 


TGCCTAAAAA 


TCTCCATTTA 


9480 


TGTACCATTA 


TGAGGGAAAA 


ATAGAAAAAG 


GAGGCAAGGC 


CTCCACATGT 


GATTATTTGC 


9540 


TGTTTCGAGC 


TTCTTCCAAA 


ATCTTTGCAA 


TCTTGGTCGT 


CAACAGGTCG 


ATAGCCACGG 


9600 


TATTGCTAAC 


CCCTTCAGGA 


ATGACGATAT 


CAGCATAACG 


CTTAGTTGAC 


TCGATAAACT 


9660 


GGTGGTACAT 


TGGTTTGACC 


ACACCTAAGT 


ACTGGTTAAT 


AACGCTATCA 


AGGCTACGGC 


9720 


CACGCTCCTC 


CATATCACGC 


TTGATACGAC 


GAATAATGCG 


CACATCGTCA 


TCCGTATCCA 


9780 


CAAAAATCTT 


GATATCCATC 


AAATCGCGCA 


GACGCTTGTC 


CTCCAAGACC 


AAAATACCCT 


9840 


CAACGATAAA 


GACATCTTGA 


GGTTCCTGAC 


GATAGGTCTT 


GCTACTCCGT 


GTATGCTCTG 


9900 


TATAGTCGTA 


GGTCGGGATG 


TCCACCGGAC 


GCCCTGCCAA 


CAATTCCTTA 


ATCTGCTCGA 


9960 


TCATCAAGTC 


TGTATCAAAG 


GCAAAAGGAT 


GGTCATAGTT 


GGTTTTGACG 


G 


10011 



(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5365 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

CGTGTGGTCT TAAAAATAGA AGACAAAGAA CAAACTGTTG GAGGCTTTGT CCTTGCAGGC 60 

TCAGCCCAAG AAAAAACCAA AACAGCTCAA GTTGTGGCTA CTGGACAAGG TGTTCGTACC 120 

TTGAACGGTG ACTTGGTTGC TCCAAGTGTT AAAACTGGAG ATCGTGTCTT AGTTGAAGCC 180 

CACGCAGGTC TTGATGTCAA AGATGGCGAT GAAAAGTACA TCATCGTAGG CGAcTAACAT 240 

TTTGGCAATC ATTGAGGAAT AGAAGGAGAA AGTAAGTATG TCAAAAGAAA TTAAATTTTC 300 

ATCAGATGCC CGTTCAGCCA TGGTTCGTGG TGTCGATATC CTTGCAGACA CTGTTAAAGT 360 

AACCTTGGGA CCAAAAGGTC GCAATGTCGT TCTTGAAAAG TCATTCGGTT CACCCTTGAT 420 

TACCAATGAC GGTGTGACCA TTGCCAAAGA AATCGAATTG GAAGACCATT TTGAAAATAT 480 

GGGTGCTAAG TTAGTATCAG AAGTAGCTTC TAAAACCAAT GATATCGCAG GTGACGGAAC 540 

TACGACTGCA ACAGTCTTGA CCCAAGCTAT CGTCCGTGAA GGAATCAAAA ACGTCACAGC 600 

AGGTGCAAAT CCAATCGGTA TTCGTCGTGG GATTGAAACA GCAGTTGCCG CAGCAGTTGA 660 

AGCTTTGAAA AACAACGCCA TCCCTGTTGC CAATAAAGAA GCTATCGCTC AAGTTGCAGC 720 
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CGTATCTTCT CGTTCTGAAA AAGTTGGTGA GTACATCTCT GAAGCAATGG AAAAAGTTGG 780 

CAAAGACGGT GTCATCACCA TCGAAGAGTC ACGTGGTATG GAAACAGAGC TTGAAGTCGT 840 

AGAAGGAATG CAGTTTGACC GTGGTTACCT TTCACAGTAC ATGGTGACAG ATAGCGAAAA 900 

AATGGTGGCT GACCTTGAAA ATCCGTACAT TTTGATTACA GACAAGAAAA TTTCCAATAT 960 

CCAAGAAATC TTGCCACTTT TGGAAAGCAT TCTCCAAAGC AATCGTCCAC TCTTGATTAT 1020 

TGCGGATGAT GTGGATGGCG AGGCTCTTCC AACTCTTGTT TTGAACAAGA TTCGTGGAAC 1080 

CTTCAACGTA GTAGCAGTCA AGGCACCTGG TTTTGGTGAC CGTCGCAAAG CCATGCTTGA 1140 

AGATATCGCC ATCTTAACAG GCGGAACAGT TATCACAGAA GACCTTGGTC TTGAGTTGAA 1200 

AGATGCGACA ATTGAAGCTC TTGGTCAAGC AGCGAGAGTG ACCGTGGACA AAGATAGCAC 1260 

GGTTATTGTA GAAGGTGCAG GAAATCCTGA AGCGATTTCT CACCGTGTTG CGGTTATCAA 1320 

GTCTCAAATC GAAACTACAA CTTCTGAATT TGACCGTGAA AAATTGCAAG AACGCTTGGC 1380 

CAAATTGTCA GGTGGTGTAG CGGTTATTAA GGTTGGAGCC GCAACTGAAA CTGAGTTGAA 1440 

AGAAATGAAA CTCCGCATTG AAGATGCCCT CAACGCTACT CGTGCAGCTG TTGAAGAAGG 1500 

TATTGTTGCA GGTGGTGGAA CAGCTCTTGC CAATGTGATT CCAGCTGTTG CTACCTTGGA 1560 

ATTGACAGGA GATGAAGCAA CAGGACGTAA TATTGTTCTC CGTGCTTTGG AAGAACCCGT 1620 

TCGTCAAATT GCTCACAATG CAGGATTTGA AGGATCTATC GTTATCGATC GTTTGAAAAA 1680 

TGCTGAGCTT GGTATAGGAT TTAACGCAGC AACTGGCGAG TGGGTTAACA TGATTGATCA 1740 

AGGTATCATT GATCCAGTTA AAGTGAGTCG TTCAGCCCTA CAAAATGCAG CATCTGTAGC 1800 

CAGCTTGATT TTGACAACAG AAGCAGTCGT AGCCAATAAA CCAGAACCAG TAGCCCCAGC 1860 

TCCAGCAATG GATCCAAGCA TGATGGGCGG GATGATGTAA GCTTTCTATA GAAAACAACT 1920 

TATAAAAAAC ACAAAAGGAG GGAATGACTA ACCCTTCTTT TTATAGGCTC TTTGTCAACT 1980 

GTAGTGGGTT GAAGTCAGCT AAGCTCGAGA AAGGACAAAT TTCGTCCTTT CTTTTTTGAT 2040 

GTTCAAAGCG ATAAAAATCC GTTTTTTGAA GTTTTCAAAG TTTCGAAAAC CAAAGGCATT 2100 

GCGCTTGATA AGTTTGATGA GATTATTGGT CGCTTCCGGT TTGGCGTTAG AATAGTGTAG 2160 

TTGAAGGGCG TTGATAATCT TTTCTTTATC TTTGAGGAAG GTTTTAAAGA CAGTCTGAAA 2220 
AATAGGATGA ACTTGCTTAA GATTGTCCTC AATAAGTCCG AAAAATTTCT CCGGTTCCTT " 2280 

ATTCTGAAAG TGAAACAGCA AGAGTTGATA GAGCTGATAG TGATGTTTCA AGTCTTGTGA 2340 

ATAGCTCAAA AGCTTGTCTA AAATCTCTTT ATTGGTTAAA TGCATACGAA AAGTAGGACG 2400 

ATAAAATCGC TTATCACTCA GTTTACGGCT ATCCTGTTGT ATGAGCTTCC AGTAGCGCTT 2460 
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GATAGCCTTG TATTCATGGG ATTTTCGATC CAATTGGTTC AT,™ 

AT AATTTGAA CACGCACACG 

~. ccg„»,g„ g^acaat ««, ° 

~ ~G AGACTG1TTC _ _ 

~' — — ™ — ™ CAC^caa 

-ACGGCTC CAGCTTGTGT TCGCC^ 

*~ ~ ~-Ac™ 

««~ AGACATAATC gaga^ atgcTCAAAO 

™* «=n„ aatgacagtt caag^a, T0OCCKCTO „^ 
tcagtcatag Mmc UMm 

"~ "*» ~ ~-~ ™A ««« acag^ 

-~ GACCC™ AACCACAA„ „^ TACCAGTCGT 

™° — — — — — « ~A „a 

— « ATTTCCTrCT ^ 

— T G^AATA 

—A ^ m _ 

™ «~ AACAAAATAG CCTCCA™ ,^ T ^ 

;, T ^ «— — TTGAAA'TCSAA ATTAAAAAAA 

—AT.CC ««« «,« Mroac „ 

GCTTTATA7T T^cc _^ 
C-TAGATT AAATAGCATT ^ 

ATGTOGAAAT ACAAA^A T ^ 
TATACTAATC ATTOTTOTAT Aa^ t ,„ 

«««. -« ™>CACT„. „ ATMjnT ^ 

GCATAAATCT CTTTCT.g™ AC^GCTAT A.TAC.™ TTCC™™ 

™»«C ATTTATPTTA TAGTACCAAA A^ACC ATTTTCTTCC 
C-TTTACC GATTC1TTAG T^.c.™ CGCTTGTACC A» TCITT aC .TAGCC^ 
»»«GGAC C.CGTCCAT, gTTAATCCAA TCUKAACAA 
ATATAG.cc. ^^g. w 

TCCaAGAGAC GTPmcCCC ATCTGTAAAA ATTTTAACAT CCaAATCCTA ATCAAT.TAC 



2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
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TTCAGTGCTT CTTCATCCAG ATAGTAGGGG CTAGCCATAT 
TTATCACGAA TCATGGCAAT GATATTAAAC CAATATTTCT 
GGTTCTCGAG TGACCCAACG ACGACCATCA CTTTCGGTAA 
CCAATAATGG CGTTTTCTGT TGTTTTTAGT ACCATGGTGT 
CTCCCATCAT GCTTATAACT TTGAATTGTA ATAAAGTCGC 
ACTAACCAAC TTTCTACAAT TTATAAGTTT ATCATTTACT 
AAAATCTGTG AATTTCACTT GGAAATATTA AAGATATTCT 
CGAAAAATCG TAGCCCTTTC GTGCTAAAAC TTGAGTTAAA 
TTCATACTTT CGGGCATACT TAGTATATTG CTTATCAAGT 
AGTCGTTTCT TCATCAACTT GACTATCCAA TTCGTCAAAG 
AGAGAAGCCC TTGTTAGTCA AGTTCTGGAT AATCTTATCT 
TTTTCCCTCA TATTTTTTCA ATAGTTTATT GGCTACACGT 
AAAATCATTC AAGATTTCTT CTATAGTAGA TTTTGAAATT 
AGTCAGTACA TAAGGTCCCT TGTCTCCTGA AAGTTGATTG 
GTACTGGCTA TCATTAATCC ACTTCTCTTC TTTAAGATTA 
GTTTTCATTA ATATCATATT TTTTCAGATA TTCTCTGACC 
AAAGGATAAG TGGTAGAGGG CCAGATTCTT ACCATAAGAA 
CTCTTTCAAT TCCTCTTCGC TTATCACCTT ATCTCTCGAT 
GTCTTCGGTG ATATAGCATT TGTCG 
(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS - 

(A) LENGTH: 3636 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



TGCAATAGTA 
TGTGAAAGTA 
CAAGTGTATG 
CCCGCCAAGT 
CTTCTTTTGG 
ATTGTACCAT 
CTAAGAGCGC 
CGCTGCTTCA 
TCCTTGAAGA 
GCAATTTTAG 
TGCAGGGCAC 
TGAGCAACTT 
CCTTTTTGTG 
GCATTGATGA 
GCAATGACTT 
TCTTTTTCAG 
AATTGAGCAA 
AACATAAAAC 



AGAAGTTCCA 
AACAATAGCC 
ATCGTTGACA 
TCGGTGGAGA 
AAGCTTCATA 
AAAATTACCC 
TTGCTATATC 
GTTCGTATCC 
TGAGTTCCTG 
CATCAAAATA 
GAGCTGGAAG 
CCGAAAAATC 
CTAATTTCTG 
TAGCATAAGC 
GAGAAACGAT 
TACGTGCTTT 
AGTCTTGAAT 
GAACAATTGT 



4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5365 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 
TTTCCAGAAA GAAGTTGAGT AAAGTCTTTA TCAAAGAGAA TGACTTCCGT ATTGGAACTG 
ACATTAGGTT TTATTTCTAC TTTACTAGCG TCCGCCCTAG CATTTTCTAA ATCTTTAATC 
TCTTCTGTTG CCCTA^AT AGCCAGCTGA ATAACTGCTT GAGGATTTTC AC^AGTCCA 
TGAAGCTTAT CGTCCACCGA AGTATAAAGA CTCGAATGCA TGACTTGTAA AATAATCAGA 



60 
120 
180 
240 
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GTCATTGTAG AAAAAATCAG GGTGAAGACA CCGAAGTTGC GGATAAAATA ACTAAAGTCA 300 

TCCGCATACC ATGTTTTTTT AAGTTTACTG AACATCTTTT AAAAGATACC CAACACTACG 360 

CAAAGTTTGC AAATTCTCTG CAAAAGTGGT TCCCTTTAAT TTCTTACGGA CTTTTGAAAC 420 

ATAGACTTCG ACAACCGAAA TCGTTGTATC ACTATCAAAT CCCCATAGAC GGTCAAAAAT 480 

CTGCGTCTTA GGCAAAATCA CATTTTGATT TTGAAGGAAA TAAACTAGTA AATCGAACTC 540 

TTTCCCCAGC AATTCGACAG GAGTATCTTC AACTTTAACG GTATTGGTTG ATAAATTAAC 600 

CACGATATTC CCATAAGTCA AGGTGTTTTC ATTAAACTTC CCTGAACGTT TGAGAAGGGC 660 

CTGAATCCGC ATTTTAAGTT CTTCTAGGTA GAAAGGTTTG GTCAGATAAT CATCCGCTCC 720 

CAGTTCAAAT CCATGTCCCT TGTCATCCAA ACTTTCCTTG GCAGTCATAA TCAGAACTGG 780 

TGTCGTAATT CCCTTTTCAC GCAATTCTTT TAAGACTTGG AAACCATTTT TTTCTGGCAA 840 

CATCAAATCC AGCAAAATCA AGTCATAGAC ACCACTCTCA GCTTCGTAGA GACCTTCTTC 900 

TCCATCAAAT ACCTGCATAA CATCCGCAAA ATCGTCTAAA AAGTCAAATA CTGAATTTGA 960 

CAGACCTAGG TCATCCTCAA CCAATAAGAT TTTTATCATG AGAAACTCCT CCTTATTAAA 1020 

ACTATTATAC CAAATTTGCC TTAAAAAAAA CTCAACTCTC TGCATTTTAC ATGAGATAGC 1080 

TGAGTTTTCT TTTTATTTTA GGCTTATTTA TGCATTTCCG TATTGAAGAA CAACTGCTTC 1140 

GACTGCAGCT TTTTCACGGC TAATCAAGTC AACACGCGCT GCAATTTCCT TGATTCCCAT 1200 

ACCGATGTTA CGGCTAAGAG CAAGGTCAGA AAGTTGCGGT TCAAAGAACT CCTTGTATTC 1260 

CGCCAAGCGT TGCTGAGTCT TAAATACATG AGCAGGAAGG ATAACAAAGC TATCAAAGCT 1320 

CATATCTCCT CCAAGGGCTG CCTTAATCCA AGCCCAGTTT TCACGCGCCC AAGACCAAGC 1380 

TGTTTTCTGA GTTGCTTGAT GAGCTAGGAA TTGGTAATAC CAAGCAGACA AGTCCTGTGG 1440 

TTTGACCACA AATTTGTCCT TCCAAGAAGT AATCAGGTTT TGGATATTAT CCGCATCTGT 1500 

ACTGTATGCA AGAGCTGCTG CCAACTGGCG TTTAAAGACA GCATCTGTTG CGTGAGTATA 1560 

AGTATCAAGA TAAAGTGCTA ACAAGTCTTT AGTCTCATGA TGTTTCATCT CATTAATCAG 1620 

AACTTGTGAG CGAATAGCTG CTGGGAGTCC TGCAAGATTC TCCTTGTGTG TTGCGAAGAT 1680 

TTGGCTAGCG ACTTGACTAG CTTCTGCATC ATTTGAGCGA ATCATCATCG AAACAGCCAG 1740 

CTGACGAACC AATTCATCCT CATCTGATTC TCCGTCTTTA GCTTCAAAAC CAAGACGGTC 1800 

ATAGTTATGA CGAGCCAATT TAGCAACCAG TCCTTTGAAG GCTGTTTCAG CATCCGTTCC 1860 

TTCATCAATA AAGCGCTCAA GGGCTGAAAT CACTTGAGAA ACAGCTGAAA CCACCAGATA 1920 

AGACTCTTCC TTAGCAAGTT TATCAAGAAC TGGAAGCAAG TCTGCATAAG AAATGTGCCC 1980 

TGCCTCAGCC AACAAACGAC GTTCTTGAAC AATTTGCAGT TTGCTTGTGT TATCAAGTGT 2040 
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CTCTAGCTCA GCAAGAACAG CTGCTAACAA GTCTCCTTGA TAGTCGGTAA TATAGTGGGC 2100 

AGTATTTTCA GTGTTGAGAC GAAGAGCTCC TTCATTTTCA GCAAGAAGAG CTGCGTAGCC 2160 

AGGGATTTCG ATACTTTCAG TTTCGAGTGT ATCAGGCAAG CCTTTCCAGT TGCTATTGAG 2220 

GGGCACCACC CAGAGACGGT TCTTGTCTTC GTTCTCACCG ATGAAGAATT GTTTTTGTGA 2280 

AATCTTCAAG ACATCATTTT CAACTTTAAC AGTAAGAACT GGGTAACCAG GCTGTTCCAA 2340 

CCAAGAATCC ATGAAGGCTG CGACATCACG TCCTGACGCT TGACCAAGGG CATCCCAAAG 2400 

GTCACTACCA ATGGTGTTGC TGTATTGGTG TTTTTCAAAG TAGGCGTGCA AACCTTTAGC 2460 

AAAATCAGCA TCTCCTAGCC AACGGCGAAG CATGTGCATG AGACGGCTTC CTTTGGCATA 2520 

GACGATAGCG CCGTCAAAGA GTGTATTGAT TTCATCTGGA TGTTTAACTT CGACGTGGAC 2580 

AGACTGAACG CCATCAGTAG CGTCACGTTC AAGAGCAAGA GGTACTCCAC CTGTTTGGAA 2640 

ATCTTCAAAG ATATTCCAGC TTGGTTCGAT GGTATCCACA CAGACGTATT CCATCATATT 2700 

AGCGAAACTT TCATTGAGCC AAAGGTCATC CCACCATTTC ATAGTCACGA GGTTCCCAAA 2760 

CCATTGGTGA GCCAATTCAT GGGCCACAAC AAGGGCAACT TGTTGACGGC TAGCAAATGT 2820 

AGAGTTCTCA TCGACAACCA AGTAAACTTC ACGGTAGGTC ACAAGACCCC AGTTTTCCAT 2880 

AGCACCAGCT GAGAAGTCAG GAAGGGCGAT GTGGAGAGAT TGAGGAATTG GGTACTTAAC 2940 

TCCATAGTAA TCTTCGTAAA ACTCGATAGA GCGAACAGCG ATATCCAGTG AGAAATCAAG 3000 

ATTTGAAAGT GGATGTGCTT TGGTTGAGTA GACACCTACC AGGGTACCAT TTTTAGTTTT 3060 

AGCGGTCACC CCTTGCAAAT CACCAGCAAC AAAGGCCAAC AAGTAAGAAG ACATGCGAGG 312 0 

TGTTGTCTCA AACTTCCAGA TACCTGTTTC CTTACGGTTT TCAACATCGA TTTCTGGCAT 3180 

GTTTGACAAG GCCAATTCAC CTTCTGCTTG GTCAAAGCGA AGAGAGAGGT CAAAAGTTGC 3240 

TTTGGCTTCA GGCTCATCCA CACATGGGAA AGcTTCGCGC GCAAAATGGC TCTCGAACTG 3300 

AGTAGACAAG ACCTCCTTCT TGACTCCATC AACTGTATAA TAAGAAGGGT AAATCCCTGT 3360 

CATGTTGTCT GTAATTTTAC CAGAAAAGGC AAGAACCAAT TCAACTTGAC CAGCCTCAGC 3420 

CAATTCGATA TGAAGGGCTT CATTGTCATG GTCAACTGTA AATGGACGAG CTTGACCTGC 3480 

AACTTCTACA GAGGTGATTT CCAAATCTTT TTGGTGGAGG GAGATGCGGT CACTCTGTGC 3540 

TTGACCAGTG ATGGTCACTT TCCCAGAAAA AGTCTTGGTC TCACGACTCA AATCTAAAAA " 3600 

TAAATCATAA TGTTCAGGAA CAAATTGCTT AATGGG 3636 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 5066 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
{D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 



ATAGCGTGTA 


ATAATCGATT 


TTAGAGGTAC 


CATAAGCCAC 


CTCCTACAAA 


TAGAAACCGA 


60 


TATAAATCAA 


TGCCTTCCAC 


CCTTAGACTT 


CCCTAGTTCC 


TGTCTCAAGC 


GAAACATTTC 


120 


TTTGAAACAG 


GAATAAGTTA 


ACCAATTCAT 


ACCAATAGCT 


AGCAGAATAA 


AAAGAAACCA 


180 


AATGCCCCAT 


AACTTGATAT 


CTGTCACATT 


TCTCAAGACG 


GTATTGAAAA 


ACAGAACTGA 


240 


AACAACTGTC 


CAAGCAAGGC 


TAAAAAGAGA 


ATAGAAGGGG 


ATGTAAAACC 


AGTAAAAATA 


300 


ATAAAAAATT 


GGAAAAAACT 


TACTATTTCT 


GTTGGCCTTT 


TCAATCCAGT 


TATCAAAATA 


360 


AAAGTACGGT 


GCTAAAAGTA 


AGAATTTAAA 


CAAATGTTCC 


ATCACCGACA 


TCCCCCCTTC 


420 


TTTTGATAGC 


GTTTTCTATT 


ATTTTATTAT 


ATCAAAAAAA 


TCCGGAACTG 


TCATTCCAGA 


480 


TTCTACTTTT 


TTATTTGCGT 


TTTCTTGCGA 


TGAGATGAAT 


CGGTGTTCCC 


TCAAAAACAA 


540 


AGGCCTTGCG 


GATTTGATTT 


TCCAAGAAAC 


GCAGGTAAGA 


AAAGTGCATG 


AGTTCTTCTT 


600 


CATTGACAAA 


GATGACAAAG 


GTTGGTGGTT 


TGGTTGCCAC 


TTGGGTCGCA 


TAGAAAATCT 


660 


TGAGACGTTT 


TCCTTTGTCT 


GTCGGTGTTG 


GGTTGATGGC 


AATGGCATCC 


ATGATGACAT 


720 


CGTTCAAGAC 


AGCTGATGGA 


ATACGTGTAT 


TTTGACTTTC 


GCTGATTTGC 


TTAATCATCT 


780 


CAGGAAGTTT 


GTGGAGACGT 


TGCTTGGTTA 




JVAAGATAATC 


GGTGCGTAAG 


840 


GCAGGTATTG 


GAACTGCTCA 


CGGATATCTT 


CTTCCCAGTT 


TTTCATAGTG 


TGGTTATCTT 


900 


TTTCAAGCGT 


ATCCCACTTG 


TTGACCACGA 


TAATCATCCC 


TTTACCAGCT 


TCATGGGCAA 


960 


ATCCTGCGAT 


ACGCTTGTCG 


TACTCACGAA 


TGCCTTCTTC 


CGCATTGATG 


ACCATCAAGA 


1020 


CCACATCTGA ACGGTCAATA 


GCACGCATGG 


CACGCATAAC 


AGAGTATTTC 


TCAGTATTTT 


1080 


CATAAACCTT 


ACCAGACTTA 


CGCATACCAG 


CCGTATCAAT 


CATGGTAAAC 


TCTTGACCAT 


1140 


CTGTATCTGT 


AAAGTGGGTA 


TCAATGGCAT 


CACGAGTTGT 


TCCAGCAACA 


GGACTAGCAA 


1200 


TAACACGGTC 


TTCTCCCAAG 


ATAGCATTGA 


TCAAGCTTGA 


TTTTCCAACG 


TTAGGACGAC 


1260 


CAATCAAGCT 


AAACTTAATG 


ACATCTGGAT 


TTTCTTCCTC 


ATATTCATTT 


GGAAGATTTT 


1320 


CTACGATCGC 


ATCTAGCACA 


TCCCCTGTAC 


CGATTCCATG 


GACAGATGAG 


ATAGGCAATG 


1380 


GTTCACCCAA 


ACCGAGAGCA 


TAGAAATCAT 


ATATATCATT 


TCTCATCTCA 


GGGTTGTCCA 


1440 


CCTTGTTGAC 


TGCGAGGATA 


ACTGGTTTGT 


GGGTCTTATA 


AAGCTTACGA 


GCTACGTATT 


1500 


CGTCTGCATC 


AGTAATTCCT 


TCCTTACCAG 


ACACGACAAA 


AACGATAACA 


TCTGCTTCTT 


1560 
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CCATGGCAAT TTCTGCCTGG TGCTTGATTT GTTCCATGAA AGGAGCATCG ACATCATCAA 1620 

TTCCTCCTGT ATCAATCATG CTAAAAGAAC GATTGAGCCA CTCACCCGTT GCATAAATAC 1680 

GGTCACGTGT CACTCCTTCG ACATCTTCTA CAATGGAGAT TCGCTCACCA GCGATCCGAT 1740 

TAAATAGGGT TGATTTCCCA ACATTGGGAC GTCCTACAAT GGCAATAGTT GGTAGGGCCA 1800 

TAATTTCTCA CTTTCTACAA TAATTTCTTC TGTTCAAGAT TTTTTCTAGT TGAGCTTGGT 1860 

TCAGCTTGAC CAAACTGTTC TGCTAGGCGC TGACTCCAGC TTGTGGTCGC ACGCGCCCCA 1920 

GCATAGTCAG CCTGAACACG GTCATAAGCT TGGATTGCCT CAGTTGACTG TTCTTGGTAT 1980 

TCTTCCTCAA AGACAACATT CTCTAGTGGC AGTCTCGGTT TCATATCATG ATGTTGATTT 2040 

GGCACACCCA GTGCCATCCC AAAGACAGAA TAGGTGTAGT CAGGTAGGTT AAAGAGCTCT 2100 

GCCACTTCTT CAGACTTGTA TCGAACCAAA CCGATAATCA CACCACCATA GCCCAAGCTT 2160 

TCAGCTGCCA ACAAGGCGTT TTGTCCAGCA AGAGCTGCAT CGACCGAACT AATCAAGAGA 2220 

CCTTCCACAC CTTGGGGTTG GAAGGTGTCG GTATGAAGTC GGGCTCCCTT TTCTGCTCGG 2280 

TTCAAATCTC CGACAAAGAG AAGGAAAACA GCAGACTGGC GAATGGCTTC TTGAGGTACC 2340 

AATTCATACA AGGCATCTTT CTTCTCTTGA CTTCGTACCA CAATCACAGA GTAGGATTGG 2400 

AAATTCTTCC AAGATGATGC CATCTGGGCT GCTGTCAAAA TCTCATTTAA GTCTACTTGG 2460 

GGAATTTCTT GCTCTTTAAA CCTGCGCACT GAAGTATGAG CCTTCATCAA TTTAATGGTT 2520 

TCTGTCATCG ACGGTTTACT CCTTCTAAAC GAGTCTCCTC AGCCAAATAA CGGATGCGTT 2580 

CCATGACCCG TCTGGCTTCC CAGGTTTCGT CATTTCCATG TTTCACTTTC GCAAAATGCT 2640 

TCTCCAAATC TTCAAAGTTG AAGTTGGATG TGAAAAAGGT CGGTAAATTT TCCTGCATCC 2700 

GATATTGGAG AATGACCTGC AGGATTTCGT CACGCACCCA AACGGTTGAT TGCTCGGCGC 2760 

CAATATCATC TAAAATCAGG ACCTCAGACA GCTTAATCTC ATCCACCAAG GTCTTAACAT 2820 

TGCCATCACT GATAGCATTT TTGACATCAA TGACAAAGCT AGGATAGTGG AGGAGAGTTG 2880 

ATGAAACACC ACGTTTTTCT GATAAATCAT GAGCTAAGGC CGCCACCATG AAACTTTTAC 2940 

CCACACCAAA GTCTCCATAT AAGTAAAGAC CTTTTCGAAT AGCTGGATAT TGCTCCACGA 3000 

AGGCTAGTAG CTTTTCAAAA ACTGGTAAGC GCCCCAAATC ATCCAAGTCA ACTTGAGCCA 3060 

AACTAGCTTT CTTGAGACTG GCTGGTAGAT TGATTAACTT GAGACGGTTC TTAATAGCCG" 3120 
CTTCTTTTTC AGCCGCGATT AGCTCAGGAG TTTCTTCATA TGAAACATCT GCATAACCAT . 3180 

GATTCTTAAC CAAAATCGGC TTGTAGCCTT TGGCAATATA ATCCGTATCC CCACGGAGAA 3240 

ACTTGTCACG CTCGGTGATG TACTGATTAA ACTTGGAGAT ACTGCGATTT AATTCCTTTG 3300 
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GAGTTAAGGA TTCTTGCTGG ATAAAGGCCG CAACATCAGG GTCCTTCATG ATTTTCTGGA 3360 

CCAAATCTTG ATAATAAAAA CGGCTGGGTT GACGTTTGAG TACGTCTCCG ACACTTTCCA 3420 

TCTAATCTCC TCCTTTTTCT AATCGAGCTA ATAGTTCTTG CTTCTTACGT TCTAGTTCCA 3480 

GACGAGTTTC CTCGCTGGTT TCATTCTTAT ATTCAGGATT ACTCCATTTA GGAACATTGG 3540 

TTTTTTCTGG GGCAGTCTGA TTCTGTTTTT GTGTTTTTGC TTTCTGCCCT CGATCACGAA 3600 

TTCGTAAAAC GGCCTCTTCT GCCGAATGAA TCTTTTGATA GGCATAGTCA TTGGCTACCT 3660 

TCATGGCATA TTTCTCATTG ATATTTGCCG AATCCACCTT ATTAAAGGTC AATAAGAGAA 3720 

TAATATTGAT GACTTCGTCC AGTAAGCCCA AGCCAGCCAT CTGTTGCAAG AGTTCTCTTT 3780 

CTGTTTGGGT AATGGTTCCC TTGCGTGTTT GCTTGATTTC TGCTAAGAAC TGCAGGGCAG 3840 

TTTTACTTTT AGCTTCTTTG ATAATGGTCG CTTCCTTAAG ACTAAAGTCA GAGGAAACTG 3900 

GTTTTTGAGC AATTTTTTCA CGCATGCGTT TGGTTGAAAT AACCTGGGAA ACAGCTGTTG 3960 

ACTTGGCCAA TTGATAGGTT TCAAACCAAG TCCATTTCTT CTCCTCGGCA ATAGCAAAGA 4020 

GGTTTAAGAC ATCGGACTGC TCATCCGCAA AACGAAGTCC ATCTCGAGCC ATCAGCTGGC 4080 

GAAAATGTTC CAAGTCAAAA TCATTGGCCA CTTTCTTCTT GAGACCAAGG TCTTCTTGAC 4140 

TGCCTAGTTC TGCCAATTCT GGAAAGACTT GATTGAGTGA GACAGGTATT TCTTCACCAT 4200 

CAGCACTTTC AACTTTCAAA TCCTCCACAG CTACATCGCC AATCTTTTTC TCTAAGAGTC 4260 

TGCGATAAAC AGGATGCCCC AAGAAGTCTT GACTAGATAG AGGAGCATGG AGGGCTAGCT 4320 

GATAAACATC ACCCTTTTGA TAGAGGGTCA AGAGATTAAA AGCAGATAAG ATTTTCAATG 4380 

ATTTTATCAG TCTATCCATC CCAAAGTTGA GATGGTTGAG AATGCTTGAA AAAAGATATT 4440 

CCTTTCTACC ATTATCCCAA AAACTGATTG TATAAAGATA AAGGCTCAGT GCCTCCTGAC 4500 

CGATAATCGG GAGGTAGCAC TGTACCAGAG ATGAGGTATC TTGCGACACC CGATTATTCT 4560 

TTAGATAAGA AAAACGGTCA ATTGGCTTCA TTTATCTTTC CTTTTTCTTT TTAGAGGACT 4620 

GGGTGATTTG TTGGAGCAAG CTCTCTAACT CACTGACATC CTTAAAACTA CGATAGACAC 4680 

TAGCAAAACG TACATAGGTA ATCTCGTCCA ATTCAGCCAA CTCCTCCATG ACGAGTGAAC 4740 

CAATGTCCTC ACTTTGAATT TCATTTTCAT TTCGACCACG GAGTTTCTGT TCGATACGAT 4800 

TGACTACCAT GTTGATTTCA TCACTTGACA CAGGACGTTT CTGGGCTGAG CGGATAATCC 4860 

CATTAAAGAT TTTATCTCTG GAGAATTGTT CCCGTGTGCC ATCTTTTTTA ACAACCACTA 4920 

AGGTTCTTTC TTCTACTCGT TCGTAGGTTG TAAAACGGTG TTGGCATTCG TCGCACTCAC 4980 

GTCTTCTACG AATGGTGTTC CCTTCTTCTG CTTGGCGACT ATCGATAACA CTTGACTTGG 5040 

TAGCCCCACA TTTTGGACAG GGTACC 5066 
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(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9607 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(XX) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

CACTTGAAGT ATTTGAAACA GCTATGGAAA ACATCATGCC TGTACTTGAA GTACGTGCAC 60 

GTCGTGTTGG TGGTTCTAAC TACCAAGTCC CAGTTGAAGT TCGTCCAGAA CGTCGTACAA 120 

CACTTGGACT TCGTTGGTTG GTAACAATCG CTCGTCTTCG TGGTGAACAC ACAATGCAAG 180 

ACCGTCTTGC AAAAGAAATC TTGGATGCTG CTAACAACAC TGGTGCAGCA GTTAAGAAAC 240 

GTGAAGATAC TCACCGTATG GCTGAAGCTA ACCCTGCATT CGCACACTTC CGTTGGTAAG 300 

ATAGGATGCG AAAGCGTTAA GAAAGTCCCA GAGAAAATAG GGAATCGAAG CAGGTTGCGG 360 

TTGCAACCAA TGAGATTCAT CTTTTTCTCC AGACTTTTAG CTTGAGCTCA ACTAAATCAT 420 

GATGCTAGGA ACGGTAAGGA TGCAAGGTAA AAATAGGAAA CTGACGCAGT ATTCGACGAA 480 

TACAAGGAGT TTTATCTTTT TCACGCAGCA TCCCGTTCCA GCTCACATCG GCTAACTAAC 540 

TTTAGCCCGG GTTCAAATTA GCTAAATCGA TTAGTATTAG CTATAACTCA GCTTACCATC 600 

TCGTAAGTTG AAACCAACAA TAGCATGAAA ACATTGAGAA CGGGTAGGTC CTGCCTATCC 660 

GTTTTTATTA AAATCGTGTT ATAATAGAAT AGAAATCAAA AATAAATAGG AGAAACAAAC 720 

CTCATGGCAC GCGAATTTTC ACTTGAAAAA ACTCGTAATA TCGGTATCAT GGCTCACGTC 780 

GATGCCGGTA AAACAACAAC TACTGAGCGT ATTCTTTACT ACACTGGTAA AATCCACAAA 840 

ATCGGTGAAA CTCACGAAGG TGCGTCACAA ATGGACTGGA TGGAGCAAGA GCAAGAACGT 900 

GGTATCACGA TCACATCTGC TGCGACGACA GCTCAATGGA ACAACCACCG CGTAAACATC 960 

ATCGACACAC CAGGACACGT GGACTTCACA ATCGAAGTAC AACGTTCTCT TCGTGTATTG 1020 

GATGGTGCGG TTACCGTTCT TGACTCACAA TCAGGTGTTG AGCCTCAAAC TGAAACAGTT 1080 

TGGCGTCAAG CAACTGAGTA CGGAGTTCCA CGTATCGTAT TTGCCAACAA AATGGACAAA 1140 
ATCGGTGCTG ACTTCCTTTA CTCTGTAAGC ACACTTCACG ATCGTCTTCA AGCAAATGCA ' 1200 

CACCCAATCC AATTGCCAAT CGGTTCTGAA GATGACTTCC GTGGTATCAT TGACTTGATC 1260 

AAGATGAAAG CTGAAATCTA TACTAACGAC CTTGGTACGG ATATCCTTGA AGAAGACATC 1320 

CCAGCTGAAT ACCTTGACCA AGCTCAAGAA TACCGTGAAA AATTGATTGA AGCAGTTGCT 1380 
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GAAACTGACG AAGAATTGAT GATGAAATAC CTCGAAGGTG AAGAAATCAC TAACGAAGAA 1440 

TTGAAAGCTG GTATCCGTAA AGCGACTATC AACGTTGAAT TCTTCCCAGT ATTGTGTGGT 1500 

TCAGCCTTCA AAAACAAAGG TGTTCAATTG ATGCTTGATG CGGTTATCGA CTACCTTCCA 1560 

AGCCCACTTG ACATCCCAGC AATCAAAGGT ATTAACCCAG ATACAGACGC TGAAGAAATT 1620 

CGTCCAGCAT CTGACGAAGA GCCATTTGCA GCTCTTGCCT TCAAGATCAT GACTGACCCA 1680 

TTCGTAGGTC GTTTGACATT CTTCCGTGTT TACTCAGGTG TTCTTCAATC AGGTTCATAC 1740 

GTATTGAATA CTTCTAAAGG TAAACGTGAA CGTATCGGAC GTATCCTTCA AATGCACGCT 1800 

AACAGCCGTC AAGAAATCGA CACTGTTTAC TCAGGTGATA TCGCTGCTGC CGTTGGTTTG 1860 

AAAGATACTA CAACTGGTGA CTCATTGACA GATGAAAAAG CTAAAATCAT CCTTGAGTCA 1920 

ATCAACGTTC CAGAACCAGT TATCCAATTG ATGGTTGAGC CAAAATCTAA AGCTGACCAA 1980 

GACAAGATGG GTATCGCCCT TCAAAAATTG GCTGAAGAAG ATCCAACATT CCGCGTTGAA 2040 

ACAAACGTTG AAACTGGTGA AACAGTTATC TCAGGTATGG GTGAACTTCA CCTTGACGTC 2100 

CTTGTTGATC GTATGCGTCG TGAGTTCAAA GTTGAAGCGA ACGTAGGTGC TCCTCAAGTA 2160 

TCTTACCGTG AAACATTCCG CGCTTCTACT CAAGCACGTG GATTCTTCAA ACGTCAGTCT 2220 

GGTGGTAAAG GTCAATTCGG TGATGTATGG ATTGAATTTA CTCCAAACGA AGAAGGTAAA 2280 

GGATTCGAAT TCGAAAACGC AATCGTCGGT GGTGTGGTTC CTCGTGAATT TATCCCAGCG 2340 

GTTGAAAAAG GTTTGGTAGA ATCTATGGCT AACGGTGTTC TTGCAGGTTA CCCAATGGTT 2400 

GACGTTAAAG CTAAGCTTTA TGATGGTTCA TATCACGATG TCGACTCATC TGAAACTGCC 2460 

TTCAAGATTG CGGCTTCACT TTCCCTTAAA GAAGCTGCTA AATCAGCACA ACCAGCTATC 2520 

CTTGAACCAA TGATGCTTGT AACAATCACT GTTCCAGAAG AAAACCTTGG TGATGTTATG 2580 

GGTCACGTAA CTGCTCGTCG TGGACGTGTA GATGGTATGG AAGCACACGG TAACAGCCAA 2640 

ATCGTTCGTG CTTACGTTCC ACTTGCTGAA ATGTTCGGTT ACGCAACAGT TCTTCGTTCT 2700 

GCATCTCAAG GACGTGGTAC ATTCATGATG GTATTTGACC ACTACGAAGA TGTACCTAAG 2760 

TCAGTACAAG AAGAAATTAT TAAGAAAAAT AAAGGTGAAG ACTAATCCGT CCTCACTCTA 2820 

GAAGGAAGTC ACTTAGTGGC TTCCTTTTGT CTTTAGAAAA TACCTCTAAA TATGGTAAAA 2880 

TAGTAGAAGA ATAATGTGAG GAAAATGAAT GTCAAATAGT TTTGAAATTT TGATGAATCA 2940 

ATTGGGGATG CCTGCTGAAA TGAGACAGGC TCCTGCTTTA GCACAGGCCA ATATTGAGCG 3000 

AGTTGTGGTT CATAAAATTA GTAAGGTATG GGAGTTTCAT TTCGTATTTT CTAATATTTT 3060 

ACCGATTGAA ATCTTTTTAG AATTAAAGAA AGGTTTGAGC GAAGAATTTT CTAAGACAGG 3120 

CAATAAAGCT GTTTTTGAAA TTAAGGCTCG GTCTCAAGAA TTTTCAAATC AGCTCTTGCA 3180 
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GTCCTACTAT AGGGAGGCTT TCTCTGAAGG TCCATGTGCT AGTCAAGGTT TTAAGTCCCT 3240 

TTATCAAAAT TTGCAAGTTC GTGCTGAGGG TAATCAGCTA TTTATTGAAG GATCTGAAGC 3300 

GATTGATAAG GAACATTTTA AGAAGAATCA TCTTCCTAAT TTAGCCAAAC AACTTGAAAA 3360 

GTTTGGTTTT CCAACTTTTA ACTGTCAAGT CGAGAAGAAT GATGTCCTGA CCCAAGAGCA 3420 

GGAAGAGGCC TTTCATGCTG AAAATGAGCA GATTGTTCAA GCTGCCAATG AGGAAGCGCT 3480 

CCGTGCTATG GAACAACTGG AGCAGATGGC ACCTCCTCCA GCGGAAGAGA AACCAGCCTT 3540 

TGATTTTCAA GCGAAAAAAG CTGCAGCTAA ACCCAAGCTG GATAAGGCGG AGATTACTCC 3600 

TATGATCGAA GTGACGACAG AGGAAAATCG TCTGGTATTT GAAGGGGTTG TTTTTGATGT 3660 

GGAGCAAAAA GTGACTAGAA CAGGTCGTGT TTTAATCAAC TTTAAAATGA CGGACTATAC 3720 

TTCAAGTTTT TCTATGCAAA AGTGGGTTAA AAACGAGGAA GAGGCCCAGA AGTTTGACCT 3780 

CATCAAGAAG AATTCTTGGC TCCGAGTTCG AGGGAATGTG GAGATGAATA ACTTCACACG 3840 

CGATTTGACT ATGAACGTAC AGGATCTGCA GGAAGTTGTT CACTATGAGC GGAAGGATTT 3900 

GATGCCAGAA GGTGAGCGTC GGGTTGAGTT TCATGCTCAT ACTAACATGT CGACTATGGA 3960 

TGCTTTGCCA GAGGTCGAAG AGATTGTTGC AACAGCTGCT AAGTGGGGAC ACAAGGCGGT 4020 

TGCTATCACG GACCATGGGA ATGTCCAGTC CTTTCCACAT GGCTATAAGG CGGCTAAGAA 4080 

AGCGGGAATC CAGCTGATCT ATGGGATGGA AGCCAATATC GTGGAGGACC GTGTCCCTAT 4140 

CGTCTATAAC GAAGTGGAGA TGGACTTGTC AGAAGCAACC TACGTGGTCT TTGACGTGGA 4200 

AACGACGGGA CTTTCAGCTA TCTATAATGA CTTGATTCAG GTTGCGGCTT CTAAGATGTA 4260 

CAAGGGGAAT GTTATTGCTG AATTTGATGA ATTTATCAAT CCTGGGCATC CCTTGTCAGC 4320 

CTTTACTACA GAGTTAACTG GAATTACAGA TGATCATGTC AAAAATGCCA AACCACTAGA 4380 

ACAAGTTTTG CAAGAATTCC AAGAATTTTG CAAGGATACG GTCCTAGTTG CCCACAATGC 4440 

TACCTTTGAC GTTGGCTTTA TGAATGCTAA TTATGAGCGG CATGATCTTC CAAAGATTAG 4500 

TCAGCCAGTT ATTGATACGC TGGAGTTTGC TAGAAACCTC TATCCTGAGT ATAAACGCCA 4560 

TGGTTTGGGG CCTTTGACCA AGCGTTTTGG TGTGGCCTTG GAACATCACC ACATGGCCAA 4 620 

CTACGATGCG GAAGCGACTG GTCGTCTGCT TTTCATCTTT ATCAAAGAGG TAGCAGAAAA 4680 

ACATGGTGTG ACCGATTTAG CTAGACTCAA CATTGATCTA ATCAGTCCAG ATTCTTACAA 4740 

AAAAGCTCGG ATCAAGCATG CGACCATCTA TGTCAAGAAT CAGGTAGGTC TAAAAAATAT 4800 

CTTTAAGCTG GTTTCCTTGT CTAATACCAA GTATTTTGAA GGAGTGCCAC GGATTCCGAG 4860 

AACGGTTCTA GATGCCCATC GAGAGGGCTT GATTTTAGGT TCAGCCTGTT CAGAGGGTGA 4920 
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AGTTTTTGAC GTGGTCGTTT CTCAAGGTGT GGATGCGGCG GTTGAGGTGG CCAAGTATTA 4980 

TGATTTTATC GAGGTCATGC CACCGGCTAT CTATGCACCC TTGATTGCCA AAGAGCAGGT 5040 

CAAGGATATG GAGGAACTCC AGACCATTAT CAAGAGTTTG ATAGAGGTTG GAGACCGCCT 5100 

TGGCAAGCCT GTTCTGGCTA CGGGAAATGT TCACTATATC GAACCGGAAG AAGAGATTTA 5160 

TCGTGAAATT ATCGTCCGTA GTTTGGGACA GGGTGCGATG ATTAATCGAA CTATCGGTCA 5220 

TGGTGAACAT GCCCAACCAG CACCACTTCC AAAGGCTCAT TTTCGAACGA CTAATGAGAT 5280 

GTTGGATGAA TTTGCCTTTT TGGGAGAGGA ACTGGCTCGT AAACTGGTTA TTGAAAACAC 5340 

CAATGCCTTG GCAGAAATAT TTGAATCCGT TGAAGTCGTT AAGGGTGACT TGTATACGCC 5400 

TTTCATCGAC AAGGCTGAAG AAACAGTTGC TGAGTTGACC TATAAGAAAG CTTTTGAGAT 5460 

TTATGGAAAT CCGCTGCCAG ATATTGTTGA TTTGCGGATT GAAAAAGAAT TAACATCCAT 5520 

ACTGGGGAAT GGATTTGCTG TGATTTATCT GGCATCGCAG ATGCTGGTGC AACGTTCTAA 5580 

TGAACGGGGT TATTTGGTTG GTTCTCGTGG GTCTGTCGGA TCTAGTTTCG TTGCGACCAT 5640 

GATTGGGATT ACGGAGGTCA ATCCTCTCTC TCCTCACTAT GTCTGTGGTC AGTGTCAGTA 5700 

CAGTGAGTTT ATCACAGATG GTTCGTACGG TTCAGGATTT GATATGCCCC ATAAGGACTG 5760 

TCCAAACTGT GGTCACAAAC TCAGTAAAAA CGGACAGGAT ATTCCGTTTG AGACCTTCCT 5820 

TGGTTTTGAT GGGGATAAGG TTCCTGATAT TGACTTGAAC TTCTCGGGAG AAGATCAGCC 5880 

TAGCGCCCAC TTGGATGTGC GTGATATCTT TGGTGAAGAA TATGCCTTCC GTGCGGGAAC 5940 

GGTTGGTACG GTAGCTGCCA AGACTGCCTA TGGATTTGTC AAAGGTTACG AGCGAGATTA 6000 

TGGCAAGTTT TATCGTGATG CAGAAGTAGA ACGCCTCGCT CAAGGAGCGG CGGGTGTCAA 6060 

GCGGACAACA GGCCAACACC CGGGGGGAAT CGTTGTTATT CCGAACTACA TGGATGTCTA 6120 

CGATTTTACG CCTGTCCAGT ATCCAGCAGA TGATGTCACG GCTGAATGGC AGACCACTCA 6180 

CTTTAACTTC CACGATATCG ATGAGAACGT CCTCAAACTC GATGTACTGG GACATGATGA 6240 

TCCGACTATG ATTCGAAAAC TTCAGGATTT GTCTGGTATT GACCCTAATA AAATTCCTAT 6300 

GGATGACGAA GGCGTGATGG CACTCTTTTC TGGGACTGAT GTGCTAGGGG TAACACCTGA 6360 

ACAAATTGGA ACGCCTACGG GTATGTTGGG GATTCCAGAG TTTGGAACAA ATTTCGTACG 6420 

TGGAATGGTA GACGAAACCC ATCCGACAAC CTTTGCGGAA TTGCTTCAGC TGTCTGGTCT 6480 

GTCCCACGGT ACTGATGTTT GGTTGGGGAA TGCTCAGGAT CTGATTAAGC AAGGAATAGC 6540 

GGACCTATCG ACTGTTATCG GTTGTCGGGA CGACATCATG GTTTACCTCA TGCATGCGGG 6600 

TCTGGAACCT AAGATGGCCT TTACCATTAT GGAACGGGTA CGTAAGGGTT TGTGGCTAAA 6660 

GATTTCAGAA GAGGAGAGAA ATGGCTATAT CGAAGCAATG AAGGCTAATA AGGTGCCAGA 6720 
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GTGGTATATC GAATCCTGTG GGAAAATTAA GTACATGTTC CCTAAGGCCC ATGCGGCAGC 6780 

CTACGTTATG ATGGCCTTGC GTGTAGCTTA CTTCAAGGTT CACCATCCTA TTTATTACTA 6840 

CTGTGCTTAC TTCTCCATTC GTGCTAAGGC TTTTGATATC AAGACCATGG GTGCGGGCTT 6900 

GGAGGTCATC AAGCGCAGAA TGGAAGAAAT CTCTGAAAAA CGGAAGAACA ATGAAGCCTC 6960 

TAATGTGGAA ATCGATCTCT ATACAACTCT TGAGATTGTC AATGAGATGT GGGAACGAGG 7020 

TTTCAAGTTT GGTAAATTAG ATCTCTACTG TAGTCAGGCG ACAGAGTTCC TCATCGACGG 7080 

GGATACCCTT ATCCCACCAT TTGTAGCAAT GGATGGTCTG GGAGAGAACG TTGCCAAGCA 7140 

ACTGGTGCGG GCGCGTGAAG AGGGAGAATT CCTCTCTAAA ACAGAACTAC GCAAGCGTGG 7200 

TGGACTCTCA TCAACCTTGG TTGAAAAGAT GGATGAGATG GGTATTCTTG GAAATATGCC 7260 

AGAGGATAAC CAGTTGAGTT TGTTTGATGA GTTGTTTTAA AAAATTGCTT AATAATCTAT 7320 

TAAAAGAGGC TAACGTATAT CCAATAGATT TACATTAGCT TTCTTTTTTG TTAAAATAGT 7380 

CTATGGAAAG AGGGTGAGAG TATGTCAAAG ATGAGTATAA GCATCCGTCT GGATAGTGAG 7440 

GTTAAGGAGC AGGCCCAACA GGTGTTTAGT AATCTGGGAA TGGATATGAC AACAGCTATT 7500 

AATATTTTCC TTCGTCAGGC AATTCAATAT CAGGGATTAC CTTTTGATGT TAGACTAGAC 7560 

GAAAATCGGA AGTTGCTCCA AGCGTTAACG GATTTAGACC AAAATCGTAA TATGAGCCAG 7620 

TCTTTTGAAT CAGTCTCAGA TTTGATGGAG GACTTACGTG CTTAAGATTC GTTATCATAA 7680 

ACAGTTTAAA AAAGATTTTA AGTTGGCTAT GAAGCGTGGT TTGAAGGCAG AATTATTAGA 7740 

AGAAGTTTTG AATTTTCTGG TTCAAGAAAA AGAACATCCT GCCAGAAATC GTGATCATTC 7800 

ATTGACGGCA TCCAAGCATT TTCAAGGAGT TCGTGAATGC CATACCCAGC CAGATTGGCT 7860 

TTTGGTTTAT AAAGTAGACA AGTCGGAATT GATTTTAAAT TTGCTGAGGA CAGGCAGTCA 7920 

CAGTGATTTA TTTTAATCTA TTTTAAGGGG GTTCTCATGA AACTAAGAAT ATTTGCGGAA 7980 

GATAAGCCGG CTAAGAAGGT ATTTGAATAT CAATTAGAAC TTGCTGATCG TACAATTCTT 8040 

CTATCGACAG CACTCTTGTC AGGTGCTATT GCTTTAGCAG GAATCTTTTC TGCTTTGAAA 8100 

GAAAAATAAA AATAGAAAAG AGAAAACAGA ATGGTTTTAC CAAATTTTAA AGAAAATCTA 8160 

GAAAAATATG CGAAATTGTT GGTTGCGAAC GGAATTAACG TGCAACCTGG TCACACTTTG 8220 
GCTCTCTCTA TTGATGTGGA GCAACGTGAA TTGGCACATC TAATCGTGAA AGAAGCTTAT " 8280 

GCCTTGGGTG CGCATGAGGT CATCGTTCAG TGGACAGATG ATGTGATTAA CCGTGAGAAA 8340 

TTCCTCCATG CCCCGATGGA GCGTTTGGAC AATGTGCCAG AATACAAGAT TGCTGAGATG 8400 

AACTATCTCT TGGAGAATAA GGCTAGCCGT CTTGGAGTTC GTTCATCTGA TCCAGGTGCC 8460 
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TTGAACGGAG TGGACGCTGA CAAGCTTTCA GCTTCTGCTA AAGCTATGGG ACTTGCCATG 8520 

AAGCCTATGC GTATCGCAAC TCAATCTAAC AAGGTTAGCT GGACTGTAGC AGCTGCAGCA 8580 

GGACTTGAGT GGGCTAAGAA AGTCTTCCCA AATGCTGCGA GCGACGAAGA AGCAGTTGAT 8640 

TTCCTTTGGG ACCAAATTTT CAAAACTTGC CGTGTCTACG AAGCAGATCC TGTTAAGGCT 8700 

TGGGAGGAAC ATGCAGCCAT TCTCAAGAGC AAGGCCGATA TGCTTAATAA GGAGCAATTT 8760 

TCAGCCCTTC ACTACACAGC GCCAGGAACA GATTTAACAC TTGGTTTGCC AAAGAACCAC 8820 

GTTTGGGAAT CAGCTGGTGC TGTCAATGCA CAGGGCGAAG AATTCTTGCC AAATATGCCA 8880 

ACAGAAGAGG TCTTCACAGC GCCTGACTTC CGTCGTGCAG ATGGTTATGT CACTTCTACA 8940 

AAACCGCTTA GCTACAACGG AAATATCATT GAAGGCATTA AGGTGACCTT TAAGGATGGA 9000 

CAAATCGTAG ATATCACTGC TGAGAAGGGT GATCAGGTTA TGAAAGACCT TGTCTTTGAA 9060 

AATGCGGGTG CGCGTGCCTT GGGTGAATGT GCCTTGGTAC CAGATCCAAG TCCAATTTCT 9120 

CAGTCAGGCA TTACCTTCTT TAACACCCTT TTCGATGAAA ATGCGTCAAA CCACTTGGCT 9180 

ATCGGTGCAG CCTATGCGAC TAGCGTTGTT GATGGAGCGG AGATGAGCGA AGAGGAGCTT 9240 

GAAGCTGCAG GGCTTAACCG TTCAGATGTT CACGTAGACT TTATGATTGG TTCTAACCAA 9300 

ATGGATATCG ATGGTATTCG TGAGGATGGA ACGCGGGTAC CTCTTTTCCG TAATGGGAAT 9360 

TGGGCAAATT AAGGAGATAA TATGTTAGGA AGTATGTTCG TTGGTCTCCT AGTGGGATTT 9420 

TTAGCAGGTG CTATGACCAA TCGTGGAGAG CGAATGGGAT GTTTTGGAAA AATGTTTCTC 9480 

GGTTGGATCG GAGCCTTTCT AGGTCACTTG CTCTTTGGAA CTTGGGGGCC AGTTTTATCA 9540 

GGAACAGCTA TTATCCCAGC GATTTTAGGA GCCATGATTG TTTTAGCTAT TTTTTGGAGA 9600 

CGAGGAA 9607 
(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14231 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

CTACAAGATA ATTCCAGCTA TAACATCCGC TATAATAGTA AGAGCGAGCT CTATGATAAG 60 

GCTCATTAGT TTCACCTCCT CTCACGAACC CATAGGAACG TAATCGGTAA CCGATGACAA 120 

AAATAGTATA CCACAATACA TTTAGATCAT CAAGGTCACT TAATTCTTGA AATATCAGAT 180 

CTAAGAGAAA AATCTTTAAA ATCAGAAAAA CGCATAATAT CAGGTGTGCA AAAACTTGAT 240 
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ACTATGCGTT TTATTGTGGG AAGGTTTACT CCATTTTCTC CTGAAATTGA GTTTTTGTCC 300 

AGCCTCTGTT TTTAGGGTTG CTAAGAAAAT AATGTCATGT GGTGAATATT TGTAAATCAG 360 

TCAGCAGACA GAACGATACT CTTCGAAAAT CTCTTCACAT CATGTCAGCT TCGTCTTTCC 420 

GTATATATGT GACTGACTTC ATCAGTTCTA TCTACAACCT CAAAACAGTG TTTCGAGCTG 480 

ACTTGATCAA TTTTCAAATC TGTACTTTGA GCAAGCTGAG ACTAGCTTCC TATTTGATTT 540 

TCATTGAATA TCAGAAACCC ATTCTCCATC AAATAATTCG ACTGCGTCTA ATAATTTTTG 600 

ATCTGGCACG GTGTCTGAAA TAAAGGTTGT GTATTTGGAG AGGGGATTAA TTTTAAAAAA 660 

TCCAGTCTTG TAAAATTTAG AACTATCAAT CAGTAAGATG GTTTCATGGG CTTTGTCAAT 720 

AATATTCTTT TTTGAAATAG CTTGGCTGAG AGAAGCTTCA TAAACATATT GGTCATCAAT 780 

ACCTCTTGCT GAACAAAATG CTAAATCGAT ATTAAAATGA TCTAATAAAG AATTTTCCTT 840 

ATCATAGTTG ACCACGGAAC AGGATTGATG TTTGACCTCG CCAGATGTGA TAAAGATTTT 900 

GGAGCTATCT TTAACAGTTT CAGATAGGGT TTGTGCAGTA TGTAAACCAT TTGTAAAAAT 960 

AATCAAATTA TCAAGTTCAG AAAGATAGGG ACAGAGTTCG TAGACAGTAG TACTAGAATC 1020 

TAGATAGATA CACATACCAG ACCGAATAAA GTCTTTAGCG AGACTAGCGA TTAGTCTTTT 1080 

TTGCCTAGTA CTTTCTCCTT CACGTATTTG ATGAGAAAGT TCAATTGTGT TCATAGAGGA 1140 

CAGGGTCACG TATCCGTGCT TTCTTTTGAT AAGACCTTGA TTTTCTAAGA AAATTAAATC 1200 

ACGACGTAAG GTACTTGTGC TGGAGAAAGT GATTTCTGCC AGCTCTTTTA CGGCAATTCT 1260 

TTTTTTCTTT TTGATAATTT CAATCAATTC AAGTACACGT TCATCTTTTA TCATAAGCTC 1320 

CTCCTAATTT ATCATTTCAA CTATATTATA GCACAAATTG GAGGAATTTG AATTATTTTT 1380 

ATGAATATTG GGTTAACATT TGAACATTAT TCAAGTAAGC GTTCACATAT TGAAAAAATA 1440 

AAACGTGGGG ATTATAATAA AGTTAATCmA GGACGAAGAG AGAAGAAAAA TGGAAGCGGT 1500 

TTTAGCAATA GATTTAGGTG CGACTTCTGG AAGAGCAATC GTTGGTTACC TTTCTGAAAA 1560 

TAAACTAGTA ATGGAAGAAA TAAATCGCTT TTCTAATCTA CCTATTAGAG TAAAAGGGCA 1620 

TTTATCTTGG GATATTGACT TTCTACTAGC TAAAATTCTT GAAAGTATCC GCTTGGCTAA 1680 

TACTAGTTAC AAGATTTTAT CTATCGGTAT TGACACATGG GGAGTTGATT TTGGACTGAT 1740 

TGATAATGAA GGTAAGCTGT TATTACAACC TGTTCATTAT CGTGATGAAA GAACAAAGGG" 1800 

AGTGTTAAAG GAAATATCTG AAATGACTGA ATTAGAAAAA CTGTATTCAG AGACAGGAAA 1860 

TCAGATTATG GAGATAAATA CCTTGTTTCA ACTCTTTAAG GCACGTCAAG AATCTCCTGA 1920 

CTCTTTCTAT AAGACCAATA AGATTCTTTT AATGCCAGAT TTGTTTAATT ATCTCTTGAC 1980 
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AGGTAAGTTT GCTACAGAAA AAAGCATTGC TTCAACAACT CAATTATTTG ATCCTAGGAG 2040 

TCAAAATTGG AATCAGAATA TCTTAAAACT ATTTGAATTG GATTCATCTT TACTTCCTGA 2100 

AATTGTTTCA GAGGGAAATG TTCTTGGAAG GATAAAAGAG GAGTATGGTT TAGGCGATAT 2160 

TCCTGTTGTG AATGTTTGTA GTCATGATAC AGCAAGCGCG ATTGTCTCAG TACCTAAGAC 2220 

AGAAGGTAGT TTATTTATTT CATCAGGTAC TTGGTCTTTG GTTGGAGTGG AACTTACTTC 2280 

ACCGATTCTT ACTACCGAAT CCTTCAGTTA TGGATTTACA AATGAAGTCG GTAAAGATGG 2340 

AGTGATTACA TTTCTGAAGA ATTGTACAGG GTTGTGGATC ATAGAGGAAC TAAGACGTTC 2400 

ATTTGAACGA AGAGGGAAAG CCTATTCTTT TGATGATATT AGGACAATGG TGGAGAAAGA 2460 

AAAAGAAAAT CTTCCTCTGA TTGATACTGA ATCAACTGAA TTTGCAACAG AATCTGATAT 2520 

GCACAAGACT TTGACAGAAT ATCTAGCTTA TCATCATGAA ACTAGAGAGT GGACAGATGG 2580 

ACAACTATTT AAGATTGTTT ATGAAAGCCT AGCTGAAACG TATAGGAAAG CGATAGAGTT 2640 

ACTAGAAGAA CTAACTCATA AGGTTTATAA GAGGATATAT GTGATTGGAG GAGGTGCTAG 2700 

AGCCAGTTAC TTTAACCAAA TGATTGCTGA TAGAACTGGT AAAGAGGTTC TTACAGGTTT 2760 

GACTGAGGCT ACAGCTGTGG GGAATATTGT TGTGCAGCTC ATAGCTATGG GACAATTAAA 2820 

AGGGATGGAA GAGGCTCACC ATGTTATTGA GGAGTTTCTA CAATTAGAGA GTTATTACTC 2880 

CCAAAAGAAT TAAAAAGATT GAGAGTTTGT AAATTTGCCT CCCTCCCCCT TCTTAGCTTT 2940 

TGTGCAGGAA GGGGGGATAA TTGGTGAATT GAAAAATATT TAGTGTTTTG ATATGAGGAG 3000 

GACAAGGATG TCAGATGTAA AACAAGAATT AATTAAATAT GGTAAGAAGC TAGTAGAAAC 3060 

AGATTTGACG AAAGGAACAG GTGGGAATCT CAGCGTTTTC GATCGTGAAA AACAATTGAT 3120 

GGCAATTACC CCGTCGGGTA TTGATTTCTT TGAAATCAAA GAATCCGATA TTGTAGTGAT 3180 

GGATATTAAT GGAAATGTTG TAGAGGGAGA ACGCTTGCCA TCTAGCGAAT GGTATATGCA 3240 

TTTGATTCAA TATCAAACTC GTGATGATAT CGATGCAATT ATCCATGCTC ATACAACTTA 3300 

TGCAACAGTA TTAGCTTGTC TCAGAGAACC ACTTCCAGCG AGTCATTATA TGATTGCAGT 3360 

GGCAGGGAAA GATGTTCGGG TAGCTGAGTA TGCAACATAT GGCACGAAAG AATTGGCTGT 3420 

GAATGCAGCT AAAGCAATGG AAGGTCGTAG AGCAGTTTTA CTAGCGAATC ATGGAATTTT 3480 
AGCAGGTGCA- CAAAATTTAT TGAATGCATT TAATATTGTT GAAGAAGTTG AATATTGTGC . 3540 

AAAAATTTAT TGTTTAGCTA AGAATTTTGG AGAGCCAGTA GTTCTTCCTG ATGAGGAGAT 3600 

GGAATTGATG GCAGAAAAAT TTAAAACATA CGGTCAGAGA AAATAGGGAG GATATTAATG 3660 

TTAAAACATA TACCGAAAAA TATTTCTCCA GATTTATTGA AGACTTTAAT GGAAATGGGA 3720 

CATGGAGATG AAATAGTATT AGCTGACGCG AATTATCCTT CTGCCTCATG TGCAAATAAG 3780 
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CTAATTCGTT GTGATGGTGT AAATATTCCA GAATTATTAG ATTCCATTCT GTATTTAATG 3840 

CCATTAGATA GTTACGTCGA TAGTTCAATT CAGTTTATGA ACGTTGTTTC GGGTGATGAT 3900 

ATTCCTAAGA TATGGGGTAC CTATAGACAG ATGATTGAAG GTCATGGTAC AGATCTTAAA 3960 

ACGATTACTT ATCTTAGAAG AGAAGACTTT TATGAACGTA GTAAGAAAGC TTATGCTATT 4020 

GTTGCTACAG GAGAAACTTC ACTTTATGCT AATATTATCC TTAAGAAAGG AGTAGTTGTT 4080 

GAAAGAGAAA ATGTTCAATA GAGGAATTTT AGTTGCCAGT CATGGTAATT TTGCTAGCGG 4140 

AGCTCTCATG ACCGCAGAAA TGTTTGTTGG TGAGACAACA AATGATAGAG TTAGGACATT 4200 

AGGTTTGATG CCTGGAGAGA ATATTGTAGA GTTTGAGCAT TATTTTAAAA ATCAAGTGGA 4260 

TGAACTGTTA GACTCAAATC AAGAGGTTAT CGTTTTGACT GACTTGATTG GAGGAAGTCC 4320 

TAATAATGTG GCTTTGTCAC GGTTTTTAAA TTTGGATTCA GTTGATATTG TAACAGGGTT 4380 

TAATATCCCT CTCCTAGTGG AATTAATATC AAGTTATGAT TCAAAAATCA ATTTAGAAGA 4440 

AATTGTTCAC AATGCTCAAA ATAGTTTGTT TAATGTTAAA CAACAACTTA ACGTAGAGGA 4500 

GGAAGAAGAT TTATGTCTAT AGAGTTTGTT CGTATTGATG ACCGTCTGGT ACATGGTCAA 4560 

GTTGTCACTA CGTGGCTAAA AAAGTATGAT ATTGAGCAAG TTATCATTGT TAATGATCGC 4620 

ATCTCAGAAG ATAAAACACG ACAATCTATT TTAAAGATTT CTGCACCGGT AGGTTTAAAA 4680 

ATTGTTTTCT TTAGTGTAAA ACGGTTTGTG GAAGTTTTAA ACTCTGTGCC AATAAAAAAG 4740 

AGAACAATGC TGATATATAC AAATCCAAAA GATGTGTATG ATTCTATTGA AGGAAATTTA 4800 

AAATTGGAGT ACCTCAATGT AGGACAGATG AGTAAAACGG AGGAAAATGA AAAGGTAACG 4860 

GGAGGTGTAG CTCTAGGTGA AGAAGACAAA TATTATTTTA AGAAAATAGT TGATAAGGGA 4920 

ACGAGAGTTG AAATTCAAAT GGTTCCTAAT GATAAAGTTA CAATGTTGGA AAAATTTTTA 4980 

TAAAAATAAT TTAAGGAGGT ACAGTATATG CTATTCACAC AAGCATTACT GGTGACATTA 5040 

GTTGGGATTA TTGCCACTAT TGACTATAAT GGACCGTTAT TTATGATTCA CCGTCCGTTA 5100 

GTTACAAGTG CAATGGTTGG CTTAGTATTA GGAGATTTCA CCCAAGGTGT TCTTATTGGT 5160 

TCAGCTCTTG AATTAACTTG GCTCGGTGTA ACAGGTATTG GAGGTTATAC TCCACCAGAT 5220 

ACTATTTCAG GTGCGATTAT TGGTACTGCA TTTGGTATTT TATCTGGTCA AGGAGAAACT 5280 

GCTGGTATCG CTATAGCAGT TCCAATTGCA GTTGCTACCC AACAGTTGGA TGTTCTTGCA " 5340 

AAAACTTTAG ATGTTTATTT TGTGAAAAAA GCTGATAATG ATGCTAAAAA CGGAGATTAT 5400 

TCAAAGATCG GTTTTTATCA TTATTCAAGT TTGGTTTTAA TCACGTTATT TAAAATTGTA 5460 

CCAATTTTCC TAGCTATTAT GCTTGGAGGG GAATATGTGG CAGACTTGTT TGCTAAGGTT 5520 
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CCACCAATCG TTATGCAGGG ACTTAACTCT GCAGGTGCTT TACTACCTTC AATTGGTTTT 5580 

GGTATGCTTT TAAATATGAT GCTCAAGAAA AATATGTGGG TATTCTTGTT GATTGGATTC 5640 

ATTTGTTCTG TGTATGGAGG AATGTCAACC ATTGGGATCT CACTAGTTGG TATTGCGGTA 5700 

GCATACTTCT ACGATATGAT TGGAAGCAAA CCACAAGAAA CAACTTCAAG TAGTGATGTT 5760 

GAGGAGGATC TTGATCTATG ATGAATAATA AAGTAACTAA AGTTGAACTT AAAAAAGTTT 5820 

TCAAACGAAG TTTTATGTAT GGTTCTTCAT GGAACTATGA GAGAATGCAG AACCTAGGTT 5880 

TTCTATATAC AATTCTTCCA GTATTGAAAA AACTATACCC AGACAAAGAT TCAGCTTCTC 5940 

CTGCAATGAA ACGTCACCTT GAGTTTTTCA ATACTCATCA AACAGCGGCA CCATTTATTC 6000 

TTGGAGTTAC TTCCGCTATG GAAGAACAAG AAGGAAATGA AGGTGCAGCT TCAATTACTG 6060 

GTATTAAAGT TGGCTTGATG GGGCCACTGG CTGGTCTAGG AGATAGTTTG TTCTGGCTGA 6120 

CACTAGTTCC TATCTGTTTT AGTATTGGTG CGTCTTATTC TAAAGACGGC GGTGCTTTAG 6180 

GTATCTTTAT CGCCTTAATA TTGTTTAATA TTATTAATAT TCCTGTTAAA TATTTCGGTT 6240 

TGAAATATGG GTATACTAAG GGTTCTAGTC TTATCCAAGA AAATAATACA AAAGGAACAT 6300 

TGAATCGCGT TACGAGTATG GCGACAGCAT TAGGGCTAGT ACTAGTGGGT GGTTTGATTC 6360 

CATCAATGGT TGGTATTAAT TTTGGATTAG AATTTAAGCA GGGGGAACTT GTTATTTCTG 6420 

TTCAAGAAAT GATTACAAAA TTAATTCCAG GATTTATCCC TATGGCTTTG ACTTTATTAA 6480 

TGTGTAAATT AATTAGAAAA GGAAAGAATC CGGTTGTACT AATCTTTAGT GTTATGGCTA 6540 

TTGGAGTTAT TCTAGTTGTT TTAGGAATTT TGAAGTAGTA GAAAGTGTGG AGGTGGTATT 6600 

TGGGATATCA CCTCCATTTT GGAAGAGAGG TAAAGAGTGA AATTATGGTA TAAGAAAGCT 6660 

GCCGCAAATT GGAATGAAGC CTTGCCGATT GGGAACGGTC ATTTAGGTGG TATGATTTAT 6720 

GGTTCAGCTA CAAAAGAATG TATTCAACTA AACGATGAGA CTATTTGGTA TAGAGGAAAG 6780 

TCAGATAGAA ATAATCCAGA CTCACTATTG CATCTTAAAA AAATTCGGGA ATATCTTTTA 6840 

GATGGAGAAA TTCAGAAAGC CGAAGAATTG ATAAAGTTAA CAGTGTTTGC TACCCCAAGA 6900 

GATCAAAGCC ACTATGAATT ACTTGGGGAA CTTTACATTG AGCATATAGA TATTCAGTCT 6960 

TGTGCTCTTT CATTGTATGA AAGAGAGCTA GATTTAGATA CAGCTATTTC TAATGTTGTG 7020 

TTTGAGCCTA ATAGTTGTAA TTTACAAATA AAAAGAGAAT ATTTTACGAG TTTTAATAAG 7080 

AATATTTTAT GTTGCCGTAT AGTGTCATCA GTTCAAAACA CATTAAATTT AAACATTAAT 7140 

TTGGGTAGAA ATAAACGGTT TAATGACGAA GTATCTAAAC TGGATTCAAG TACAATTTTA 7200 

ATGTCGGCCT CTGCTGGAGG TAGAAAAGGT GTTCAGTTTA AAGTAGTATG TCATTCTAAG 7260 

GTTACGGATG GTGAAGTAAG TGTATTGGGA GAGACAATAG TTATTCGGAA TGCTACAGAG 7320 
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GTATTTCTTT ATCTCAAATC AATGACGGAT TATTGGGGAA AT AT AG AT AT TTCTTCTCTT 7380 

CAGGGAGAAT TTAGTAGTAT TGATTACTTT ACAGAAAAAG ATGAACATGT AAAAAAATAT 7440 

CAGGAGCAAT TTAATAGAGT TGATTTTAAA CTAGACTATA GTAAAGGTTG TCTTAGCATT 7500 

CCAACGAATC TACTTCTTGA AAACACTAAA AAGTATAGTA ACTACTTGAC TAACTTGTTA 7560 

TTTCATTATG GAAGATATCT GTTAATATCG TCTAGTCAAC CGAATGGTTT ACCTGCCAAT 7620 

CTTCAAGGAA TATGGTGTGA TGAATTAAAT CCAATTTGGG GTTCTAAATA TACGATTAAT 7680 

ATTAATACTC AAATGAATTA TTGGATGGTA GGTCCATGTG ATTTACCAGA AGTAGAATAT 7740 

CCATTATTTG ATATGCTCGA AAGAATGAGA GAACCGGGAA GACTAACCGC TAAGAAAATG 7800 

TATGGAGCTA GAGGTTTTAC AGCACATCAT AATACGGATG GTTTTGGCGA TACGGCTCCC 7860 

CAATCTCATG CCATGGGGGC TGCAATTTGG GTATTAACTA TTCCATGGTT ATGTACTCAT 7920 

ATTTGGGAAC ACTATTTATA TTTCCAAGAT GAGCGTATTC TTACGGAACA TTTTGAAATG 7980 

ATAAAAGAAG CATTTCTTTT CTTTGAAGAT TATTTATTTG AGGTGGATGG CTACTTGATG 8040 

ACAGGTCCAA GTGTCTCACC GGAAAATAAA TATCGCTTAA AAAATGGTAT TGAAGGAAAT 8100 

GCTTGTCTAT CATCTACAAT TGATAATCAA ATTCTAAGAT ATTTTTGTGA TTCATGCATT 8160 

GGCATTGCAA AACAATTAGG AGACAATTCG GATTTTATTA GTCGTGTGAA GGAGTTAAAA 8220 

AAGAAACTAC CTAAAACAAA AATAGGTAGT AATGGGCAAA TCCAAGAATG GTTAGAAGAT 8280 

TATGAAGAAG TAGAGCCTGG GCATAGACAC ATTTCACCTC TATTTGGGCT TTATCCTTAT 8340 

AATGAGATTG ATATTCATAA AACTCCGGAA TTAGCAGAAG CAGCTAAAAT CACTATCAAT 8400 

AGGAGATTAT CAAACGCTAA TTTTTTATCT TCACAGGAGA GGGAGCAAGC GATTAATAAT 84 60 

TGGTTAGTAA GTGGTTTGCA TGCTAGTACA CAAACAGGTT GGAGTGCTGC ATGGCTGATT 8520 

CATTTTTTTG CGAGACTATA TCAAGGTGAA CCTGCTTATA ACCAGATTAA TGGTTTGTTA 8580 

AATAATGCGA CTCTTGGCAA TTTATTTCTT GACCATCCAC CATTTCAAAT TGATGGTAAT 8640 

TTAGGTTTGG TGAGTGGAAT TTGTGAATTA TTAGTACAGA GCCATCATAA TTGGTTATCA 8700 

CTAATTCCAG CTTTACCTTC TGCTTGGTCA GAAGGAGAAG TGAAAGGTTT CAGAGTAAGA 8760 

GGAGGATATA AGGTATCGTT TGCTTGGAAA AATGGGGATA TAACATTCCT AAAATTGGAA 8820 

GGAGGAAACA AAGATCAAAA AGTAAGAGTA AGAATATATG GCAAAAATAC TGATGTACAA 8880 

AATATTGAAT TGGTATTTAA TTCAGAAAAA ATTATTGAGT TAAATTTTTA GGTATAAGTC 8940 

ATGAATAAAG AAAAAATAAA AAGAAAATTA ATCACAATAT TGTTTGTATG TATTGGGATG 9000 

TTATGTTTTG GATTGTTAGC AGGAGTTAAG GCTGATAATC GTGTTCAAAT GAGAACGACG 9060 
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ATTAATAATG 


AATCGCCATT 


GTTGCTTTCT 


CCGTTGTATG 


GCAATGATAA 


TGGTAACGGA 


9120 


TTATGGTGGG 


GGAACACATT 


GAAGGGAGCA 


TGGGAAGCTA 


TTCCTGAAGA 


TGTAAAGCCA 


9180 


TATGCAGCGA 


TTGAACTTCA 


TCCTGCAAAA 


GTCTGTAAAC 


CAACAAGTTG 


TATTCCACGA 


9240 


GATACGAAAG 


AATTGAGAGA 


ATGGTATGTC 


AAGATGTTGG 


AGGAAGCTCA 


AAGTCTAAAC 


9300 


ATTCCAGTTT 


TCTTGGTTAT 


TATGTCGGCT 


GGAGAGCGTA ATACAGTTCC 


TCCAGAGTGG 


9360 


TTAGATGAAC 


AATTCCAAAA 


GTATAGTGTG 


TTAAAAGGTG 


TTTTAAATAT 


TGAGAATTAT 


9420 


TGGATTTACA 


ATAACCAGTT 


AGCTCCGCAT 


AGTGCTAAAT 


ATTTGGAAGT 


TTGTGCCAAA 


9480 


TATGGAGCGC 


ATTTTATCTG 


GCATGATCAT 


GAAAAATGGT 


TCTGGGAAAC 


TATTATGAAT 


9540 


GATCCGACAT 


TCTTTGAAGC 


GAGTCAAAAA 


TATCATAAAA 


ATTTGGTGTT 


GGCAACTAAA 


9600 


AATACGCCAA 


TAAGAGATGA 


TGCGGGTACA 


GATTCTATCG 


TTAGTGGATT 


TTGGTTGAGT 


9660 


GGCTTATGTG 


ATAACTGGGG 


CTCATCAACA 


GATACATGGA 


AATGGTGGGA 


AAAACATTAT 


9720 


ACAAACACAT 


TTGAAACTGG 


AAGAGCTAGG 


GATATGAGAT 


CCTATGCATC 


GG AACC AG A A 


9780 


TCAATGATTG 


CTATGGAAAT 


GATGAATGTA 


TATACTGGGG 


GAGGCACAGT 


TTATAATTTC 


9840 


GAATGTGCCG 


CGTATACATT 


TATGACAAAT 


GATGTACCAA 


CTCCAGCATT 


TACTAAAGGT 


9900 


ATTATTCCTT 


TCTTTAGACA 


TGCTATACAA 


AATCCAGCTC 


CAAGTAAGGA 


AGAAGTTGTA 


9960 


AATAGAACAA 


AAGCTGTATT 


TTGGAATGGA 


GAAGGTAGGA 


TTAGTTCATT 


AAACGGATTT 


10020 


TATCAAGGAC 


TTTATTCGAA 


TGATGAAACA 


ATGCCTTTAT 


ATAATAATGG 


GAGATATCAT 


10080 


ATTCTTCCTG 


TAATACATGA 


GAAAATTGAT 


AAGGAAAAGA 


TTTCATCTAT 


ATTCCCTAAT 


10140 


GCAAAAATTT 


TGACTAAAAA 


TAGTGAGGAA 


TTGTCTAGTA 


AAGTCAACTA 


TTTAAACTCG 


10200 


CTTTATCCAA 


AACTTTATGA 


AGGAGATGGG 


TATGCTCAGC 


GTGTAGGTAA 


TTCCTGGTAT 


10260 


ATTTATAATA 


GTAATGCTAA 


TATCAATAAA 


AATCAGCAAG 


TAATGTTGCC 


TATGTATACT 


10320 


AATAATACAA 


AGTCGTTATC 


GTTAGATTTG 


ACGCCACATA 


CTTACGCTGT 


TGTTAAAGAA 


10380 


AATCCAAATA 


ATTTACATAT 


TTTATTGAAT 


AATTACAGGA 


CAGATAAGAC 


AGCTATGTGG 


10440 


GCATTATCAG 


GAAATTTTGA 


TGCATCAAAA 


AGTTGGAAGA 


AAGAAGAATT 


AGAGTTAGCG 


10500 


AACTGGATAA 


GCAAAAATTA 


TTCCATCAAT 


CCTGTAGATA 


ATGACTTTAG 


GACAACAACA 


10560 


CTTACATTAA AAGGGCATAC 


TGGTCATAAA CCTCAGATAA ATATAAGTGG 


CGATAAAAAT . 


10620 


CATTATACTT 


ATACAGAAAA 


TTGGGATGAG 


AATACCCATG 


TTTATACCAT 


TACGGTTAAT 


10680 


CATAATGGAA 


TGGTAGAGAT 


GTCTATAAAT 


ACTGAGGGGA 


CAGGTCCAGT 


CTCTTTCCCA 


10740 


ACACCAGATA AATTTAATGA 


TGGTAATTTG 


AATATAGCAT 


ATGCAAAACC 


AACAACACAA 


10800 


AGTTCTGTAG 


ATTACAATGG 


AGACCCTAAT 


AGAGCTGTGG 


ATGGTAACAG 


AAATGGTAAT 


10860 



/ 
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TTTAACTCTG GTTCGGTAAC ACACACTAGG GCAGATAATC CCTCTTGGTG GGAAGTCGAT 10920 

TTGAAAAAAA TGGATAAAGT TGGGCTTGTT AAAATTTATA ATCGCACAGA TGCTGAGACT 10990 

CAACGTCTAT CTAATTTTGA TGTGATTCTA TATGACAATA ATAGAAACGA AGTTGCTAAG 11040 

AAACATGTTA ATAATTTGTC GGGTGAATCT GTTAGTCTAG ATTTCAAAGA AAAAGGAGCA 11100 

AGGTATATTA AAGTTAAATT ACTAACGAGT GGAGTGCCTT TGAGTTTAGC AGAAGTAGAG 11160 

GTTTTTAGAG AATCAGATGG TAAGCAATCT GAAGAGGATA TAGATAAAAT AACAGAAGAT 11220 

AAAGTAGTCT CTACAAATAA GGTAGCTACT CAAAGTTCAA CCAATTATGA GGGTGTAGCT 11280 

GCTTTAGCAG TTGATGGTAA TAAAGATGGA GATTACGGAC ATCATTCGGT GACTCATACT 11340 

AAGGCAGATT CTAACGCTTG GTGGCAGGTC GATCTGGGAG AAGAGTTTAC GGTTTCTAAA 11400 

GTTGATATTT ATAATAGAAC AGATGCCGAA CCTCAGCGTT TATCTAATTT TGATGTTATT 11460 

TTTCTATCTT CATCAGGAGA AGAAGTTTTT AGAAGACATT TTGATAAAGT AGTTGATGGT 11520 

TTGTTATCTT TAAAAGTACC TTCTGTAGGG GCTAAGCTAG TCAAAATAGA ATTAAAATCA 11580 

GCAGCTATTC CGTTAAGTTT AGCGGAAGTT GAAGTCTATG GTTCAAAGAG AACTCCGAAG 11640 

AAACTTTCTA ATATTGCATT AACAAAAGAA ACTCGACAGA GTTCAACGGA TTACAATGGT 11700 

TTTTCTCGTC TAGCAGTTGA TGGAAATAAA AACGGAGATT ATGGTCATCA TTCAGTGACT 11760 

CATACCAAAG AAGATTCTCC TTCATGGTGG GAGATAGATT TAGCACAAAC CGAAGAATTA 11820 

GAAAAGTTAA TTATTTATAA TAGAACAGAT GCTGAAATTC AGAGATTATC AAATTTTGAT 11880 

ATTATTATAT ATGATTCAAA TGATTATGAA GTTTTTACAC AACATATTGA CAGTTTAGAA 11940 

AGCAATAATC TATCCATAGA CTTAAAAGGA CTGAAGGGAA AAAAGGTTAG AATTTCTTTG 12000 

AGAAGCGCAG GAATTCCTTT AAGTTTAGCA GAGGTAGAGG TTTATACTTA TAAGTAATTT 12060 

TAAAAATTAT CACCCAGGCT ACCGTAAATA TAATGGAGAT GGTAGTATGA AAGAAACAGA 12120 

AAAATAAGAG GAAAATAGTA TGATTCAACA TCCACGTATT GGGATTCGTC CGACTATTGA 12180 

TGGTCGTCGT CAAGGTGTAC GCGAATCACT TGAAGTGCAA ACAATGAACA TGGCTAAAAG 12240 

TGTGGCAGAT TTGATTTCAA GCACATTGAA ATATCCAGAT GGGGAACCTG TGGAATGCGT 12300 

GATTTCTCCA TCTACTATTG GCCGTGTACC AGAGGCTGCA GCTTCCCATG AGTTGTTTAA 12360 

AAAATCAAAT GTTTGCGCAA CAATTACAGT TACACCATGC TGGTGTTATG GTAGTGAAAC - 12420 

TATGGATATG TCTCCAGATA TTCCTCATGC TATTTGGGGA TTTAATGGGA CAGAACGCCC 12480 

AGGAGCTGTC TATCTTGCAG CTGTACTAGC TTCACATGCT CAAAAAGGGA TTCCAGCCTT 12540 

TGGGATTTAT GGAAGAGATG TTCAGGAAGC TAGTGACACA GATATTCCAG AAGATGTCAA 12600 
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AGAAAAACTT TTACGCTATG CGCGTGCAGC TCTTGCAACT GGCTTGATGA GAGACACTGC 12 660 

TTACCTATCA ATGGGTAGTG TTTCGATGGG GATTGGTGGT TCTATTGTAA ATCCGGATTT 12720 

CTTCCAAGAA TACTTAGGAA TGCGAAATGA ATCGGTAGAT ATGACGGAGT TCACGCGCCG 12780 

TATGGACCGT GGTATTTACG ACCCTGAAGA GTTCGAACGT GCGCTCAAAT GGGTGAAAGA 12840 

AAACGTAAAA GAAGGATTCG ACCATAACCG TGAAGACCTT GTTTTAAGCC GTGAAGAAAA 12900 

AGATAGACAA TGGGAATTTG TTATTAAGAT GTTCATGATT GGACGTGACT TAATGGTTGG 12960 

TAACCCAAGA CTTGCTGAAC TTGGTTTTGA GGAAGAAGCG GTTGGTCACC ATGCTTTAGT 13020 

AGCTGGTTTC CAAGGTCAAC GTCAGTGGAC AGACCATTTT CCAAATGGGG ACTTTATGGA 13080 

AACTTTCCTC AATACTCAGT TTGACTGGAA TGGTATTCGA AAACCATTTG TATTTGCGAC 13140 

AGAGAATGAT TCACTAAATG GTGTGTCTAT GCTCTTTAAT TATCTATTAA CAAATACTCC 13200 

ACAAATCTTT GCTGATGTGC GTACTTATTG GAGCCCAGAG GCTGTTAAAC GTGTAACGGG 13260 

ACATACTTTA GAGGGTCGTG CTGCAGCTGG CTTCTTACAT CTAATCAACT CTGGTTCTTG 13320 

TACATTGGAT GGTACAGGTC AAGCTACTCG AGATGGCAAA CCTATTATGA AACCATTCTG 13380 

GGAGTTGGAA GAAAGTGAAG TGCAGGCTAT GCTTGAAAAT ACAGACTTCC CACCAGCAAA 13440 

CCGCGAATAC TTCCGTGGAG GAGGATTCTC AACTCGTTTC TTGACGAAGG GGGATATGCC 13500 

AGTAACAATG GTACGTCTCA ATCTTCTAAA AGGGGTTGGT CCAGTGCTAC AAATTGCAGA 13560 

AGGTTACACA CTTGAACTTC CTGAAGATGT TCACCATACT TTAGATAATC GTACAGATCC 13620 

AGGATGGCCA ACTACTTGGT TTGCTCCACG TTTGACAGGA AAAGGTGCTT TCAAGTCTGT 13 680 

CTATGACGTC ATGAATAATT GGGGAGCTAA TCACGGAGCC ATAACATATG GACACATTGG 13740 

AGCAGACTTG ATTACCTTGG CTTCTATGTT GAGAATTCCT GTCAATATGC ATAATGTACC 13800 

TGAGGAAGAT ATCTTTAGAC CTAAAAATTG GTCCTTATTT GGAACAGAAG ATCTAGAATC 13860 

AGCAGACTAT CGTGCATGTC AGTTGTTGGG GCCACTACAT AAATAAAACT TGTTTATATA 13920 

GGAGGTGAAC TTACGTCCCT CCTATCCTTT TAAAAAGATT TGTTAAACAA TTCACAAATA 13980 

ATTGAAAACG AATACAAAAA GTAATATAAT GATGTTAAAT AGATAGCGCG GAGGCGCAGG 14040 

AGGAAAATTA TATGGCTATA TTTTATGTTC CGGCAGTCAA CCTTATTGGA AAAGGTGTTG 14100 

TAAATGAAGT GGGTCCTTAT ATCAAGGAAC TTGGCTATAA AAAGGCACTT TTGGTGACAG 14160 

ATAAGTACAT CGAAGGCAGT GATATTTTAC CTAAGACTTT AAAACCACTG GATACAGAAG 14220 

GAATCGAATA T 14231 
(2) INFORMATION FOR SEQ ID NO: 82: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 16995 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 



AGTTCTCTTA 


ACTTTTTTAG 


GATGGCATTC 


TCCGCTCTCA 


GGTACTCATT 


TTCTGCTgAA 


60 


GACGTTCTAA 


TTCTGTCCTC 


TCTTCAGGTC 


TCGTTTTTGG 


CTTACGTCCC 


ATTTTAGGTA 


120 


CTCTCCCTCT 


TGTTTTCTCA 


ACAATAGTAT 


ACCCGTTTTT 


CCTGTATTGT 


GCTAGCCAGT 


180 


TAAGAAGTAT 


CGTACGACTT 


GGGAGACCGT 


ATTCAAGAGA 


AACTCTATCT 


TTAGTCCAGC 


240 


CTTCATGTCA 


GACTTTATTA 


CTCATTTCTT 


GTTTTAAATC 


AGGAGAATAG 


TAACGATTTT 


300 


TTCCTTTTTT 


GACGAACTCT 


ATTCCGTAAC 


GATCAATCAA 


TTTAATCATG 


TACCTAATAT 


360 


TAGAATTGCT 


TATCCCAAAT 


TTATTTGAAA 


GCTTCTCTAA 


GCTATATCCT 


TGTTTTCTAA 


420 


GTTCATAGAT 


CTGAACTTTA 


TCATCATAAG 


TTAGTTTCAT 


AATAAAAACA 


CCCCAAAAGT 


480 


TAGATTTTTT 


CTGTCTAACT 


TTTGGGGTGT 


AGTTCATGTA 


CACCTGATAT 


GATGCGTTTT 


540 


ATAATTTTTA 


AGCCTTTTTG 


CCCAGCCTCG 


TCAAAAGTAA 


TGTTTTGACA 


CAAAATCTGT 


600 


GACAAAACTT 


TAGTTTTAAA 


GGTTTTTAAC 


TTTGTATATA 


CTAGTTTTAA 


GAAAAGGAGG 


660 


ATGATCTAAT 


GGAAGAAAAA 


GTATCATTGA AAGTCAGGGT 


TCAAAAACTA 


GGGACATCGC 


720 


TTTCAAATAT 


GGTTATGCCC 


AATATTGGAG 


CATTTATTGC 


TTGGGGAGTA 


TTGACTGCCC 


780 


TCTTTATCGC 


TGATGGCTAT 


CTGCCAAATG 


AACAGTTAGC 


TACTGTTGTT 


GGTCCTATGT 


840 


TAACGTATTT 


ATTGCCAATC 


CTGATTGGTT 


ACACAGGTGG 


ATATATGATC 


CATGGCCAAC 


900 


GTGGTGCCGT 


TGTAGGAGCT 


ATTGCTACTG 


TTGGTGCAAT 


CACAGGTTCT 


AGTGTTCCTA 


960 


TGTTTATCGG 


AGCTATGGTA 


ATGGGCCCAC 


TGGGAGGATG 


GACTATCAAG 


AAATTTGATG 


1020 


AGAAGTTCCA 


GGAAAAAATT 


CGTCCCGGAT 


TTGAAATGTT 


AGTTAATAAC 


TTCTCAGCTG 


1080 


GTCTCGTTGG 


TTTTGCATTA 


TTGCTTTTGG 


CTTTCTACGC 


AATCGGTCCA 


GTCGTATCGA 


1140 


CTCTTACTGG 


AGCTGTTGGG 


AATGGTGTTG 


AGGCTATTGT 


CAATGCTCGC 


CTCCTTCCTA 


1200 


TGGCTAATAT 


TATCATCGAA 


CCGGCTAAAG 


TCCTTTTCCT 


CAATAATGCC 


CTCAATCATG 


1260 


GCATTTTTAC 


TCCTCTGGGA 


GTAGAACAGG 


TAGCTCAAGC 


TGGTAAGTCA 


ATTCTCTTCC 


1320 


TATTGGAAGC 


TAATCCTGGA 


CCAGGTCTGG 


GAATTCTATT 


AGCTTATGCT 


GTATTCGGTA 


1380 


AAGGTTCTGC 


TAAATCTTCT 


TCTTGGGGGG 


CAATGGTTAT 


TCATTTCTTC 


GGAGGGATTC 


1440 


ATGAAATTTA 


CTTTCCTTAT 


GTTATGATGA 


AGCCTACTCT 


ATTTTTAGCT 


GCTATGGCAG 


1500 



WO 98/18931 



PCT/US97/19588 



652 

GAGGTATCTC TGGAACTTTT ACTTTTCAAC TCTTAGACGC TGGTCTTAAA TCTCCAGCTT 1560 

CACCAGGTTC TATTATTGCG ATTATAGCTA CGGCGCCAAA AGGTGTTTGG CCCCATCTAA 1620 

ATGTTCTTTT AGGTGTTTTA GTGGCAGCAG TTGTTTCTTT CCTTGTAGCA GCCCTTATTC 1680 

TTCATGCAGA CAAGTCAACT GAGGATTCGC TCGAAGCTGC TCAGGCGGCT ACCCAAGCAG 1740 

CTAAGGCTCA GTCTAAAGGT CAGTTAGTAT CAACTTCTGT TGATGCAGTT GTTTCGACAG 1800 

ACTCAGTGGA AAAAATCATT TTCGCCTGCG ATGCTGGTAT GGGAAGCTCT GCTATGGGAG 1860 

" CTAGTATTCT TCGAGATAAG GTTAAAAAAG CAGGTCTAGA GATTCCAGTA TCTAATCAGG 1920 

CAATCTCAAA TTTGCTTGAT ACACCAAAAA CATTAATTGT TACTCAGGAA GAACTGACAC 1980 

CAAGAGCTAA AGACAAGAGT CCAAGTGCTA TTCATGTTTC TGTTGATAAT TTCTTAGCGT 2040 

CCTCTCGTTA TGATGAAATT GTAGCTTCAT TAACAGGAGC TTCTCCAATA GCAGAAATTG 2100 

AAGGAGATAT ACCAACTTCA GCACCAGTAG ATAGTCAGGA AAGTGACCTT AACCATATTG 2160 

ATGCTGTAGT AGTTGCTTAT GGTAAAGCAC AGGGAACTGC AACTATGGGC TGTGAAACGA 2220 

TTCGGGCTAT TTTTAGAAAC AAGAATATTC GTATTCCAGT TTCTACTGCC AAAATTTCAG 2280 

AATTAGGTGA ATTTAATTCT AAAAACATAA TGATTGTAAC AACTATTTCT TTACAGGCAG 2340 

AAGTGCAGCA AGCAGCACCG AATTCTCAAT TTCTTATTGT GGATAGTTTA GTAACAACAC 2400 

CAGAATATGA CAAAATGGCT GCTAGAATGT ACAAATAGAA CTAGAGGTTT CTAAATTACG 2460 

AATGCTATTA ACCAAACGAG AAGAACAATT ATTGAAGGCT TTCCTACATG TAGGGAAGCT 2520 

TTCAATGCAA GATATGACTG AAATCTTACA GGTTTCATCT AGAACAATTT ATCGAACTTT 2580 

ATCAGATTTG ACAGATAGCA TGGAGCAATA TGGAATCGAA ATAACGAAGC ATGGGAAATA 2640 

CTATATTTTG ACTGGAGAGT TGGATGATTT GCCGACAGAA CTTGAAGTGT TAGTTGAGTA 2700 

TAGTCCCCAA GAAAGACAAG AGTTGATTAC CTATCGCCTT CTGACTGAGA GTGGTTTTGT 2760 

CACCAATGAA GCATTGCAAG AGTGCACGAA AGTCAGTAAT GTAACTATTA TTCAGGATAT 2820 

TTCAGATATT GATAAGCGTC TTTTAGACTT TGATCTGAAA ATTGAACGAC AAAAAGGTTA 2880 

TCGGATTTCT GGTGATTCAG TTGGTAAGAG AAGATTTTTG GCTATTTTAC TGACAAACTG 2940 

TATCTCAGTA GCAGATTTTT CAACCGGTAA TTTTGGGAGC TTTGATATTT TAGAAGCAGA 3000 

TAGAACTGGG CTGGCCAGTC AGATTGTTAA TAAGCAACTG TCAGGTTTTC CAGATATGGA 3060 

TGCTAGGATG AAGATGTTTT TTGCGATCTT GTTATCTCTT ATAGGTCAGG AGCAAAACAT 3120 

TGAAAATTCA CCTAATACTA GTAAGCAGGC TTTGGAAATT TCTCAAAAAA TTTTTCAAGC 3180 

TTACTCTAAG CAGACTGCAC AATTTTATAG TATTCAGGAA ATTATCTATT TTGCGAGCAT 3240 

CTTGGATGAA TTAATCATTA AACGTCAGGA CAATCCGCTC TTTACGGAGA AATTTGATGG 3300 
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TGAATTTTTC 
CTTTTTTAAG 
CGTCCCTATC 
GAATAAATTT 
TCATACAGAG 
.TCCAGAGATT 
TTTATTAGTC 
TCTAGTAGAT 
GCTGACTAAT 
ATTACAGGAA 
CGCTCCAGAG 
TCAATTCGAG 
CATCCAATAT 
CTTCCAGAAT 
TAGCAAAGTA 
GTCAATGAAA 
AGCTAGCGAG 
TCTTTATACA 
TATTTTTAAC 
TTAAGCTTAA 
TTTATGAGGG 
AGCTATCTGT 
ATGATGTCCT 
ATGTATCTAA 
ACTTAGAAAT 
TAGCAGATGC 
ATTTAGTCAT 
AAACATTCTG 
TTTAAAAATG 



TACAATATTT 
GACAAGGTTT 
CTTTTTCAGG 
CTTTATACAG 
TATGAGTATG 
TATCCAGTCC 
AGTAAAATTA 
TACCACAGTA 
CAGGAAATCG 
CGACTTCAGT 
AAAAGTTATG 
TTGGTTCAAT 
CAGAAGAATG 
AGTCCTATGG 
ACAACAAATA 
CGAGAGGAAG 
GAAGCTAGAG 
GAGATTTACA 
GAAAAAATTA 
TAAACAATTT 
TGGATATGTT 
TTACATGGGT 
CAAGTCTGGT 
CCCTCAAGTG 
TATTCAGAAA 
TCAGTCAAAA 
TTGTCATCTA 
TTCATTTTGG 
GTTTCCATAT 



CAAATCTGAT 
TATTCAATTT 
GTGAAAATTT 
TCATCAGTCT 
GCATGATTGC 
GTGTTTTGCT 
AGAGTGTTGC 
TTGATCTCAG 
ATGTAATTTC 
ATGTACAGGC 
ACTTGCAAGA 
TGGAGAATAA 
TGAGTGACAG 
CTATTCCTAA 
GTTTTACTAT 
AAGAGGTCAA 
ATTTAATGAC 
AGACGGGAAA 
AGAAATTGGA 
TCTAACAAGG 
AATGAAGACT 
AACTTTATCG 
ATTACAGTCG 
GCAACGGTTC 
ATTTCTATCT 
GAGGAAGTAT 
GTATATATGT 
TGCCGGTAAT 
TGATTTTGTG 
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TGATACGGTT 
TCTTTTCCAT 
GCCAGAATCT 
TTTAGTGAAT 
CCTACATTTT 
TTTAACGGAT 
TCCTTTTGTA 
TCAGTATGAT 
TAGTTTTCCA 
ACATCGTACA 
TTATTTAATA 
TCAATCATTT 
AGCTTACCTA 
TACTGGTCTG 
GTTTGAACTC 
AAGGTGTCTG 
AGCTATTAGT 
TCAATCCATT 
GAACTAATAT 
AGGAAGCTAT 
ATATTGAAGC 
CCATACCGCA 
TTCAAGTCCC 
TTTTTGGTAT 
TCTGTGCAGA 
TGCGCTTATT 
CCCTCAAATA 
ATCGGTCGTG 
GATGTCAATA 



TCCATGTATA 
CATATTCGGC 
ATCCAGATTT 
GATATTTTTC 
ATCTCTAGCT 
GAACGTCGGG 
GAGTTGATAG 
TATATTTTAT 
ACCGTCAAAG 
ATTGTCGCGC 
TCTAGTAGTC 
GAGCACACGG 
ACAAGAAAAT 
GTGCTTTTAC 
AAACTACCTA 
CTAATGCTAA 
CAGTCGATTA 
ATTTATCAGA 
GAAACTTGAA 
TTGTTATTGT 
CATGATTGAG 
TGGAACAGAT 
TAGAGGGGTT 
TGCTGGTATT 
TGTAGATAAT 
TGATGCTGTT 
GGAAAAGGAG 
GTTTTATAGG 
ATCAGATAAT 



CCAAGATTGA 
TCAGTTTAGG 
TAGTTGAAAG 
CGAAATATCT 
TAGGCCGTAG 
TCACTAGAGA 
ATATTCAATC 
CTACCAAGCC 
AATTGCTTGA 
GTGATGCTAT 
AGCTTTTGAG 
TAGAACAAAT 
TGTTATCTCA 
ATAGTCAGTC 
TCTCCGCATT 
TGTCTAAAGA 
TTGAAAATCA 
TGCTAAATAC 
AAACATTTGA 
GGGCAAGTTC 
CGAGATAAAG 
GCAGCAAAAA 
GATTTTGGGA 
GGTAATGAAC 
GTTCTTAAAC 
GAATAATTGA 
AAATTGAATG 
TGAAATTCTA 
TCATGCTCTG 



3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 
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AATGAAAAGG GCAAGTATGA AATTGAAATT GCACAGAAAG GACAGTCTCG TATAGAAGTA 5100 

ACTAATGTGG CTGGCATTAA TAGCAAAGAA CATCCTGAGC AAGTCATTGA AGCGATTCAA 5160 

AAGACGGATA TT ATT ACT AC TGCAATCGGA CCTAATATAC TCCCTTTTAT CGCCGAACTT 5220 

CTAGCCAAAG GAATCGAAGC TCGCCGAGTT GCAGGAAATA CACAGGCATT GGATGTTATG 5280 

GCCTGTGAAA ATATGATTGG CGGGTCTCAA TTTCTTTATC AAGAAGTCAA GAAATATTTA 5340 

AGTCCGGAAG GTTTGACATT TGCTGATAAC TACATAGGTT TTCCAAATGC TGCAGTAGAC 5400 

AGGATTGTTC CAGCACAAAG TCACGAAGAT TCCCTTTTTG TTGTGGTCGA GCCCTTTAAT 5460 

GAATGGGTCG TGGAAACCAA GCGTCTTAAA AATCCAGATT TACGTCTAAA AGATGTGCAT 5520 

TATGAAGAAG ATTTAGAACC CTTTATTGAG CGAAAACTTT TTTCAGTCAA TTCTGGACAT 5580 

GCAACTTCAG CTTACATTGG TGCGCATTAT GGTGCCAAGA CAATTTTGGA AGCTCTTCAA 5640 

AATCCTAATA TTAAATCTCG GATTGAATCT GTATTAGCTG AAATTCGGAG TCTCTTGATT 5700 

GCCAAATGGA ACTTTGATAA AAAAGAATTG GAGAATTATC ACAAAGTCAT TATAGAACGA 5760 

CTTGAAAACC CTTTCATAGT GGACGAGGTT AGTCGCGTAG CTCGTACTCC AATCCGAAAA 5820 

TTAGGCTATA ATGAACGATT CATCCGGCCG ATACGTGAAT TGAAAGAACT CAGTTTGTCA 5880 

TATAAAAACC TACTTAAAAC AGTTGGCTAT GTCTTTGACT ATCGCGATGT AAATGATGAA 5940 

GAAAGTATTC GATTAGGTGA ATTGTTGGCT AAACAATCAG TCAAAGATGT TGTTATACAA 6000 

GTTACAGGTT TAGACGACCA AGAATTGATT GAGCAAATTG TAGAGTATAT TTAATCTTTT 6060 

TCGAAAATCT CTTCAAATCA GGTTAGCATC GCTTTGTCTT AGGCATATGT TGTTCTATCT 6120 

ACAACCTCAA AGCAGTGCTT TGAGCTGACT CCGTCAGTCT TATCTGCAAT CTCAAAACAC 6180 

TGTTTGAGTT ATCTGCGGTA ATCTTTCTAG CTTGTCTTTG ATTTTTGTTG TTATTTATAA 6240 

GGTAAAAGAA GCTGGACAAA AAGTCTTCAA AATCGGGAAA AGGCAGCCTA TCGGGTGTTC 6300 

AAAAATCTTG ATAGGATGTC CTTTATTATG GAAAGCCTTA TTGGATTTTC TCCTCAGATT 6360 

GAGTTTTTGA TCAGCTTTAT GAGATAGGTC TTGCTAGAGA TGTAGCCCAT CATGTTATTT 6420 

TTATGGACAG TGGGAAAATT GTTGAAAAAA ATAATGCCCA TCAATTCTTT AGTCGTCCAA 6480 

GAGAAGAACG AACCAAGCAA TTTTGGAACG AATTCTTTCG AATGCGATCT ATATAGTAAA 6540 

ATGAAACAAG AACAGGACAA ATCGATCAGG ACAGTCAAAT CGATTTCTAA AAATGTTTTA 6600 

GAAGTAGAGG TGTACTATTC TAGTTTCAAT CTACTATATA ACTGAAAAAT TAGATAAATT 6660 

AGTTTTGGAA AATGACTAAC CAAAAGATAT CCAAAGTAGT CTAAAATTGT CTATACTTTA 6720 

TGAGTGTTTT AGTTAGGAAA AAGGCTTGTT GTCTATAATT GTCTGCATTA GTCTAGATTT 6780 

TATTTATAGA AAATGTTATA ATAGACTGTA TTTAAAAAAT TTTAAGGAGA AATGACAGAA 6840 
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TGTCTGTATC ATTTGAAAAC AAAGAAACAA ACCGTGGTGT CTTGAC™: ACTATCTCTC 
AAGACCAAAT CAAACCAGAA ^ CCGTG TCTTCAAGTC AGTGAAGAAA TCTCTTAATG 
TCCAGGTTT CCGTAAAGGT CACCTTCCAC GCCCTATCTT CGACCAAAAA TTTGGTGAAG 
AAOCPC^TA TCAAGATGCA ATGAACGCAC TTTTGCCAAA CGCTTATGAA GCAGCTGTAA 
AAGAAGCTGG TCTTGAAGTG G^CCCAAC CAAAAATTGA CGTAACTTCA ATGGAAAAAG 
• GTCAAGACTG GGTTATCACT GCTGAAGTCG TTACAAAACC TGAAGTAAAA TTGGGTGACT 
ACAAAAACCT TGAAGTATCA GTTGATGTAG AAAAAGAAGT AACTGACGCT GATGTCGAAG 
AGCGTATCGA ACGCGAACGC AACAACCTGG CTGAATTGGT TATCAAGGAA GC^CTGCTC 
AAAACGGCGA CACTGTTGTG ATCGAC™* T^G^AT CGACGGTGTT GAA^ACG 
^GGAAAAGG TGAAAACTTC TCACTTGGAC ^CAGG TCAA^ATC «!«*«« 
AAGACCAATT ggtagg^ac tcagctggcg AAACCG^A ^AO^TA aca™:ccag 
aagactacca agcagaagac OT GCAGG T A aagaagotaa a^aca acta TC cacg 

AAGTAAAAGC TAAAGAAGTT CCGGCO^ ACGATGAACT TGCAAAAGAC ATTGATGAAG 
AAGTTGAAAC AC^CTGAC TTGAAAGAAA AATACAGCAA AGAA^GC. GC TC CTAAAG 
AAGAAGCTTA CAAAGATGCA GTTGAAGGTG CAGCAATTGA TACAGCTGTA GAAAATGCTG 
AAATCGTAGA ACTTCCAGAA GAAATGATCC ATGAAGAAGT TCACCG^A GTAAATGAAT 
TCCTTGGGAA TTTGCAACGT CAAGGGATCA ACCCTGACAT GTACTTCCAA ATCACTGGAA 
CTACTCAAGA AGACCTTCAC AACCAATACC AAGCAGAAGC TGAGTCACGT ACTAAGACTA 
ACCTTGTTAT CGAAGCAGTT GCCAAAGCTG AAGGATTTGA TGCTTCAGAA GAAGAAATCC 
AAAAAGAAGT TGAGCAATTG GCAGCAGACT ACAACATGGA AGTTGCACAA GTTCAAAACT 
TGCTTTCAGC TGACATGTTG AAACATGATA TCACTATCAA AAAAGCTGTT GAATTGATCA 
CAAGCACAGC AACAGTAAAA TAATCTTAAT AAACAGAAAA CCCACC^AA TTGGTGGGTT 
TTCTGATGCA CTATTTTCCA AAAATCTCTT TGAGGTCTGT GTCTGTAATC CCAATCATGG 
CTGGGATGCG GTCCCAGTTT TCTTCGGTTA GGATGTAGGA TTGTTCAGAG GCACTTGATG 
-AC^ AGAGACAGCT TGTTGCTTTT CTTCAACATT CTCCAGTAGA TCACTGAAGC 
6TTCAATCAG ATAGGTTTTT CGGGCAGTTC CGATGTGTTG GGTAGCATAG TCGAAGGCTT 
GTAATTCGCC TAGTAAGATG AGTTTGCTTT TGGCACGTGT AATGGCTGTG TAGATGAGAT 
^CCTCCAG CATACGTCGG CTAGCACTAG TAATCGGTAG GATGACAACT GGGAACTCAC 
TTCCCTGAGA CTTATGAATA CTCA^GCA, AGGCCAAGCG AATCTTGTAC cmwmc 
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GGGGGTAAGA GACTTCATTA CCATCAAAAT CAATGACAAT CTCGTCTTGT TTCGATTCGG 8640 

TGTATTTACC AGGAATCAGG TCTGTGATAG CTCCTAAATC CCCATTAAAG ACATTGATTT 8700 

CAGCATCGTT AACCAAATGA ATGACCCTGT CTCTCTTACG ATAGTGACAC TGAGGAGCTT 8760 

CAAAACTGAG TTGATCTTTT TGTGGGGGAT TGAGCAGGTC TTGCATGAGC TGATTGATAG 8820 

CATCAATCCC TGCCGTCCCT CGGTACATAG GAGCCAGAAC TTGGATATCA CGGGCGGGAA 8880 

TACCATTTCT GAGGGCGGCA CCTAAGATTT TTTCAATGGT GGCAGGAATA TGGCCACTAG 8940 

CAATTTCAAA GTAGGAACGG TCAGCTTTTT TTTGGGTGAA ATCAGCTGGC AAGATGCCCT 9000 

GTCGAATCTG ACTAGCTAGG GTGACGATGG TTGATTCTTT GCTTTGTCGA TAAATTTTTT 9060 

CCAAGCGAGT CTGAGGAATC AAAGGAATAT GAAGTAGATC CGCTAGAACC TGTCCAGGAC 9120 

TGACAGAAGG TAGCTGATCA CTGTCACCTA CGATGAGGAT CTTACTGTTA GAAGAGATAT 9180 

TGGAGAAGAG TTGATTGGCC AGCCAAGTAT CTACCATAGA GAATTCATCC ACGATGATAA 9240 

AGTCAGCATC TAGGTAATCT TCCAGATGAC TGGTATCATC GTCACCTGTC ATTCCCAAGT 9300 

GGCGATGTAT GGTCGCGCTA GGCAAACCTG TCAATTCATT CATGCGACGA GCAGCTCGAC 9360 

CAGTTGGAGC AGCAAGAAGA ATGGGCAGAT TGCTTTTCTT CCTGAAGTCA AGTCCTTCTA 9420 

AAAGGGCATA AACAGCAATG ATTCCATTGA TAACAGTTGT CTTACCAGTA CCAGGCCCAC 9480 

CTGTCAGGAT AAAGACCTTA TTCTGGATAG CATCACAGAT AGCCTGTTTT TGAATGTTAT 9540 

CATACTCAAT TCCCAGTTCT TGCTCGACAG TAGTGATATG TTTTTGAATG GTTTCTAAAT 9600 

CATGACTCTT CTGTTTTCCT TTTTCAAGGA TACGAACCAA GTGACTGCGG ATGCCTTCCT 9660 

CAGCGAAAAA GAGGCTGTTG TCAAAGATCT TGGTATCAAT CTGCTGAACC TTGTCTTCTT 9720 

CGATCAGGTA GGAGAGCTCT TGGGCAACTT GGCTGGGGTC TAGTTCCACG GGACGGGAAG 9780 

ACTCAAGGAG AGTAAGGGTT TGTTCCAGCA AATCCCGTGC TTCAACATAG GTGTCCCCTG 9840 

TTTCCATACA GGCCTGAAAA AGACTGTGAA CTAGACCGGC GCGGAAGCGT TCAGGAGCCT 9900 

GACTTTCGAT GCCTAGTTCC TCAGCTAGTT GGTCAGCAAT GGTAAAGCCC AAACCCTTGA 9960 

TATCCTCAAC CAGCTGGTAG GGATAATTTT CAACCACATC AAGGGTTTCT TCCTTGTAAA 10020 

AGTCTTGAAT CTGAAAGGCT AGTTTGTTGG GAATGCCGTA GTTGGCTAGT TTGGCCAAAA 10080 

TCATCTCCGT TCCGTAGTTG AGACGGAGAG TGGAGACGAA AGCCTCGCGA TTTTTGGCAG 10140 

AGAGTCCTGC GATGCCTTCT AACTTTTCTG GGTGTTGCAA AATTTCGTCA ATGGTATTTT 10200 

CGCCATAGGT ATCCACGATT TTCTGAGCTG TCTTGAGACC AATCCCCTTG AAATGGCTAC 10260 

TTGAAAAGTA CTTGACCAAG CCCTTACTAG TTGGTTTTGC GCGATCATAA CGACTGATTT 10320 

GCAGTTGTTC TCCATACTTG GAGTGCTGGA CAATTTGCCC CCAAAAAGTA TAGTCTTCGC 10380 
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CCTCAATTAC ATCAGCCATG GTTCCTGTGA CAATGATTTC AAAATCATCA AAATCCTCTG 
CGTCCGTATC GTCGATTTCT AGGAGGAGGA TGCGATAAAA ATTGCTGGGA TTTTCAAAAA 
TAATCCGTTC AATAGTTCCT GAAAAATAAA CTTCCATAAA ATTCCTTTGC ATGAATAGGT 
GAGAGTTGGG ATTGTTTTTA TTTTATACTC TTCGAAAATA TCTTCAAACC ACGTCAGCTT 
CCATCTGCAA CCTCAAAACA GTATTTTGAG CTGACTTCGT CAGTTCTATC CACAACCTCA 
AAACACTGTT TTAAGCAGCC TACGGCTAGC TTCCTAGTTT GTTCTTTGAT TTTCATTGAG 
TATTTGTAAA TAAACAATCA CTTCTCACGA TAGAAGAAGA GGCTGAGATT GGTGATTCTC 
TGCCTCTTAG GTTTCTTAAA ATGTTCCGAT ACGGGTCATT GGCCATAAGC GGAATTTAGC 
TTCCCCTGTG ATATCTTTTG CTTTGAAGGT ACCTACGTGG CGGCTGTCGC TCGAAACCAA 
GCGGTCATCT CCGAGGAGAA GGTATTCTCC TTCTGGAACA GTAAAGCTAA AGTTGGTGTT 
GTAGTTGACA TCAACTGTGA AGGCTTGAGC TTTTTGAGCG ATACTTCTAA AGAAAGTTCC 
TTTATTTCCT TCAAAGCCCT TGCCTGAGTA AGTGCTTTGG AGTTTGTCAT CCTTGAAGCG 
TTTGATATAG TCTGCTAGAT AAGGCTCGTC CGTTTCTTTG TCATTGATGT AGAGTTTATC 
ATTTTCGTAA CGAATGGTGT CGCCAGGCAT TCCAATCACG CGCTTGACGA TGTCCTTATT 
GCCATCTTCC TCATGGGCCA CCACGATATC AAAACGGTCA ATAGGAAGGT GTTTTACAAC 
GAAGAGAATT TCGCCATCCG CTAGGGTCGG ATCCATGGAA TGTCCTTCTA CGCGAACATT 
GCTCCAAAAA AAGATACGAC TTAAAGCTAG TAATGACAGA ATTAGGAGGA ACAATCCCCA 
CTCTTTTAAG AAATTTTTAA ATGAATTCAT AACTTACCTT TCTAAGCGTT TTTTCGCTTT 
TTCAGTGTTT TTAAAGTGCA ATTTGGCGCA GAAGCTGAGT CCCTGCATAC CATAGGCTTG 
CAAAATCTGG CTAGCCACCT TGTCAGAAGC CGTTCCAGCT CCACTTGGGA GCTGATAACC 
CAGTTCTCGT CCCAAATTTT CAAGATTTTC CAGAAAGAGA TCACGCGCAA TGACAGAAGA 
AACTGCGACA GACAAGTATT TGCCCTCAGC CTTTTCTTCT AAGCTGATAG GATTGCTGAA 
ACGATTGGCC TCTTGTGCCA AGTACTTGTC ATAATTTTTA GCACTGGTAA AGGCATCAAT 
CACAATTTTC TCAGGCTGAA CACCTTTTTG AAGGAGGAGA TAGATAGCCT GATTATGGAG 
GGCAACCTTA ACCGAAACAG CGTTGTAGCG GTCTCCGATG ACCTCGTTGT ACTTGCTGGG 
TGAGAGAAGG AGTGCCTGGT GCTGAATTTT TTCCTTGAGA ATAGGAGTAA TCTGACGGAT 
CTTTTGGTCG GTCAGAGTCT TAGAATCCCC CACACCGAGT TTTCGTAAAA AGTCGTGCTG 
GTCAGGTGTG ACAAAGGCAG CCACAACTGC AAGCCCACCA AAGTAGGAAC CATTTCCCAC 
CTCATCTGTC CCAATTAAAG GAAGATTTTG TCCGCTGGTT TGCTCTACAG CTTGATAGCC 
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AAAGAAACTG GCGTATTTTT CAGCCCCTTC ACCCTGAAGC AAGATTTTTC CAGAAGTATA 
GATAGAAACC GTTGCTTGAG GTAGTTTCAA AAAGTAGCGG ATATAGGGAT TCTTGCTAGG 
AGCCAGACTG GTTTGATAGT GTTCAAGAAA AGCCTGAATA TCCTTTTCGC TTGGTGTGAG 
TGTGATACTT GCCATAGTTT CTATTGTACC ACAAAAGCAG TAAAATTTGT AAAAACTGAC 
AAAATTAGCG AATTTTGGTA TAATATCGTG AGGTGAATTT TATGGCAAAT CTAAATCGAT 
TCAAATTTAC ATTCGGGAAA AAATCGTTAA CCTTGACAAG CGAACATGAC AACCTTTTTA 
TGGAGGAAAT CGCTAAGGTT GCGACAGAAA AATACCAAGC AATTAAAGAA CAAATGCCTA 
GCGCAGATGA TGAAACAATC GCTCTTTTGT TGGCAGTCAA CTGTTTATCA ACTCAGCTCA 
GCCGTGAGAT TGAATTTGAC GATAAGGAGC AAGAGCTAGA AGAACTCCGT CACAAGCTTG 
TGACTTGTAA GCAAGAACAG AGCAAGATTG AGGATTCCTT ATGATTTCAT TCCTTCTTCT 
ATTGGTCTTG GTTTGGGGAT TTTATATCGG CTATCGGAGA GGCCTGCTCT TACAGGTTTA 
TTACCTGATT TCAGCCATGG CATCGGCTTT TATGGCTGGC CAGTTTTATA AGGGGCTTGG 
AGAGCAATTC CATTTATTGC TCCCTTATGC AAATTCGCAG GAAGGTCAGG GGACTTTCTT 
TTTCCCATCG GATCAACTCT TTCAGCTGGA TAAGGTCTTT TATGCAGGTA TCGGCTACTT 
GCTTGTATTT GGGATTGTCT ATAGCATTGG TCGTTTACTT GGTCTTCTCT TACACTTGAT 
TCCTAGCAAA AAACTGGGTG GTAAGTTGTT CCAAGTTTCA GCAGGTATCT TGTCCATGTT 
GGTGACCTTA TTTGTCTTGC AAATGGCCTT GACAATCTTG GCGACCATCC CCATGGCAGT 
TATACAAAAT CCTCTTGAAA AGAGTATCGT CGCAAAACAC ATCATCCAGA GCATACCGGT 
AACAACCAGT TGGCTCAAAC AAATCTGGGT GACAAATTTA ATCGGATAAA AAGGGCAGGA 
GTTTTCCTAG CCCTTTGTTT ACAGATTTGA CTCGAATCTA TCAGAATGTA AAAAGCTACC 
AC AC CT AG AC ATTCAAAGAC AAGGAAATAA AGATGAATAA GAAAATATTA GAAACATTAG 
AGTTCGATAA GGTCAAGGCC TTGTTTGAGC CTCATTTGTT GACCGAGCAG GGCTTGGAGC 
AATTGAGACA ACTGGCTCCG ACTGCCAAAG CAGATAAAAT CAAACAGGCT TTTGCTGAGA 
TGAAGGAAAT GCAGGCTCTT TTCGTCGAGC AACCGCATTT TACTATTCTC TCAACTAAGG 
AAATTGCAGG AGTCTGCAAG AGGTTGGAGA TGGGAGCGGA TCTCAATATC GAGGAGTTCC 
TACTCTTGAA ACGCGTGCTT CTTGCCAGCC GAGAACTTCA AAATTTTTAC ACCAATCTGG 
AAAATGTCAG CTTGGAAGAA TTAGCCCTTT GGTTTGAGAA ATTACATGAT TTTCCGCAAT 
TACAAGGAAA TCTTCAGGCC TTTAATGATG CGGGTTTCAT TGAAAATTTT GCCAGTGAAG 
AATTGGCGCG AATCCGTCGA AAAATACATG ATAGCGAGAG TCAGGTACGC GATGTTTTAC 
AAGACTTGCT CAAGCAAAAA GCGCAGCTGT TGACGGAAGG AATTGTTGCT AGCAGAAATG 
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GCCGTCAGGT 
ATATTTCTGC 
AAGAAATTGC 
TTTCTGAGCG 
ATCTGGACTT 
AGCTGTCAGA 
CCGTCGCAAA 
ATACAGGTGG 
CAGGATTGCC 
CTGATATTGG 
CCAATATCGT 
TGGGGGCTGG 
TTCGCCTGCG 
GTATTGAGAC 
CGACCTATCG 
GTCTAGGCCT 
ACGTCAATCG 
ACAATATCCG 
ACAACGAGCT 
AGATTGTGGA 
CCCAACTCAA 
AAAAAGTGGA 
AGGTGGGAGA 
TCAAGGACGG 
AGTTTGATCT 
TGAAACGAAC 
AAGAAGCCAT 
AAGTTGATAT 
TGCAAAGAAA 



TTTACCAGTC 
TAGTGGAAAC 
TAGTCTGCGA 
TGTCCGCCCT 
GATTCGTGCC 
AAATCAAGAG 
TGATGTCTAT 
GAAGACCATC 
GATTTTAGCA 
AGATGAGCAG 
GGATATTCTT 
TACTGATCCC 
TCAAATCAAG 
AGCCTTTGTG 
CTTTATGCAG 
ATCTGAAGTT 
TATCATTGAG 
TGAGGTGGAG 
TAATCGTGAA 
TATGGCCCTA 
GCCCCACGAA 
CTTGTCTAAA 
TGATATCGTG 
TCGCTGGGAA 
TGTTCAAGCC 
TTCTGGGCGA 
GAATGAGCTA 
CATCCATGGT 
CAAACATGTC 



AAAAACACCT 
ACCGTCTATA 
GCAGATGAGC 
CATGCGGCTG 
AAGGTTCGAT 
ATTCAACTGC 
TTTGGTCAAG 
ATGCTCAAAA 
GACAAGGGAA 
TCTATTGAGC 
GGCAAGGTCA 
CAAGAGGGAG 
ACCATGGCGA 
CAAAATGCCA 
GGTGTTCCTG 
ATCGTAGGAG 
CAATTAGAAG 
CAAGAAAATC 
AAGGAAACCG 
AGTGAAAGTG 
ATCATTGAAG 
AATAAGGTCC 
GTTCTCAGTT 
GCCCAAGTTG 
CAGCAAGAAA 
GGACCTCAAG 
GATACCTTCA 
ATCGGAACAG 
AAGAGTTTCG 
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ACCGCAATAA 
TCGAACCCCG 
GCTATGAAAT 
AGATTGCTAA 
TTATCCAAGA 
TCCATGTCTG 
ATTTAACAGC 
CTCTGGGCTT 
GTCGTGTTGG 
AGAGCTTGTC 
ACCAACATTC 
CAGCCCTTGC 
CGACCCACTA 
GTATGGAGTT 
GCCGAAGTAA 
ATGCCAGTCA 
AGCAGAOGCT 
TCAAGATGAA 
AGCTTAACAA 
ACCAGATTCT 
CCAAGGCCAA 
TTCAAAAGGC 
ATGGTCAGCG 
GCTTGATTAA 
AACCAGTCAA 
CTAGACTGGA 
TCGACCAAGC 
GAGTCATCCG 
GCTATGCCCC 



GATTGCAGGT 
TGAGGTAGTC 
GCTTCGCATT 
TGACGCTTGG 
AAGACAAGCA 
CCATCCTTTG 
TATTGTCATT 
GACACAGGTC 
TATTTTTGAA 
TACCTTCTCT 
ACTCTTACTT 
CATGGCTATT 
TCCAGAACTC 
TGATACTGCA 
TGCCTTTGAA 
GCAGATCGAT 
GGAAAGCCGC 
CCGTGCGCTA 
GGCGCGTGAA 
CAAAAATCTC 
GTTGAAAAAA 
CAAGAAAAAA 
TGGTACCTTG 
GATGACCTTG 
GAAGAAACAG 
TCTTCGAGGC 
CTTGCTTAAC 
TGAAGGAGTT 
ACAAAATGCT 



GTCGTTCATG 
AAACTGAGCG 
CTCCAAGAAA 
ATTATCGGTC 
GTCGTGCCTC 
GTCAAAAATG 
ACAGGTCCCA 
ATGGCCCAGT 
GAAATCTTTG 
AGTCATATGA 
TTGGATGAGT 
CTGGAGGACC 
AAGGCCTACG 
ACTCTTCGCC 
ATTGCCAAAC 
CAGGACAATG 
AAACGTTTGG 
AAAAAACTCT 
CAGGCTGCTG 
CACAGTAAAT 
TTGGCTCCTG 
CGAGCTCCAA 
ACCAGTCAAC 
GAAGAGAAAG 
GTCAATGTTG 
AAGCGCTATG 
AATATGGCTC 
ACCAAATACT 
GGAGGCAGTG 
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GTGCGACTAT TGTCACTTTT AAAGGATAGC AGTATTCTGG ACTTTATAAA GTAAAAACTG 15720 

TTGAACTAAT TTTTACTAAT AAACACATTG ACAAAAGCCA ACATTTTTTG TAAAATTAGA 15780 

ATCAATTAAA TACCAACACC GAATGAAGTT TAATAGAAGT GGGGAATCGT TTGATTTTCC 15840 

ATGACTGTAA ATGGACGGAA CTCTGGAGAG ACCGTAAAGG CACCGAAGGG CAAGGCAGGC 15900 

AACTGCTCAA ACTCTCAGGT AAAAGGACAG AGCTAGGATA GACCGCTTTT TAGCATTTAT 15960 

CTAAGCATTC CAGAGTACAT GTATCTTGCA TGTGCTCTTT CTTTTGGGGT TGAAACGATA 16020 

GGAGAAGGAA ATGTTAGAAT TGCTTAAATC AATCGATGCT TTTGCTTGGG GACCGCCCCT 16080 

CTTGATTTTA TTGGTCGGAA CAGGGATTTA CCTAACTATT CGGCTAGGAC TCTTGCAGGT 16140 

TTTGCGTCTA CCCAAGGCCT TTCAGCTTAT TTTTATCCAG GATAAGGGAC ATGGTGATGT 16200 

ATCCAGTTTT GCAGCTCTGT GTACAGCCTT GGCATCAACT GTTGGAACAG GAAATATCAT 16260 

AGGAGTTGCG ACGGCTATCA AGGTTGGTGG ACCAGGAGCT CTATTTTGGA TGTGGATGGC 16320 

GGCTTTCTTT GGAATGGCTA CCAAGTATGC GGAAGGACTC TTGGCCATCA AATACCGCAC 16380 

CAAGGACGAC CATGGTGCAG TAGCGGGAGG TCCCATGCAT TATATCCTTC TAGGGATGGG 16440 

AGAAAAGTGG CGACCACTTG CTGTTTTGTT TGCAGTAGCA GGAGTATTGG TTGCTCTCTT 16500 

GGGAATCGGA ACCTTCACCC AAGTCAACTC GATTGCAGAA TCTATCCAAA ATACAACGAC 16560 

GATTTCGCCA GCCATCACAG CTCTCGTCTT GTCTGTCTTT GTAGCGATTG CAGTCTTTGG 16620 

TGGACTCAAG TCTATTTCTA AGGTTTCAAC TACTGTTGTT CCTTTTATGG CCATCATTTA 16680 

TATCTTAGGA ACTCTTACAG TTATTTTCTT TAATATCGGA AAAATCCCTG GCACAATCGC 16740 

TTTAGTCTTT ACCTCAGCTT TTAGTCCCCT TGCTGCGGTA GGTGGATTTG CTGGTGCTAG 16800 

CGTTCGGATG GCTATTCAAA ATGGTGTGGC GCGTGGTGTG TTCTCAAACG AATCTGGTCT 16860 

GGGTTCTGCT CCTATTGCAG CTGCAGCTGC CAAGACAAAT GAACCAGTAG AGCAAGGTTT 16920 

GATTTCCATG ACAGGAACCT TTATTGATAC CCTCATCATT TGTACTCTAA CTGGTTTGAC 16980 

CATCTTGGTA ACTGG 16995 
(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28473 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS : double 
(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

CCGGGGCTTT TGTAGTATAA TAGAGATACG TTTTGAAAGT AGGAGGTATC TATGGACTTA 60 
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ACTAAGCGCT 


TTAATAAACA 


GTTAGATAAA 


ATTCAAGTTT 


CGTTGATTCG 


TCAGTTTGAC 


120 


CAGGCTATTT 


CGGAGATTCC 


TGGGGTCTTG 


CGTTTGACCT 


TGGGGGAACC 


TGATTTTACA 


180 


ACGCCAGACC 


ATGTCAAGGA 


GGCGGGCAAG 


CGAGCGATTG 


ATCAGAACCA 


ATCCTACTAT 


240 


ACAGGGATGA 


GTGGTCTGCT 


GACTCTACGT 


CAGGCAGCCA 


GTGACTTTGT 


TAAGGAAAAG 


300 


TACCAACTGG 


ACTATGCTCC 


TGAAAATGAA 


ATCTTGGTTA 


CAATTGGGGC 


GACAGAGGCT 


360 


TTATCTGCGA 


CTTTGACGGC 


TATTTTGGAA 


GAGGGAGACA 


AGGTACTTTT 


GCCAGCTCCT 


420 


GCTTATCCAG 


GCTATGAACC 


GATTGTTAAC 


TTAGTTGGGG 


CAGAAATTGT 


TGAGATTGAT 


480 


ACGACTGAAA 


ATGGTTTTGT 


CTTGACTCCT 


GAGATGTTGG 


AGAAGGCCAT 


TTTGGAGCAG 


540 


GGTGATAAGC 


TCAAGGCGGT 


TATTCTCAAC 


TATCCAGCCA 


ATCCGACAGG 


AATTACCTAC 


600 


AGTCGAGAGC AGTTAGAGGC 


CTTGGCAGCT 


GTTTTACGCA 


AGTACGAAAT 


TTTTGTTGTC 


660 


TGTGATGAGG 


TTTACTCAGA 


ATTGACCTAC 


ACAGGCGAAG 


CCATGTGTCT 


CTAGGAACGA 


720 


TGTTGAGAGA CCAGGCTATT 


ATTATCAATG 


GTTTGTCTAA 


ATCGCATGCC 


ATGACAGGTT 


780 


GGCGTTTGGG 


GCTGATTTTC 


GCTCCTGCGA 


CCTTCACAGC 


CCAGTTAATC 


AAGAGTCACC 


840 


AGTACTTGGT CACTGCCGCA AATACCATGG CGCAACATGC 


TGCGGTAGAA 


GCCTTGACGG 


900 


CTGGTAAAAA CGATGCGGAC 


CCATGAAGAA 


GGAATATATC 


CAACGTCGGG 


ACTATATCAT 


960 


CGAAAAAATG 


ACTGCTCTTG 


GTTTTGAGAT 


TATCAAACCA 


GACGGTGCCT 


TCTATATTTT 


1020 


TGCTAAAATT 


CCAGCGGGCT 


ACAATCAAGA 


CTCCTTTGCT 


TTTCTGAAGG 


ATTTTGCTCA 


1080 


GAAGAAGGCC 


GTTGCCTTTA 


TCCCTGGTGC 


AGCCTTTGGA 


CGTTACGGGG 


AAGGCTACGT 


1140 


CCGCCTATCT 


TATGCAGCCA 


GCATGGAGAC 


TATCAAAGAA 


GCCATGAAAC 


GACTTGAGGA 


1200 


GTACATGAGA GAAGCATGAT 


TCAGTCTATC 


ACGAGTCAAG 


GCTTGGTGCT 


TTACAATCGC 


1260 


AATTTTCGTG AGGATGACAA 


GCTCGTCAAA 


ATTTTTACAG 


AGCAGGTTGG 




X J £ u 


TTTTTTGTCA AACACGCTGG 


TCAGTCTAAG 


CTGGCGCCTG 


TTATTCAGCC 


CTTGGTGCTG 


1380 


GCACGATTTC 


TCTTGCGAAT 


CAATGATGAC 


GGACTCAGTT 


ACATCGAAGA 


CTATCATGAG 


1440 


GTCATGACTT 


TTCCCAAGAT 


TAATAGTGAC 


CTCTTTGTCA 


TGGCCTATGC 


GACCTATGTG 


1500 


GCAGCTCTTG 


CAGATGCTAG 


TTTGCAGGAC 


AATCAGCAGG 


ATGCTCCCTT 


GTTTGCTTTT 


1560 


TTGCAAAAGA 


CTTTGGAGTT 


GATGGAAGCA 


GGCTTGGATT 


ATCAGGTTTT 


GACCAATATT 


1620 


TTTGAAATTC 


AAATTTTGAC 


TCGATTTGGA 


ATCAGCCTCA 


ATTTTAATGA 


GTGTGTCTTC 


1680 


TGCCATCGGG 


TTGGTCAGGC 


TTTTGACTTT 


TCTTTCAAAT 


ATGGAGCCTG 


CCTCTGTCCA 


1740 


GAGCATTATC 


ATGAGGATAA 


GAGACGTTGT 


CATCTCAATC 


CCAATATCCC 


CTATCTGCTC 


1800 
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AATCAATTTC AAGCTATTGA TTTTGAGACT TTGGAGACCA TTTCGCTCAA GCCTGGAATC 1860 

AAGCAAGAGC TACGCCAATT TATGGATCAA TTATATGAAG AGTACGTTGG GATTCACCTA 1920 

AAATCAAAGA AATTTATTGA TTCCCTAGCA GACTGGGGAC AATTACTAAA AGAGGAAAAG 1980 

AAATGAAAAA AATCGCAGTA GATGCCATGG GGGGCGATTA CGCACCTCAG GCCATTGTTG 2040 

AGGGTGTCAA TCAAGCCCTA TCTGACTTTT CAGATATCGA GGTTCAACTT TACGGAGATG 2100 

AAGCTAAAAT CAAGCAATAT CTGACAGCGA CAGAGCGCGT CAGCATTATC CATACGGATG 2160 

AGAAGATTGA TTCGGATGAT GAACCTACGA GAGCTATTCG GAATAAGAAA AATGCCAGTA 2220 

TGGTATTGGC AGCCAAGGCT GTCAAAGATG GTGAAGCAGA CGCTGTCCTT TCGGCTGGGA 2280 

ATACAGGTGC CTTGTTGGCA GCAGGATTCT TCATCGTGGG TCGTATCAAG AATATCGACC 2340 

GTCCTGGACT CATGTCTACC TTGCCTACCG TTGATGGAAA AGGTTTTGAC ATGCTAGACC 2400 

TTGGTGCCAA TGCAGAAAAT ACAGCCCAGC ACCTCCATCA ATATGCGGTT CTAGGTTCCT 2460 

TCTATGCTAA AAATGTCCGT GGCATTGCGC AACCACGCGT TGGTTTGCTC AACAACGGAA 2520 

CAGAGAGTAG CAAGGGCGAC CCGCTTCGTA AGGAAACTTA TGAATTACTG GCGGCTGATG 2580 

AAAGTTTGAA CTTTATCGGA AACGTGGAAG CGCGTGATTT GATGAATGGC GTTGCAGATG 2640 

TTGTTGTGGC AGATGGTTTC ACGGGAAACG CTGTGCTCAA ATCCATCGAA GGGACAGCTA 2700 

TGGGAATCAT GGGCTTGCTC AAGACAGCTA TTACAGGTGG TGGTCTTCGA GCGAAACTAG 2760 

GTGCCCTCCT TCTCAAGGAC AGCCTCAGTG GTTTGAAAAA ACAGCTCAAT TATTCAGATG 2820 

TTGGTGGAGC GGTCTTGTTT GGTGTTAAGG CACCTGTTGT CAAGACTCAT GGCTCAAGCG 2880 

ATGCCAAGGC TGTTTATAGT ACGATTCGTC AGATCCGTAC CATGCTAGAA ACAGACGTGG 2940 

TTGCCCAGAC TGCGCGTGAA TTTTCAGGAG AATAAAAGAG ATGACAGAAA AAGAAATTTT 3000 

TGACCGTATT GTGACCATTA TCCAAGAGCG ACAGGGAGAG GACTTTGTCG TGACAGAATC 3060 

CTTGAGTCTG AAAGACGATT TGGATGCGGA TTCTGTTGAC TTGATGGAGT TTATCTTGAC 3120 

TCTGGAAGAT GAATTTAGTA TCGAAATCAG CGATGAAGAA ATTGACCAAC TCCAAAACG ? 3180 

AGGAGATGTG GTTAAAATCA TTCAAGGAAA ATAGCAATCG GAGTTCCAAG TCAACGGAAG 3240 

TAGATGGTTT TTAGAAATGA GAAATATCGG ACAAGCTGGT AAAATCTTGG CTGACAGTGG 3300 

TTATCAAGGG CTCATGAAGA TATATCCTCA AGCACAAACT CCACGTAAAT CCAGCAAACT 3360 

CAAGCCGCTA ACAGTTGAAG ATAAAGCCTG TAATCATGCG CTATCTAAGG AGATAAGCAA 3420 

GGTTGAGAAT ATCTTTGCCA AAGTAAAAAC GTTTAAAATG TTTTCAACAA CCTATCGAAA 3480 

TCATCGTAAA CGCTTCGGAT TACGAATGAA TTTGATTGCT GGTATTATCA ATCATGAACT 3540 

AGGATTCTAG TTTTGCAGGA AGTCTAATAG TAAAAAAGTG ATTAGAAAAC ATCTTTTTTA 3600 
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AAAATAGAGA TGATTTTGAA ACAAAAAAGC TAATTCAAGA CGTTTCGATG CCAATTCAAG 
ATTTGGATGA AAAAAATTAA TAGATACTGT TATACTAAAC TTGTCAAGTT TGTAACAAGA 
CAAATATTAA AAATAAAAAA GAGGTATTCG TTATGAATAC AAAAACGATG TCACAATTTG 
AAATTATGGA TACTGAGATG CTTGCTTGCG TTGAAGGTGG CGGATGCAAT TGGGGAGATT 
TTGCCAAAGC AGGTGTTGGA GGAGGAGCAG CACGAGGTCT TCAGCTAGGA ATTAAAACAA 
GAACATGGCA AGGTGCAGCA ACTGGTGCTG TGGGAGGAGC TATACTTGGA GGTGTGGCCT 
ATGCAGCGAC ATGTTGGTGG TAATTATGGA TTTTAAAAGT TTTATTATTG GTTTAGTAGT 
TGGTATATTT GGTCCTTATA TGGATGATTT AATTAGAAAA AAATTTTTAA AGTCTTCGGA 
GAAGAAAACA GAAAAATCTG TTAAAAAATA ATCAAAACTA TAAATGATGA ATCTGAATCA 
AAATTATTTT GCGCATGTAA AGAGGAGTCT TATAGTAACG AGTCAAAAAA GGAGTAACTA 
TGAATCGTAA TTTAGAACGG TGTTATCTAT TCTGACTAGG AATAGATCAT ACCAGAGGTA 
GCTTAGAAAT AGCAGAGACA TTAGAAATTG AAGTAATAAA TAGGATGTCG TAAGTGTTAC 
TATCAATGAT TTATTTGTTT CAAGCTTGCC TAGGGTGACA GTAAAAAATC AATTTCCTTT 
CAATAGCATA TTTTTAGTGG GCAGGACTCT TGTTCTGCCT ATTTTTTTAT CCAAAAAGTG 
CAGTTGGGAG GGAGATAGGC TCATTTGGGA AGGAAGTCCA GTTTTTGTTT AGTGATTGGG 
GTAAGATAGT TGTTATCAGA TGAGTTAATA CTCTTCGAAA ATCAAATTCA AACCACGTCA 
ACGTCGCCTT GCCGTATATA TGTGACTGAC TTCGTCAGTC CTATCTACAA CCTCAAAACA 
GTGTTTTGAG CAGCCTACGG CTAGTTTCCT AGTTTGCTCT TTGATTTTCA TTGAGTATTA 
GGGAAAAGGA GATGAATATG AAATTTGGGA AACGTCATTA TCGTCCGCAG GTGGATCAGA 
TGGACTGCGG TGTAGCTTCA TTAGCCATGG TTTTTGGCTA CTATGGTAGT TATTATTTTT 
TGGCTCACTT GCGAGAATTG GCTAAGACGA CCATGGATGG GACGACGGCT TTGGGCTTGG 
TCAAGGTGGC AGAGGAGATT GGTTTTGAGA CGCGAGCCAT TAAGGCAGAT ATGACGCTTT 
TTGACTTGCC GGATTTAACT TTTCCTTTTG TTGCCCATGT GCTTAAGGAA GGGAAATTGC 
TCCACTACTA TGTGGTGACT GGGCAGGATA AGGATAGCAT TCATATTGCC GATCCAGATC 
CCGGGGTGAA GTTGACTAAA CTGCCACGTG AGCGTTTTGA GGAAGAATGG ACAGGAGTGA 
CTCTTTTTAT GGCACCTAGT CCAGACTATA AGCCTCATAA GGAACAAAAA AATGGTCTGC 
TCTCTTTTAT CCCTATATTA GTGAAGCAGC GTGGCTTGAT TGCCAATATC GTTTTGGCAA 
CACTCTTGGT AACCGTGATT AACATTGTGG GTTCTTATTA TCTGCAGTCT ATCATTGATA 
CCTATGTGCC AGATCAGATG CGTTCGACAC TAGGGATTAT TTCTATTGGG CTAGTCATCG 
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TCTACATCTT CCAGCAAATC 
AACGCTTGTC GATTGACGTG 
CCTTCTTTGC GACACGCAGG 
TCATCGATGC GCTGGCTTCG 
TTATTTCCCT TGTTCTATTT 
TTCCTATCTA CACAGTGATT 
ATACCATGGA AGCCAATGCG 
AGACTATCAA GTCCTTGACC 
TGGATTATCT GAAGAAATCC 
AAAAGGTTGC CCATCTCTTG 
TGGATGGCAA GATGAGTTTG 
CTAATCCTTT GGAAAATATC 
ATAACCGTCT AAATGAAGTG 
AGGATTTGAG CTTGATGAAG 
ATGGTCGAGA TGTCTTATCG 
TTGTGGGGAT TTCAGGGTCA 
ACCCAAGTCA AGGGGAGATT 
CCCTGCGCCA GTACATCAAC 
TGGAGAATCT TCTTTTGGGA 
TCGAATTGGC AGAGATTCGA 
TGACTTCGGA TGGGGCAGGG 
CTCTCTTGAC AGATGCGCCG 
TGACAGAGAA GCGGATTGTC 
CTCACCGCTT GACTATTGCT 
TTGTCGAAGA AGGAAAGCAT 
TCAATAGCTA GAAAGAGGAG 
ATCGTCGTTA CCATAATTTT 
TTTTACTTGG CTTTGCAACT 
TCGAACCTAG TCGTATCCTT 
ATCATTTGGA AGAAAATAAG 



TTGTCTTACG 
ATTTTGTCCT 
ACAGGGGAGA 
ACCATCCTTT 
TCACAAAATA 
ATCTTTGCCT 
GTTCTGTCTT 
AGTGAAAGTC 
TTTACCTATA 
CTTAATGTCG 
GGGCAGTTGA 
ATCAATCTGC 
TATCTAGTAG 
GGAGATATGA 
GATATCAATT 
GGTAAGACGA 
AGTCTGGGTA 
TATCTGTCTC 
GCCAAGGAGG 
GAGGATATCG 
ATTTCAGGTG 
GTCTTGATTT 
GATAATCTCA 
GAGCGGACAG 
GCTGATTTGC 
AGGATGAAAC 
TCCAGTAGTG 
GTTGCAGAGA 
GCAAATATCC 
CTGGTTAAGA 



664 
CTCAGGAGTA 



ATATCAAGCA 
TCGTGTCTCG 
CGATTTTCCT 
CCAATCTCTT 
TTATGAAGCC 
CTTCTATCAT 
AGCGTTACCA 
GTCGAGCAGA 
GCATTCTCTG 
TTACCTATAA 
AAACCAAGCT 
CTTCTGAGTT 
CCTTCAAGCA 
TAACCGTTCC 
CTTTGGCCAA 
GTGTCAATCT 
AACAGCCCTA 
GGACGACACA 
AGCGCATGCC 
GTCAACGTCA 
TGGATGAGGC 
TTGCTTTGGA 
AGAAGGTAGT 
TTGCACAGGG 
CAGAATTTTT 
TGATTGTACC 
AGGAGATGAG 
AGTCAACTAG 
AGGGGGATCT 



TCTCTTGCTT 
TGTTTTTCAC 
TTTTACAGAT 
AGATGTGTCA 
TTTCATGACT 
GTTTGAAAAG 
TGAGGACATC 
AAAAATTGAC 
GAGTCAGCAA 
GATGGGGGCT 
TACCTTGCTG 
TCAGACAGCG 
TGAGGAGAAG 
GGTTCATTAC 
CCAAGGGTCT 
GATGATGGTT 
CAATCAGATT 
TGTCTTTAAC 
GGAAGATATC 
ACTGAATTAC 
GAGAATCGCT 
GACTAGCAGT 
CAAGACCTTG 
TGTCTTGGAT 
TGGCTTTTAC 
AGAAAGTGCG 
CATGGCCCTT 
TTTGTCCACT 
CAACAATCGT 
TTTGGTTCAA 



GTTTTGGGGC 
CTCCCTATGT 
GCTAACAGTA 
ACGGTTGTCA 
TTATTGGCGC 
ATGAATCGGG 
AACGGTATTG 
AAGGAATTTG 
AAGGCTCTGA 
GTTCTGGTCA 
GTTTACTTTA 
CAGGTTGCCA 
AAAACAGTTG 
AAGTATGGCT 
AAGGTGGCTT 
AATTTTTACG 
GATAAAAAAG 
GGAACGATTT 
TTACGGGCGG 
CAGACAGAAT 
TTGGCGCGTG 
TTGGATATTT 
ATTTTCATTG 
CAGGGCAAGA 
GCCCATTTGG 
GAGTTTTATA 
CTGCTTGTGT 
AGAGCTACTG 
ATTCTTGTCA 
TACCAAGAAG 



5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 



WO 98/18931 



PCT/US97/19588 



665 

GGGCAGAGGG TGTCCAAGCG GAGTCCTATG CCAGTCAGTT GGACATGCTA AAGGATCAAA 7200 

AAAAGCAATT GGAGTATCTG CAAAAGAGCC TGCAAGAAGG GGAGAACCAC TTTCCAGAGG 7260 

AGGATAAGTT TGGCTACCAA GCCACCTTTC GCGACTACAT CAGTCAAGCA GGCAGTCTTA 7320 

GGGCTAGTAC ATCGCAACAA AATGAGACCA TCGCGTCCCA GAATGCAGCA GCTAGCCAAA 7380 

CCCAAGCCGA AATCGGCAAC CTCATCAGTC AAACAGAGGC TAAAATTCGC GATTACCAGA 7440 

-CAGCTAAGTC AGCTATTGAA ACAGGTGCTT CCTTGGCCGG TCAGAATCTA GCCTACTCTC 7500 

TTTACCAGTC CTACAAGTCT CAGGGCGAGG AAAATCCCCA AACTAAGGTT CAGGCAGTTG 7560 

CACAGGTTGA AGCACAGATT TCTCAGTTAG AATCTAGTCT TGCTACTTAC CGTGTCCAGT 7620 

ATGCAGGTTC AGGTACCCAG CAAGCCTATG CGTCAGGGTT AAGCAGTCAA TTGGAATCCC 7680 

TTAAATCCCA ACACTTGGCA AAGGTTGGTC AGGAATTGAC CCTTCTAGCC CAGAAAATTT 7740 

TGGAGGCAGA GTCAGGTAAG AAGGTACAGG GAAATCTTTT AGACAAGGGG AAAGTTACGG 7800 

CGAGTGAGGA TGGGGTGCTT CATCTTAATC CTGAGACCAG TGATTCTAGC ATGGTTGCAG 7 860 

AAGGTGCCCT ACTAGCCCAA CTTTATCCAT CTTTGGAAAG AGAAGGGAAA GCCAAACTCA 7920 

CAGCTTATCT AAGTTCAAAA TATGTAGCAA GAATCAAGGT CGGTGATTCT GTTCGCTATA 7980 

CTACGACTCA TGATGCCGGG AATCAACTTT TCCTAGATTC TACTATTACA AGTATTGATG 8040 

CGACAGCTAC TAAGACTGAG AAAGGGAATT TCTTTAAAAT CGAGGCGGAG ACTAATCTAA 8100 

CTTCGGAGCA GGCTGAAAAA CTTAGGTACG GGGTGGAAGG CCGCTTGCAG ATGATTACGG 8160 

GCAAGAAAAG TTACCTACGT TATTATTTGG ATCAATTTTT GAACAAAGAG TAATGTTCGT 8220 

GTTTTTAGAG TTAAATAATT TTTAAACTGT GAGAAAGATT CTTCTTGCAG TTTTTTCTTT 8280 

ACAATTTTTG AAAAACATCT ACTATTTATT CGGTTAAATT CTTGTGTTTT TTGGTTTTTT 8340 

GTGGTAAAAT GTGCTCAAGT AATACGAAAG GCGAACTTTA AAATGTCAAA ACAATTGATC 8400 

TATTCGGGAA AAGCTAAAGA TATCTATACA ACTGAGGATG AAAATCTTAT TATTTCAACT 8460 

TACAAGGACC AGGCGACTGC TTTCAACGGT GTCAAGAAGG AGCAGATTGC AGGTAAGGGA 8520 

GTCTTGAATA ATCAGATCTC ATCTTTTATT TTTGAGAAAT TAAATGTGGC TGGTGTGGCG 8580 

ACTCACTTTG TGGAGAAACT TTCAGACACG GAACAACTCA ATAAAAAGGT TAAGATTATT 8640 

CCTTTGGAAG TCGTGCTCCG CAACTATACT GCTGGTTCCT TTTCAAAACG TTTTGGTGTG 8700 

GATGAGGGAA TCGCCTTGGA GACTCCGATT GTCGAATTTT ACTACAAAAA TGATGATTTG 8760 

GATGATCCAT TTATCAATGA TGAGCATGTG AAATTCCTAC AGATTGCGGG TGACCAGCAG 8820 

ATTGCCTACT TGAAGGAAGA AACGCGTCGT ATCAATGAAC TATTGAAAGT CTGGTTTGCT 8880 
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GAGATTGGGC 


TTAAATTGAT 


TGACTTTAAG 


CTAGAGTTCG 


GTTTTGACAA 


GGATGGCAAG 


8940 


ATTATCTTGG 


CAGACGAATT 


TTCACCAGAT 


AACTGCCGCT 


TGTGGGACGC 


TGATGGCAAC 


9000 


CACATGGATA AGGATGTTTT 


CCGTAGAGGA TTGGGAGAAC 


TAACCGACGT 


TTATGAGATT 


9060 


GTTTGGGAAA 


AGTTGCAGGA 


ATTGAAATAA 


TCTGTTTGCA 


At-GGAAAACC 


TTCGTCTCTC 


9120 


AACTAAAAGG 


ACTCAGGCTG 


AAAAGGTCCC 


CCAGACCTTT 


rCACTt_TGTA 


GAGAACTAGG 


9180 


TGAACTAACA GATGTTTACG 


AAATTGTCTG 


GGAAAAGTTG 


U AuCKi J. i i AA 


AATAACAACC 


9240 


TCAAGGCTGT 


TTGGGAATAT 


TGCAAGAGCT 


GAAATAAAGG 


it * m tv tv /*» iv tv mm 

AATAAGAATT 


GATGGATAAA 


9300 


CGTATTTTTG 


TTGAAAAAAA 


GGCTGATTTT 


CAGGTCAAGT 


CAGAGAGTTT 


GGTTAGAGAG 


9360 


CTCCAGCACA 


ACTTGGGACT 


GTCAAGCTTG 


AAAAGTATTC 


GTATTGTGCA 


AGTATATGAT 


9420 


GTATTTGACT 


TGGCTGAGGA 


CTTGTTTGCA 


CCTGCAGAGA 


AGCACATTTT 


CTCTGAGCAG 


9480 


GTAACCGACC 


ATGTTTTAGA 


TGAAGTATCT 


GTGCAGGCGG 


ATCTTGCTAA 


CTATGCTTTC 


9540 


TTTGCCATTG 


AAAGTCTGCC 


AGGGCAGTTT 


GACCAGCGTG 


CAGCTTCGTC 


ACAGGAAGCC 


9600 


TTGCTTTTGT 


TGGGAAGTTC 


GAGTGACGTG 


ACAGTCAACA 


CAGCCCAACT 


TTACTTGGTG 


9660 


AATAAAGATA 


TTGATGCGAC 


TGAGTTGGAA GCTGTCAAAA 


ACTACCTGCT 


CAATCCAGTT 


9720 


GATTCTCGTT 


TCAAGGATAT 


CACGACAGGG 


ATTGCCAAGC 


AGGAGTTTTC 


AGAGTCAGAC 


9780 


AAGACCATTC 


CCAAATTGAC 


TTTCTTTGAA 


AGCTATGCAG 


CAGAAGACTT 


TGCTCGCTAC 


9840 


nnuUvVi Unn\> 


AAGGGATGGC 


CATGGAAGTG 


GATGATTTGC 


TCTTTATCCA 


AGACTACTTT 


9900 


AAGTCAATCG 


GGCGCGTGCC 


AACTGAGACT 


GAACTCAAGG 


TTTTGGACAC 


TTACTGGTCT 


9960 


GACCACTGCC 


GTCATACGAC 


TTTTGAGACA 


GAGTTGAAAC 


ACATCGACTT 


TTCAGCTTCT 


10020 


AAATTTCAAA 


AGCAATTGCA 


GTCAACCTAT 


GACAAGTATA 


TTGCCATGCG 


CGAGGAATTA 


10080 


GGTCGGTCTG 


AAAAACCACA 


AACCTTGATG 


GATATGGCGA 


CTATTTTCGG 


TCGTTATGAG 


10140 


CGTGCTAATG 


GACGATTGGA 


TGATATGGAA 


GTCTCTGACG 


AAA1 CAAIviL 


CTGCTCAGTT 


10200 


GAAATTGAAG 


TGGACGTTGA 


TGGTGTCAAG 


GAACCTTGGC 


TCCTCATGTT 


TAAAAACGAA 


10260 


ACCCACAACC 


ATCCAACAGA 


AATTGAGCCA 


TTTGGTGGAG 


CGGCTACCTG 


TATTGGTGGA 


10320 


GCTATTCGTG 


ATCCGTTGTC 


AGGCCGTTCC 


TATGTTTACC 


AAGCCATGCG 


TATTTCAGGT 


10380 


GCTGGTGATA 


TTACAGCACC 


GATTTCGGAA 


ACTCGCGCTG 


GGAAATTGCC 


ACAACAAGTC 


10440 


ATTTCTAAAA 


CAGCAGCTCA 


TGGTTATTCT 


TCATATGGTA 


ACCAGATTGG 


GCTTGCAACA 


10500 


ACCTACGTTC 


GTGAATACTT 


CCACCCAGGC 


TTTGTAGCTA 


AACGTATGGA 


ACTTGGTGCC 


10560 


GTTGTTGGTG 


CGACTCCCAA GGGCAATGTT 


GTCCGTGAAA 


AACCTGAAGC 


AGGTGATGTG 


10620 


ATCATCCTTC 


TCGGAGGCAA 


AACAGGTCGT 


GATGGTGTCG 


GTGGTGCGAC 


GGGCTCTTCT 


10680 
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AAGGTTCAAA 


CAGTTGAGTC 


TGTAGAGACT 


GCTGGTGCTG 


AGGTTCAAAA 


AGGAAATGCC 


10740 


ATCGAAGAAC 


GCAAGATTCA 


GCGCCTCTTC 


CGTAATGGCA 


ATGTCACTCG 


TCTGATCAAG 


1O8O0 


AAGTCCAATG 


ACTTTGGGGC 


AGGCGGTGTC 


TGTGTGGCTA 


TCGGTGAATT 


GGCAGACGGT 


10860 


CTTGAAATCG 


ACCTCAACAA 


GGTGCCTCTT 


AAATACCAGG 


GCTTGAATGG 


TACAGAAATT 


10920 


GCCATCTCTG 


AATCACAAGA 


ACGGATGGCG 


GTCGTGGTTC 


GTCCTGAAGA 


TGTGGATGCC 


10980 


-TTCGTTGCCG 


AATGTAACAA 


AGAAAATATT 


GATGCTGTTG 


TGGTGGCGAC 


AGTAACTGAA 


11040 


AAACCAAATC 


TTGTCATGCA 


CTGGAATGGT 


GAGACAATCG 


TTGACTTGGA 


GCGTCGTTTC 


11100 


CTTGACACCA 


ATGGTGTGCG 


CGTGGTTGTC 


GATGCCAAAG 


TTGTGGACAA 


GGATGTCAAA 


11160 


CTCCCAGAAG 


AGCGTCAAAC 


ATCTGCTGAA 


ACACTGGAAT 


CAGATACCCT 


TACGGTTCTA 


11220 


TCTGACCTCA 


ACCATGCAAG 


TCAAAAAGGA 


TTACAGACTA 


TCTTTGACTG 


CTCTGTTGGA 


11280 


CGCTCAACGG 


TTAATCACCC 


ACTTGGTGGT 


CGTTACCAAC 


TCACACCAAC 


TGAGGCATCT 


11340 


GTGCAGAAAT 


TGCCAGTTCA 


ACACGGTGTG 


ACTCATACTG 


CGTCGGTCAT 


TGCTCAAGGT 


11400 


TTCAACCCAT 


ATGTAGCTGA 


ATGGTCTCCA 


TACCACGGTG 


CTGCTTATGC 


GGTTATCGAA 


11460 


GCAACTGCTC 


GTTTGGTGGC 


TGCTGGTGCC 


AACTGGTTCA AGGCTCGTTT 


CTCTTACCAA 


11520 


GAGTATTTCG 


AGCGTATGGA 


TAAACAAGCA 


GAGCGTTTCG 


GTCAGCCAGT 


AGCTGCTCTT 


11580 


CTAGGTTCTA 


TTGAAGCACA 


AATTCAGCTT 


GGCTTGCCAT 


CTATCGGTGG 


TAAGGACTCC 


11640 


ATGTCTGGTA 


CCTTTGAAGA 


ATTGACCGTT 


CCGCCAACCT 


TGGCTGCCTT 


TGGGGTGACG 


11700 


ACGGCAGATA 


GCCGTAAGGT 


GCTCTCTCCA 


GAATTTAAAG 


CTGTTGGGGA 


AAATATCTAC 


11760 


TACATCCCAG 


GTCAAGCCCT 


CTCTGCAGAG 


ATTGATTTTG 


ACTTGATTAA 


GAAAAATTTT 


11820 


GCTCAGTTTG 


AAGCCATCCA 


AGCTGACCAT 


AAAGTGACAT 


CTGCATCAGC 


TGTCAAATAC 


11880 


GGTGGTGTAG 


TTGAAAGTTT 


GGCTCTTGCT 


ACCTTTGGAA 


ACTATATTGG 


TGCAGAGGTG 


11940 


ACCTTGCCTG 


AACTTGAAAC 


AGCTTTGACA 


GCTCAATTAG 


GCGGCTTTGT 


CTTCACATCT 


12000 


CCTGAAGAAA 


TTGCTGGAGT 


AGAGAAGGTT 


GGACAAACGA 


AAGCAGACTT 


TACACTGACT 


12060 


GTCAACGGTG 


TGAAGCTAGA 


TGGACACAAG 


CTTGACAGTG 


CATTTCAAGG 


GACATTGGAA 


12120 


GAAGTTTACC 


CAACAGAATT 


TACCCAAGCG 


AAAGAACTAG 


AAGAAGTACC 


AGCTGTGGCA 


12180 


TCAGATGTTG 


TGATTAAAGC 


CAAAGAAAAG 


GTTGAAAAAC 


CTGTGGTTTA 


CATCCCAGTC 


12240 


TTTCCAGGAA 


CCAACTCAGA ATATGATTCA 


GCTAAGGCCT 


TCGAAAAAGA 


AGGTGCAGAG 


12300 


GTCAATTTGG 


TGCCATTCGT 


GACCTTGAAT 


GAAGAAGCTA 


TTGTCAAGTC 


AGTTGAAACT 


12360 


ATGGTTGACA 


ATATCGACAA 


GACTAATATT 


CTCTTCTTTG 


CTGGTGGATT 


CTCGGCTGCG 


12420 
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GATGAACCAG 


ATGGTTCAGC 


I TAAGTTTATC 


GTCAATATCC TGCTTAATGA 


AAAAGTGCGT 




GTGGCTATTG 


ATAGCTTTAT 


CGCCCGTGGT 


GGTTTGATTA TCGGTATTTG 


TAATGGATTC 


1254 0 


CAAGCCTTAG 


TCAAATCGGG 


TCTCCTACCC 


TACGGAAACT TTGAAGCTGC 


TAACAGTACT 


1 j finn 


AGCCCAACCC 


TCTTCTACAA 


TGATGCCAAC 


CAACACGTGG CCAAGATGGT 


GGAAACTCGC 


1 9fif\f) 


ATTGCCAATA 


CCAACTCACC 


ATGGTTGGTT 


GGTGTGCAAG TGGGCGATAT 


CCACGCTATT 


1 070(\ 


CCTGTTTCGC 


ACGGTGAAGG 


GAAGTTTGTC 


GTGACGGCTG AGGAATTTGC 


AGAGCTCCGT 


XC /UU 


GACAATGGAC 


AAATTTTCAG 


CCAATACGTT GACTTTAACG GTAAACCAAG 


TATGGATTCT 




AAGTACAATC CGAATGGTTC TGTCCATGCC ATCGAAGGAA TTACCAGCAA GAATGGTCAA 


12900 


ATCATCGGTA AGATGGGCCA CTCAGAACGT TATGAGGATG GTCTTTTCCA 


AAATATCCCA 


12960 


GGCAATAAAG 


ACCAACACCT 


GTTCGCATCA GCGGTTAAAC ATTTCACTGG 


AAAATAAGAC 


13020 


TTACAGATTT 


TCTAATAGAT 


AGTATCAGTA ATGTAAAAGT CATGTAAATC 


TAGCTCTTGA 


13080 


TGATTACAAA TGAAAATTAG GTATAAAAAA TGACATACGA AGTAAAATCT 


CTTAATGAAG 


13140 


AATGTGGTGT 


TTTCGGTATT 


TGGGGACATC 


CAGATGCTGC TAAGTTGACC 


TATTTTGGAC 


13200 


TCCACAGTCT 


TCAACACCGT 


GGTCAGGAGG 


GGGCAGGAAT CCTCTCCAAT 


GATCAAGGAC 


13260 


AACTGAAGCG 


CCATCGTGAC 


ATGGGGCTTT 


TATCAGAAGT TTTCAGAAAT 


CCAGCTAATT 


13320 


TGGATAAATT 


GACAGGAGCT 


GGTGCGATTG 


GGCATGTGCG TTATGCGACT 


GCTGGCGAAG 


13380 


CTTCTGTAGA 


TAACATCCAG 


CCCTTCCTCT 


TCCGTTTTCA CGATATGCAG 


TTTGGTTTGG 


13440 


CTCATAATGG 


AAATCTGACC 


AATGCAGCCT 


CTCTCAAGAA AGAACTGGAA 


CAAAGAGGAG 


13500 


CAATTTTCAG 


CGCGACTTCG 


GACTCTGAAA 


TCTTGGCTCA CCTCATTCGT 


CGCAGTCATA 


13560 


ATCCTAGCCT 


GATGGGCAAA 


ATCAAGGAAG 


CGCTCAGCCT TGTCAAAGGT 


GGTTTTGCCT 


1 

ljozU 


ATATCTTGCT 


GTTTGAGGAC 


AAGTTGATTG 


CGGCTCTTGA CCCAAATGGA 


TTCCGACCGC 


1 icon 


TTTCGATTGG 


TAAAATGGCT 


AATGGAGCAG 


TTGTTGTATC TTCTGAAACC 


TGTGCTTTTG 


J. J / 4 U 


AGGTCATTGG 


TGCCGAGTGG 


ATTCGTGATT 


TGAAGCCAGG TGAGATTGTG 


ATCATTGATG 


13800 


ACGAGGGCAT TCAGTATGAC AGCTATACAG ATGATACCCA GTTGGCGGTT TGTTCTATGG 


13860 


AGTATATCTA 


CTTTGCTCGC 


CCTGATTCTA 


ATATCCACGG TGTCAATGTC 


CATACGGCAC 


13920 


GTAAGAGAAT GGGAGCGCAA TTGGCGCGAG AATTTAAGCA TGAGGCAGAT ATTGTAGTTG 


13980 


GTGTGCCCAA 


TTCTTCCCTA 


AGCGCGGCTA 


TGGGATTTGC GGAAGAATCA 


GGCTTACCAA 


14040 


ATGAAATGGG 


TCTGATCAAA 


AACCAATACA 


CCCAGCGAAC TTTTATCCAA 


CCGACTCAAG 


14100 


AATTGCGGGA 


GCAAGGAGTG 


CGGATGAAAC 


TGTCTGCTGT TTCGGGTGTT 


GTCAAAGGCA 


14160 


AACGTGTGGT 


CATGGTGGAT 


GATTCCATTG 


TACGTGGAAC AACCTCTCGT 


CGTATCGTTC 


14220 
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AGCTCTTGAA 
CGTATCCATG 
ATACGGTCGA 
ATGGCTTGAT 
CTTACTTTGA 
-GTTTGGAAGA 
AAAAATAAAT 
AGCATTTAGT 
TTTAGATAAA 
GAAAAATAAA 
TTATGAAGTT 
GGGAGCTCTT 
CGTCTTGATT 
CAAGCACGAT 
AGGTGCGGAA 
GCTAGAACAA 
CGGTGGGGAA 
TTTTGCGGTC 
AGATGTTCTT 
TCGTGTCTTT 
TAAGGAAGTT 
AGAAGAGTTG 
TCGTATGTTT 
AATTTTCAAA 
CAATATGGGT 
ATTGTTGGAT 
CATTATCAAA 
ATTGCCGAAG 
GAGCGTGCAA 



AGAAGCGGGT 
TTTCTACGGG 
AGAAACTCGC 
TGAGTCGATT 
CGGTGACTAC 
AAAGACCAGT 
GACAAATAAA 
CAGTAGACGC 
AAGATGGTTT 
ATGGCAAATA 
GTTGAACGGA 
GGTGGCTTTG 
TCAGGGACTG 
ACCATCGGGC 
CCCCTCTATT 
GTGGTTGCTG 
ACGGCTGAAA 
GGTGTGGCTG 
CTCGGACTTG 
GCGGATTACA 
CTACTTGAGC 
GTCAACGGCA 
GCAGATGACC 
ACCCTTGAAA 
GTGGGACTTA 
GAAGCAGTCT 
TGAAAAAAAT 
AATTTCCAGT 
AGCAGCTCGG 



GCGACTGAGG 
ATTGATATCC 
CAAATCATTG 
GGTATCGAAA 
CCAACGCCTC 
TTTTACAAGT 
AATGCATATG 
TTTGTCCTAT 
ATCTAAAAAT 
AAAATGCGTA 
TTAAAAAGCA 
GTGGTATGTT 
ACGGTGTCGG 
AGGACTGTGT 
TTCTCGACTA 
GTGTGGCAGA 
TGCCGGGCAT 
AAAAATCTCA 
CTTCAAGTGG 
CAGGTGAGGA 
CGACTCGTAT 
TTGCCCACAT 
TAGCTGCTGA 
AATACGGTCA 
TGTTGGCGGT 
ATGAAATTGG 
AGCGGTTTTT 
GGAGTTTGTC 
CGTTCTGTCC 



669 
TTCACGTTGC 
AGACCCGTCA 
GTGCGGACAG 
CAGATGCGCC 
TTTATGACTA 
AGGCGACAGA 
CCTCACGTCT 
AGGATCAAAG 
ACGTCGCAGT 
CGCTCAATCT 
CGTGGCCCGT 
TGACCTTTCC 
AACCAAGCTC 
GGCCATGTGT 
CGTAGCGACA 
AGGTTGTGTG 
GTACGGCGAA 
AATCATTGAC 
GATTCACTCA 
AGTCCTACCA 
CTATGTCAAG 
CACAGGTGGT 
AATTGATGAA 
GATTAAACAC 
CAGCCCTGAA 
TCGCATCGTC 
GCCTCTGGTA 
TTTTCAGACC 
TATGCTTTTG 



CATTGGAAGT 
GGAGCTGATT 
TCTGACTTAT 
GAACGGTGGT 
CGAAGAAGAC 
TTCTCCATTA 
CACTACTGAC 
CTAGAGCCCT 
CTTTCTCAAA 
GGTGTGGATG 
ACGGAGCGTG 
AAGACTGGGG 
ATGTTGGCTA 
GTCAACGACA 
GGGAAGAATG 
CAGGCTGGTG 
GACGACTATG 
GGTTCAAAGG 
AATGGTTACT 
GAATTGGAAG 
GCTGTCTTGC 
GGCTTTATCG 
AGTAAAGTTC 
GAAGAAATGT 
AATGTAGAGC 
AAGAAAGAAA 
ATGGCTCAAA 
ATCGTGATGC 
AACTCAAGGA 



CCTGCACTAG 
GCAGCCAATC 
CTTTCAATTG 
CTCTGTGTCG 
TATCGTAGAA 
AAGAAAAGGA 
TAAAGGCTTA 
GACTAGTATT 
AAAAGAAAAG 
TTGAAGCGGG 
CAGGTGTCAT 
TTAAAGAACC 
TCAAGTACGA 
TCATTGCTGC 
AACCAGCTAA 
CTGCCCTCAT 
ACTTGGCTGG 
TGGTAGAGGG 
CTTTGGTTCG 
GCAAGAAACT 
CGCTCATCAA 
AAAATGTCCC 
CAGTGCTTCC 
TTGAAATCTT 
GTGTAAAAGA 
ACGAAAGTGT 
TTTTCAGGTG 
CTATGTGCTT 
GTTTGAGAGC 



14280 

14340 

14400 

14460 

14520 

14580 

14640 

14700 

14760 

14820 

14880 

14940 

15000 

15060 

15120 

15180 

15240 

15300 

15360 

15420 

15480 

15540 

15600 

15660 

15720 

15780 

15840 

15900 

15960 
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AAGGCAGACT ACGAAGCAGC 


CCTTGTCGAA 


CTCTTGGAAG 


& Ji 7\ f^C* TV ft T 


m/™» 7V OITVTXO /*^rnrn 
luAA-l 1\jVj1 I 


16020 


TGCCTAGCAG GCTACATGAA 


AATCGTTGGA 


CCAACCTTAT 








ATTGTCAACA TTCATPPAfiC 


CTACTTGCCA GAATTTCCAG 




JjAi IvjAGGAT 


16140 




TCAGTCTGGT 


GTGACCATTC 


AC rGGO I w A 


TTCGGGTGTG 


16200 


GATACAGCZCC AfWPPATT'Aa 


ACAGGTTCGT 


GTGCCACGAC 


TAG CTGATGA 


taccattgac 


16260 




TGAAGCAGAG 


TACAGGCTGT 


ATCCGGAAGT 


AGTGAAGGCT 


16320 


v» i A i i i nUno A 1 luAL X 1 I 1 


TGATGATTCA 


TATGATATCT 


TTGATTTTAA 


ATTGGAGTCA 


16380 


GTGTTTGTTG AAGACGGCTT 


CAAACGGAGG 


TATTTGTAAT 


GTTAGAATCT 


AAAAAAACAA 


16440 


CTCGATATGT ATTTTATGTC 


TATCTGATGT 


TATTAACTTG 


GGGAATCTTA 


TTTAAGTTTG 


16500 


AAACAAATCC TGAATTTATA 


GCATTTTTCT 


TAGCTCCAAG 


GTATATCAAT 


TGGATTCCAT 


16560 


TTTCAGAACC ACTAATAGTC 


GATGGAAAAA 


TTGTTTTTGC 


TGAAATGTTA 


TTTAATCTGA 


16620 


TTTTCTTTAT TCCATTAGGT 


GTTTGTTTCC 


CTTTGATAAA 


AACTAATTTA 


TCTAGTTTAA 


16680 


GAATAGTCGG GACAGGTTTC 


TTGATTAGTT 


TATTGTTTGA 


GTGCTTACAG 


TATATTTTAG 


16740 


CAATAGGTAT AACAGATATA 


ACGGATTTGA 


CTTTAAATAC 


GCTAGGTGTC 


TGTGTAGGCT 


16800 


TACTGATTTA TCAAATTTTT ATAAGAGTGT TCAAATCACA GACTAGAAAA TGGATCAATA 


16860 


TCTTAGGTAT GCTTAGCCTT GGTTTTGCTT ATCTTGTTTT ACTGTTACTG 


CATTTACTTA 


16920 


GTGTTTAACT AATGATTAAA AAGGAGAATA TAATGACTAA ACGCGTCTTA ATCAGCGTCT 


16980 


CAGACAAAGC GGGCATTGTT 


GAATTTGCCC 


AAGAACTCAA 


AAAACTTGGT 


TGGGAGATTA 


17040 


TCTCAACAGG TGGAACTAAG 


GTTGCCCTTG ATAATGCTGG 


GGTGGATACC 


ATTGCTATCG 


17100 


ATGATGTGAC TGGTTTCCCA 


GAAATGATGG 


ACGGTCGTGT 


GAAGACCCTC 


CACCCAAATA 


17160 


TCCACGGAGG GCTTCTCGCT 


CGTCGTGACT 


TGGATAGCCA 


CTTGGAAGCG 


GCTAAGGACA 


17220 


ACAAGATTGA GCTCATTGAC 


CTTGTGGTGG 


TCAACCTTTA 


CCCATTTAAG 


GAAACTATCC 


17280 


TTAAACCAGA TGTGACTTAT 


GCTGATGCAG 


TTGAAAATAT 


CGATATTGGT 


GGGCCATCTA 


17340 


TGCTTCGTTC AGCAGCGAAA 


AATCATGCCA 


GTGTTACAGT 


TGTGGTAGAT 


CCTGCTGACT 


i.7400 


ACGCTGTGGT TTTGGATGAA 


TTGGCAGCAA 


ACGGCGAAAC 


CTCTTATGAA 


ACTCGCCAAC 


17460 


GTTTAGCAGC CAAAGTATTT 


CGTCACACAG 


CGGCTTATGA 


CGCCTTGATT 


GCAGAATACT 


17520 


TCACAGCTCA AGTGGGTGAA 


AGCAAGCCTG 


AAAAACTCAC 


TTTGACTTAT 


GACCTCAAGC 


17580 


AACCAATGCG TTACGGTGAG 


AATCCTCAAC 


AAGACGCGGA 


CTTTTACCAG 


AAAGCTTTGC 


17640 


CTACAGACTA CTCCATTGCT 


TCAGCCAAAC 


AGCTCAACGG 


GAAAGAATTG 


TCATTTAATA 


17700 


ATATCCGTGA TGCAGATGCT 


GCTATCCGTA 


TCATCCGTGA 


CTTCAAAGAT 


AGTCCAACCG 


17760 
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TTGTGGCTCT 


CAAACACATG 


AATCCATGTG 


GAATTGGTCA 


AGCTGATGAC 


ATCGAGACTG 


17820 


CTTGGGACTA 


CGCTTATGAG 


TCTGACCCAG 


TGTCTATCTT 


TGGTGGGATT 


GTCGTCCTCA 


17880 


ACCGTGAGGT 


GGATGCTGCG 


ACAGCTGAGA AGATGCACGG 


CGTTTTCCTC 


GAAATCATCA 


17940 


TTGCACCAAG 


CTATACGGAT 


GAAGCGCTAG 


CCATTTTGAT 


CAATAAAAAG 


AAAAACTTGC 


18000 


GTATCCTTGC 


CTTGCCATTT 


AATGCTCAAG 


AGGCTAGCGA 


AGTGGAAGCA 


GAATACACAG 


18060 


-GTGTAGTCGG 


TGGACTTCTC 


GTGCAAAATC 


AAGACGTGGT 


CAAGGAAAGC 


CCAGCTGACT 


18120 


GGCAAGTGGT 


GACTAAACGT 


CAGCCAACTG 


AGACAGAAGC 


GACTGCTCTT 


GAGTTCGCTT 


18180 


GGAAGGCTAT 


CAAGTACGTC 


AAATCAAATG 


GTATTATCGT 


GACCAACGAC 


CACATGACAC 


18240 


TTGGTGTTGG 


TCCAGGTCAA 


ACCAACCGTG 


TGGCTTCTGT 


TCGCCTTGCC 


ATTGACCAAG 


18300 


CCAAAGATCG 


TCTGGACGGG 


GCGGTCCTTG 


CTTCAGATGC 


CTTCTTCCCA 


TTTGCGGATA 


18360 


ACGTGGAAGA 


AATCGCCAAA 


GCAGGAATTA 


AGGCCATCAT 


CCAGCCCGGT 


GGCTCTGTCC 


18420 


GTGACCAAGA ATCCATCGAA GCAGCGGATA AATACGGCTT GACTATGGTC TTTACAGGTG 


18480 


TGAGACATTT 


TAGACATTAA 


GAAGATAAAA 


GGGAAGAAAA 


CAGTTTCTTT 




18540 


CTTAAAATAC 


TAACTGAAAC 


AAGATTAAAA 


CGAACTTTTT 


TGATATAATG 


TTGGTAAATA 


18600 


ATTCGCAAAA 


GAGGTTGAGG 


AATGAAACTG 


CTTGTTGTCG 


GTTCTGGTGG 


TCGTGAGCAT 


18660 


GCGATTGCTA 


AAAAGTTACT 


TGAATCAAAA 


GACGTGGAAA 


AAGTCTTTGT 


AGCTCCTGGG 


18720 


AATGATGGGA 


TGACTCTGGA 


TGGTTTGGAA 


TTGGTAAATA 


TCTCTATTTC 


CGAACATTAT 


18780 


AAATTGATTG 


ACTTCGCAAA 


GACCAATGAT 


GTTGCTTGGA 


CCTTTATCGG 


TCCAGATGAT 


18840 


GCCCTTGCTG 


CTGGTATCGT 


GGATGATTTT 


AACCAAGCTG 


GACTTAAGGC 


CTTTGGTCCG 


18900 


ACTAGGGCTG 


CAGCGGAGCT 


GGAGTGGTCC 


AAGGATTTCG 


CCAAGGAAAT 


CATGGTCAAA 


18960 


TACGGCGTTC 


CGACAGCAAC 


ATATGGCACA 


TTTTCAGATT 


TCGAGGAAGC 


CAAAGCCTAT 


19020 


ATCGAAAAGC 


ATGGTGCGCC 


TATCGTAGTC 


AAGGCGGATG 


GCTTGGCACT 


TGGGAAGGGT 


19080 


GTCGTCGTTG 


CTGAGACGGT 


TGAGCAAGCG 


GTCGAAGCCG 


CTCATGAGAT 


GCTTTTGGAC 


19140 


AATAAATTTG 


GTGACTCAGG 


TGCGCGCGTG 


GTTATTGAGG 


AATTCCTTGA 


AGGAGAGGAA 


19200 


TTTTCACTCT 


TTGCCTTTGT 


CAATGGTGAT 


AAGTTCTACA 


TCATGCCAAC 


GGCTCAGGAC 


19260 


CACAAACGTG 


CCTATGATGG 


CGACAAAGGG 


CCTAACACGG 


GTGGTATGGG 


TGCCTATGCG 


19320 


CCAGTCCCAC 


ACTTACCACA 


GAGTGTAGTT 


GATACAGCGG 


TTGACACCAT 


TGTCAAGCCA 


19380 


GTTCTAGAAG 


GGGTGATTAA 


AGAAGGTCGC 


CCTTATCTGG 


GAGTTCTTTA 


CGCAGGGCTT 


19440 


ATCCTGACAG 


CTGATGGACC 


GAAAGTCATT 


GAGTTCAACG 


CTCGGTTCGG 


AGATCCAGAA 


19500 
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ACTCAGATTA 


TCTTGCCTCG 


CTTGACCTCT 


GACTTTGCTC 


AAAATATCAC 


AGATATCCTG 


19560 


GATAGCAAGG 


AGCCAAATAT 


CATGTGGACG 


GACAAGGGTG 


TGACTCTGGG 


TGTGGTTGTC 


19620 


GCATCCAAGG 


GCTACCCGCT 


AGACTATGAA 


AGGGGCGTTG 


AGTTGCCAGC 


CAAGACAGAA 


19680 


GGCGATGTCA 


TCACCTACTA 


TGCAGGGGCT 


AAGTTTGCGG 


AAAATAGCAG 


AGCACTGCTC 


19740 


TCAAACGGCG 


GACGAGTTTA 


TATGCTCGTT 


ACCACAGCAG 


ATACCGTCAA 


AGAAGCCCAA 


19800 


GCCAGCATAT 


ACCAAGAACT 


ATACCAACAA 


AAAATAGAAG 


GACTCTTCTA 


CCGAACAGAT 


19860 


ATCGGAAGCA 


AGGCAATTAA 


GTAAAGATAT 


AAGAATAACG 


CGCCGTAGTC 


GCCAAACACG 


19920 


ATAATGGTCG TCGTGGTGAA AAGACCAGAA CAGTGAATGT 


TCTGGTCAGG 


GGGAAACTTG 


19980 


GAGACCTTAG 


GCTCAAAGTT 


TAGGAATGAA ACCGAAGGTT 


TGCTTCCGCC 


TCCATCACCT 


20040 


AAGACCATTA TCAAAAAGAA AAATAAAAAT TCACAAAATA CGTTAATGAT CGTATGGTTT 


20100 


GCGAGCGTTA 


GCGAGCTAAT 


ATAGAACAAT 


CACCGCCGTT 


GTGAAAGAAC 




20160 


TAATCCAATC 


GTTCAGGGAA 


ATTGGAAGAC 


CTTGGGTTTC 


CAATTTAGGC 




20220 


TTTGGTGGCT 


GCTGCCGTCC 


CTCACAAGCT 


AAGGTGATTG 


TTGAAAAAGA 


GGAAAAAGGA 


20280 


GAAGAAATGA 


AACCAGTAAT 


TTCCATCATC 


ATGGGCTCAA 


AATCCGACTG 


GGCAACCATG 


20340 


CAAAAAACAG 


CAGAAGTCCT 


AGACCGCTTC 


GGTGTAGCCT 


ACGAAAAGAA 


AGTTGTTTCC 


20400 


GCACACCGTA 


CACCAGACCT 


CATGTTCAAA CATGCAGAAG 


AAGCCCGTAG 


TCGTGGCATC 


20460 


AAGATCATCA 


TCGCAGGTGC 


TGGTGGCGCA 


GCGCATTTGC 


CAGGCATGGT 


AGCTGCCAAA 


20520 


ACAACCCTTC 


CAGTCATTGG 


TGTGCCAGTC 


AAGTCTCGTG 


CTCTTAGTGG 


AGTGGATTCA 


20580 


CTCTATTCTA 


TCGTTCAGAT 


GCCGGGTGGG 


GTGCCTGTTG 


CGACCATGGC 


TATCGGTGAA 


20640 


GCTGGAGCGA 


CTAACGCAGC 


TCTCTTTGCC 


CTCCGTCTCC 


TCTCTGTAGA 


AGATAAGTCC 


20700 


ATTGCGGATG 


CACTTGCCAA 


CTTTGCTGAA 


GAACAAGGAA AAATCGCAGA 


GGAGTCGTCA 


20760 


AATGAGCTCA 


TCTAAAACAA 


TCGGAATTAT 


CGGTGGCGGT 


CAACTGGGTC 


AGATGATGGC 


20820 


CATTTCTGCT 


ATCTACATGG 


GCCACAAGGT 


TATCGCGCTG 


GATCCTGCGG 


CGGATTGCCC 


20880 


GGTCTCTCGT 


GTGGCGGAAA 


TCATTGTGGC 


ACCTTATAAC 


GATGTAGACG 


CCCTCCGTCA 


20940 


GTTGGCAGAC 


CGTTGCGATG 


TCCTCACTTA 


TGAGTTTGAA 


AATGTCGACG 


CTGACGGTTT 


21000 


GGATGCCGTT 


ATCAAGGATG 


GACAACTCCC 


TCAAGGAACA 


GATCTGCTCC 


GCATTTCGCA 


21060 


AAATCGTATT 


TTTGAAAAGG 


ACTTTTTGTC 


AAACAAGGCT 


CAAGTCACTG 


TGGCACCCTA 


21120 


CAAGGTCGTG 


ACTTCTAGCC 


TAGACTTGGC 


AGATATCGAC 


TTGTCGAAAA 


ACTATGTCCT 


21180 


CAAGACTGCG 


ACTGGTGGCT 


ACGATGGTCA 


TGGACAAAAG 


GTTATTCGTT 


CAGAAGCAGA 


21240 


CTTGGAAGCA 


GCCTATGCGC 


TAGCAGACTC 


AGCAGACTGC 


GTCTTGGAAG 


AATTTGTCAA 


21300 
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CTTTGACCTT 


GAGATTTCTG 


TCATCGTGTC 


AGGAAATGGC 


AAGGAGGTGA 


CGTTTTTCCC 


21360 


AGTTCAGGAA 


AATATCCACC 


GCAACAATAT 


CCTGTCTAAG 


ACCATCGTAC 


CAGCCCGCAT 


21420 


TTCTGAAAGT 


CTAGTAGACA 


AGGCTAAAGC 


TATGGCAGTG 


CGAATCGCAG 


AACAACTCAA 


21480 


CTTGTCTGGA ACTCTCTGTG 


TGGAAATGTT 


TGCGACAGCT 


GATGACATCA 


TTGTCAATGA 


21540 


AATCGCCCCA 


CGACCACATA 


ACTCTGGGCA 


CTATTCTATT 


GAAGCCTGTG 


ATTTCTCTCA 


21600 


- GTTTGACACC 


CATATTCTGG 


GTGTTCTCGG 


AGCACCATTA 


CCAGTCATCA 


AACTCCATGC 


21660 


GCCAGCCGTT ATGCTTAATG 


TCCTCGGTCA 


GCATGTCGAG 


GCTGCTGAAA 


AATATGTCAC 


21720 


AGAAAATCCA AGCGCCCACC 


TCCACATGTA 


TGGTAAAATA 


GAAGCAAAGC 


ATAATCGTAA 


21780 


GATGGGACAT 


GTGACTTTGT 


TTAGTGATGT 


GCCGGATAGT 


GTGGAAGAGT 


TTGGGGAAGG 


21840 


GATTGATTTT 


TAGGACAAGT 


CTATGATACA 


AATTATCGTT 


AATACATTTA 


TTGAAAAGTA 


21900 


TAAGACTGGA GCAGTTGTTG 


AAGTGTTGTA 


TGCCAGTGCT 


GACCAAGATA 


AGGTACAAGC 


21960 


TAAATATGAA 


GAACTAGCTG 


CACAATACCC 


CGAAAATTAT 


TTAGCTATCT 


ATAATGTACC 


22020 


GCTGGATACG 


GATTTGAATA 


CACTAGATCA 


TTACCCGTCT 


GTGTTTATTG 


GAAAAGAGGA 


22080 


GTTTGAGTAG 


AAATCTTGGT 


TTACCTAGAT 


AGCTTATTCC 


CAACAGCTTA 


AGAAGAAAGG 


22140 


AAAAATTAAC 


ACATGATCAA 


CCGTTACTCT 


CGCCCTGAGA 


TGGCGAATAT 


TTGGAGTGAA 


22200 


GAAAATAAAT 


ACCGTGCTTG 


GCTTGAGGTG 


GAAATCCTCT 


CTGACGAGGC 


ATGGGCTGAG 


22260 


TTGGGGGAAA 


TCCCTAAGGA 


AGATGTGGCT 


TTGATTCGCA 


AGAAGGCGGA 


CTTTGACATC 


22320 


GACCGTATTT 


TGGAAATTGA GCAGGAGACG 


CGCCACGATG 


TGGTGGCTTT 


CACGCGTGCG 


22380 


GTTTCTGAGA 


CTCTTGGTGA 


AGAGCGCAAG 


TGGGTTCACT 


ATGGGTTAAC 


TTCTACTGAC 


22440 


GTGGTGGATA 


CTGCTTATGG 


TTACCTCTAC 


AAGCAGGCCA 


ACGACATCAT 


CCGTCGTGAC 


22500 


CTTGAAAACT 


TCACTAATAT 


CATCGCTGAC 


AAGGCCAAGG 


AGCACAAGTT 


CACCATCATG 


22560 


ATGGGGCGTA 


CTCATGGTGT 


GCACGCTGAG 


CCGACAACCT 


TTGGTCTTAA 


ATTAGCAACT 


22620 


TGGTACAGCG 


AAATGAAACG 


CAATATCGAG 


CGCTTCGAGC 


ATGCGGCTGC 


TGGTGTAGAA 


22680 


GCTGGTAAGA TTTCTGGTGC GGTTGGGAAC 


TTTGCCAATA 


TCCCACCATT 


TGTAGAGGAG 


22740 


TATGTCTGCG 


ATAAACTTGG 


CATCCGTGCC 


CAAGAAATCT 


CTACACAAGT 


CCTTCCTCGT 


22800 


GACCTTCACG 


CTGAGTACTT 


TGCGGTTCTT 


GCCAGCATTG 


CGACTTCAAT 


CGAACGTATG 


22860 


GCGACTGAGA 


TTCGTGGTCT 


ACAAAAATCT 


GAGCAACGCG 


AAGTAGAAGA 


GTTCTTTGCT 


22920 


AAAGGGCAAA 


AAGGGTCTTC 


AGCAATGCCT 


CACAAACGCA 


ACCCAATCGG 


TTCTGAAAAT 


22980 


ATGACTGGTC 


TGGCGCGTGT 


CATTCGTGGT 


CACATGATTA 


CGGCTTATGA 


AAACGTCGCT 


23040 
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CTCTGGCATG AACGCGATAT TTCTCACTCA TCAGCTGAGC GTATCATCAC ACCAGATACG 23100 

ACCATTTTGA TTGACTACAT GCTCAACCGT TTTGGAAATA TCGTCAAGAA CTTGACAGTC 23160 

TTCCCAGAAA ATATGATCCG AAACATGAAC TCGACTTTTG GTCTTATCTT TAGCCAACGG 23220 

GCTATGTTGA CATTGATTGA AAAAGGCATG ACCCGTGAGC AAGCCTATGA CTTGGTGCAA 23280 

CAAAAACAGC CTACTCTTGG GACAACCAAG TAGACTTTAA ACCACTTCTT GAGGCAGATT 23340 

CAGAAGTAAC ATCACGTCTC ACACAAGAAG AAATCGATGA AATCTTCAAC CCAGTTTATT 23400 

ACACCAAACG AGTGGATGAT ATCTTTGAAC GTCTTGGACT AGGTGATTAA TTAAAAAATA 23460 

AACAGCGAGC TTCAATCTCG CTGTTTATTT TTTATCGAAA AGACTTAGTC TTCTTTTCTT 23520 

TTAGTGAGTC CATAGGCTGC TAGTGTGGAC ATGAGTCCTG CGACTACTAG TCCTGCAGAA 23580 

TCGTGAGTTC CTGTTTCAGG AAGTTTTTTC TCTGTTACCA CAGGAGCTGG ATCTTGAGGA 23640 

AGAACTTTGC TTTCCTCAGC AGGAGCAGTT GATGGAGCTG GTTGGCTTGG GATTTCTAGT 23700 

TTTGGTTTTT CTTCAGCAAT AGCGGCTTGT CCGTTTTCAT CGCCTACATG TGTTACCATA 23760 

GTTCCGACTT CGACTATTTG AGTAACGGCT TCCTGTGCTA CGACACTATT TACAAGTGTT 23820 

TTCACTTCCT TACCATCGGC AGAAGTGCTC ACAGAGTAGA AGTTGCTACG ATGTCCATTG 23880 

ACGCCCTTAG TAATGACTTG TGTTTTTCCT TTGAGTAAGA GTGGATTTTC ACAAGTCACT 23940 

GTGGTAAATG GAATTTCTTC TTCTTGGATA TCCAGTCTAG GTTTTACCTC AGTAGTTGGT 24000 

GCAAGACCAC TTTCATCACC CTTGTGAGTT ACAGGAGCGC CAACTTCAAC CACTTGGTTT 24060 

ATAACTTCTT TGGTTACCTG GCTATCAAGG ACTGTTTCTG TTGTTTTTCC ATTTTCAGTG 24120 

AGTACAGAGA TGTAATGAGT TCGTTCACCT TTGACTCCTG CTGTGATAAT ATTTTCCTGA 24180 

CCGGCTGGGA GGTTAGGATT TTCTTTCTTG ATAACTTCAA ATGGAATTTC TTCAGTTCTT 2424 0 

GTGATGAGTT CTGGTCTGGT TTCAACATTG GCAGCCACTT CATTTTCATC TAGGCTTCCT 24300 

GAATGAGTTA CAGCTGGTTT GAGGCCTTGA AGAGCGGCTT TTAGGTTGGC TACAAGCGTG 24360 

TCAAGCTCAG CTTGTTTATT ACGGTTGAGG TTGTAATTTA GAGCTGTTTT AGCTGCGTCA 24420 

AGGGCCTCAA GACTTTCTTT ACTATATCCT TCTAAGTTTG TAGGAATTTT AGCTAATTCT 24480 

TCGCGGAGAG CATTATAATT AGCACGAAAG TAGTCTTTGT TGTGGTCTGC AAAGGCAGTC 24540 

ATGAGTTCAA AGATTTCCTC TTCCTTGTAT TCAGCGCTTG GTCTATCTGC CCAGATTGAA 24600 

AGCATACTTC CGACTGTTGG AAGATCTACT TCAGGATATT TGGTAGAAGC TAGTTGATTG 24660 

AATGGTGTTT TTCCAGTATT CTCAATAGCT TTCTTGAGGA AACCACCACC ATCTTCTGGT 24720 

TTTTGACCAA GAATGTAGTA CCAGTCACCG TTGGTATTCA AGAATTTATA GCCTTTGCTT 24780 

GCTAGGTATT GAGGTGATGC GAGGTTATAT CCCCACCAGC CTTTAGACCA GTAAGAAATC 24840 
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AAGACATCTT 
GCCATTGGTT 
AATTTGCCAT 
TTGGCGTATT 
TATTTACCGA 
GTTTTTGAAA 
ACCAGCATAG 
GATTTAGCGT 
TCGTCGTAGT 
GTTTTTCCGT 
AGAAGGAGAT 
TGGTTCAGAG 
TTTTCAACCT 
TTGACAGTTT 
AGGTTGCTAT 
GCAGTATAAC 
TCAGCTGCGA 
GATGGTGAAT 
GGGATATCAG 
AGCCCCTGAT 
CAAGCATCAT 
ACATCGTAGC 
CCAAAGCTTG 
TGCGATTTTA 
ACAGGGTAGC 
TCTGTCGCAT 
AAATAAGCAG 
AGATCGACAG 
AATTCTTTCA 



TGTCAAACTG 
GAAGCCCTCT 
AGAGTTGATA 
CGTCAGTACC 
TGAGGGCTTT 
CTTTATCAAA 
CATCCATGTG 
ATTCAATTAG 
AAGCTTTAGT 

TGGCAGTAAT 
GGACATCAGA 
TAAAGTATTT 
CACGTTTAGC 
CTTGAAGTTT 
CGAGAGAGCG 
GGTTCAGGTC 
AGTATTCAGC 
AACGTGCAGA 
CTCCTTCTGT 
CGAGATTGTA 
TGGTATTAAG 
CTCCCCAACC 
TGTCGCTATT 
CAATACGAGC 
CTTCGTTTGG 
CATTGGCATA 
CATACTTGTC 
TACGGGCTGA 
TGGCATTGAG 



AACATCGTCC 
TTCTTTGGCC 
CCACTTGAGG 
AAAGTTGAAA 
TACAAAGTTC 
GTGGGCTTGA 
ACCTGGACTG 
CTCTGTTACT 
TCCTTCGATA 
GGTCATATCA 
ATATCCGAGC 
GCGTCCAGCA 
TTCTTCTTCT 
AGCAATGGCT 
AATAGCTTTT 
TTTTGGTACC 
GTTGGCATTT 
TGGAGTGTCA 
TTTTGGTACA 
CCAGCCTTGG 
GATTTGGTGA 
ACCAGTCCAC 
GTAGTAGATA 
GAGGTCATTG 
ATAGTATTTA 
TTCATCAAGT 
GATAAGGGCT 
TTTCTTCCCA 
AATCGCATCC 



675 
TTGTCTTCAT 
ATAGCTGCGA 
TAGTACCAGC 
ATCTTTGTTT 
ATCGCTTCTT 
GGATTTTTAA 
TTAATAGCTG 
TCTGCCTGTG 
ATAGCTTTTT 
TCGAGTAGAA 
TCACTGGCCT 
TCGATTGAGA 
TTTTGAGCTT 
TGATCAATCG 
TCAGCTTCTT 
TCGTTAAGTG 
GCAAAATGAC 
GCCCAAGCAG 
GAAGTGATTG 
CCATCAGCGT 
CCTTTTTCAG 
ATAGAAACGA 
CCGTCGTTAA 
GCGTAGGCAA 
TCAGCTTGAA 
CCGATGTTGA 
TTTGTAAAAG 
AAATAGCTAA 
ATGTGTCCAG 



AGTAGAAGCC 
GGGTGTTGGC 
CTTGGGCACT 
TACCTGCAAA 
CGTTTTTCAA 
TACCTAATTT 
GGATGAGACC 
TTAGTGCAGT 
TAACGTCATC 
AGCGAAGTCC 
TGTCTACGAT 
TTACCTTGTT 
CAGGCGTGAG 
TATCTTGTTG 
TTACGGCCGT 
CTTGCTCTGC 
GCATGAGTTT 
CTACCATACC 
GTGTGTTTTT 
TTCGTCCAAG 
CTAGTAGTTT 
TGATGTCTTT 
AAGCCATTGG 
TAAATTTTTC 
GCACACTCCA 
AGATTTCAGT 
CGACAGCTTG 
AGTTAGGGTT 
GACTATTTAC 



ATCGTTGAAG 
ATATTCGGCA 
AGTCGCATCG 
GAAGTCCATG 
GTCCATAGTT 
TTCCATGGCA 
GATGTCCTTA 
ACCGTTTGGA 
ACTAGCATAG 
GTCATTTCCT 
GCGTTTGAGC 
TTTGGCAAGT 
GGTCAAGTTG 
GGCACGGCTA 
GACGCTTTCT 
AGATTCATAA 
GAAGAGGCGT 
ACCGATGATT 
AATACCATTG 
AACGTAGTAC 
AGAAGAAGCG 
GTCAAAACTA 
TTTGAGACCG 
ATAGCCTTTT 
ACCTTTAGCA 
CTTTTTCGCG 
TTCGTTGTCA 
TTGGATTCCC 
TGTCGGAATG 



24900 

24960 

25020 

25080 

25140 

25200 

25260 

25320 

25380 

25440 

25500 

25560 

25620 

25680 

25740 

25800 

25860 

25920 

25980 

26040 

26100 

26160 

26220 

26280 

26340 

26400 

26460 

26520 

26580 
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AGACCGATAC CTTTATCTTT GGCATAGTTA ATCAGATCTG TCATTTGACT TTCTGTTAAG 26640 

TGATTGCCGT TTGGATCGTT GTAATAATCA TTTGTACCTT TTTCAATGGC GCGTTTGACA 26700 

TCGTCACTGG CATAGGTCTT GCCGTTAGCT GTGATGCTCA TATCGTCCAA CATGAAACGG 26760 

AGTCCATCAT TTCCGACTAA TAGGTGTAAA TCAGTGTAGC CATAATGTTT CGCTTTATCG 26820 

ATGATTTCCT TGAGCTGTTC TGGTGAGAAA TATTTACGTC CAGCATCAAT AGAAACAATT 26880 

TTCTTTTTCG CTAGTTTTTC ATTTACAGTT GCAGCACGTT CCTTTCCTGC CTCTGTTGCC 26940 

GGTTTGTCAG CCTCTGCTTT CGCTTCATCT TTTTTAGCTG GTTTATCCTT GTCAGTCTTG 27000 

TCTGTATTTG ACTCTTTAGA ATCAACCTCT TTCGCTTCTT CCTTTTTAGG GCTAGCTTCT 27060 

TCTGCCTTTT TATTAGCAGT TTCTTTTTCA GCAGAAGTTG GAGTTACCAC TTCTGCTTTA 27120 

TCACTAGGAG TTGAACTAAC TTCCTCTTGT GGTTTTTCTT CTGTTTTTGG AAGACTAGCT 27180 

ACCTTATCAG TAGCTGGAGT TTCTGTTTCT ACAGTTTTTG GAGCTTCTGG TTGAAGCACT 27240 

GCTTTAGGTG TTTCCTCAGT CCGATTTTCG GATGATTGAG GGGAATCAGA AACCGTATGG 27300 

ATGGTCGGTT GGTTTTCTGT AGTAGTAGGA GTAACTCCAT CGGCTGCAAC AGTCTGTGCT 27360 

TGGAAGGCAA ATCCAATTAG AACAGAAGCT GCTCCTACAG CGTATTTACG AATAGAAAAA 27420 

CGCTGTTGTT TTTCATGTTT CATTGCAAAA CCTCCTGATT GCATTGTTAT ATTGATAGCG 27480 

ATTATATAAA TCAACGCCTT TATTTTATTT CTTATATTAA TTTCTTATAT TAACGAGAGT 27540 

CAAGAGGAGA TGACAAAAAA CTATAATAAG TATAAAAAAA TATAAAATTT AAACTTAAGA 27600 

TTTCAGATTG GTCGGAAAAA ATACGTATAT ATATCTAGTA TAATTTTTGG TTCTATTTCT 27660 

ATAAAATATT CCACAAATTA TAGAATTTTC CAAAAATAGG TAAGCGCTAC CTTTTTGGTG 27720 

TAGTATAATA AGCATAGAAA AAGCCCAAGC GATTAGCTCA GGTTTTCTTC TTAGTGATCA 27780 

CGGTCACATG AGATAAATTT AATCTTGTAG TAATCAGATC GTTTGTAAGT TTCACTGTAT 27840 

TCTAAAACTT GGCCAGTTGA TTCGAGTTTG GTGATTTTAG TTTGTAGGAC AGTAGGGAAT 27900 

TGTTCATCGA CTCCGAGGAC TGAAGCTGCA TGTTCTGGAG TTGGAAAGAC TATTTCGTTG 27960 

ATTTCTTCAA AGTGTTCATC ATTCATGTGA ATGTGGTAGT CTAACTTGAA ACGATTATAG 28020 

ATAGAACTAT AGTATTCAAG GTTTGGATAA TTTGCGTTGA TATATTGTTC TGGGATGTAG 28080 

GATGTATGGT AGATATAAAC GACACCGTTT GATTCGCGGA TACGTTCAAT CTTGTAGTAG 28140 

AATTGATCGC CGCGTAGACC CAATTTTTCC AAGTAAACAA GCTTGTTTCC GCGTTCAATT 28200 

GAAAGAACAG TTACCTTATC ATCTTTAGCA TTGAAGAGTT CAATATCTGA AAACTCTACA 28260 

AGCTTGTGTT TGCGTGCACG TGAAACGAAG GTTCCTTTTC CTTGTTGGCG GACAATATAG 28320 

CCATCTTTGG CAAGGTCGTT TAAGGCGCGA ACAACTGTGA TAGAGCTGAC ATCGTACATT 28380 
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GAAATGAGTT CTGCTTCAGT GTAAAATTTA TCTCCACTGC TAAACTGCCC AGAGATGATT 28440 
TTATTTTTTA ATTCGTCTTT TATGTATTGA TGG 28473 
(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6749 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

CCTGATGGGT GGTATGCGAG GATACAGTTC TGAAAATCGC CGTTACTTAA TTAATGGACG 60 

CGAAGTCACA CCTGAGGAAT TTGCTCACTA TCGTGCGACT GGTCAATTAC CAGGAAATGC 120 

AGAAACTGAT GTGCAAATGC CACAACAGGC ATCAGGTATG AAACAAGGCG GTGTCCTTGC 180 

AAAACTAGGT CGAAACTTAA CAGCAGAAGC GCGTGAGGGC AAGTTGGATC CTGTTATCGG 240 

ACGAAACAAG GAAATTCAAG AAACATCTGA AATCCTCTCA CGCCGCACCA AGAACAATCC 300 

TGTTTTGGTC GGAGATGCAG GTGTTGGTAA GACAGCAGTT GTCGAAGGTC TAGCGCAAGC 360 

CATTGTGAAC GGAGATGTTC CTGCTGCTAT CAAGAACAAG GAAATTATTT CTATTGATAT 420 

CTCAGGTCTT GAGGCTGGTA CTCAATACCG TGGTAGCTTT GAAGAAAATG TCCAAAACTT 480 

AGTCAATGAA GTGAAAGAAG CAGGGAATAT TATCCTCTTC TTTGATGAAA TTCACCAAAT 540 

TCTTGGTGCT GGTAGCACTG GTGGAGACAG TGGTTCTAAA GGACTTGCGG ATATTCTCAA 600 

GCCAGCTCTC TCTCGTGGAG AATTGACAGT GATTGGGGCA ACAACTCAAG ACGAATACCG 660 

TAACACCATC TTGAAGAATG CTGCTCTTGC TCGTCGTTTC AACGAAGTGA AGGTCAATGC 720 

TCCTTCGGCA GAGAATACTT TTAAAATTCT TCAAGGAATT CGTGACCTCT ATCAACAACA 780 

CCACAATGTC ATCTTGCCAG ACGAAGTCTT GAAAGCAGCG GTGGATTATT CTGTTCAATA 840 

CATTCCTCAA CGTAGCTTGC CAGATAAGGC TATTGACCTT GTCGATGTAA CGGCTGCTCA 900 

CTTGGCGGCT CAACATCCAG TAACAGATGT GCATGCTGTT GAACGAGAAA TCGAAACGGA 960 

AAAAGACAAG CAAGAAAAAG CAGTTGAAGC AGAAGATTTT GAAGCAGCTC TAAACTATAA 1020 

AACACGCATT GCAGAATTGG AAAGGAAAAT CGAAAACCAC ACAGAAGATA TGAAAGTGAC 1080 

TGCAAGTGTC AACGATGTGG CTGAATCTGT GGAACGAATG ACAGGTATCC CAGTATCGCA 1140 

AATGGAAGCT TCAGATATCG AACGTTTGAA AGATATGGCT CATCGCTTGC AAGACAAGGT 1200 

GATTGGTCAA GATAAGGCCG TAGAAGTTGT AGCTCGTGCT ATCCGTCGTA ACCGTGCTGG 1260 
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TTTTGATGAA 


GGAAATCGCC 


CAATCGGCAA 


CTTCCTCTTT 


GTAGGGTCTA 


CTGGGGTTGG 


1320 


111 ft 7V /"^ TV n 

TAAGACGGAG 


CTTGCTAAGC 


AATTGGCACT 


CGATATGTTT 


GGAACCCAGG 


ATGCGATTAT 


1380 


CCGTTTAGAT 


ATGTCTGAAT 


ACAGTGACCG 


CACAGCTGTT 


TCTAAGCTAA 


TTGGTACAAC 


1440 


AGCAGGCTAT 


GTGGGTTATG 


ATGACAATAG 


CAATACCTTA 


ACAGAACGTG 


TTCGTCGCAA 


1500 


TCCATACTCT 


ATCATTCTCT 


TGGATGAAAT 


TGAAAAGGCT 


GACCCTCAAG 


TTATTACCCT 


1560 


TCTCCTCCAA 


GTTCTAGATG 


ATGGTCGTTT 


GACAGATGGT 


CAAGGAAATA 


CAGTAAACTT 


1620 


V.AAUAAL ACT 


GTCATTATTG 


CGACCTCAAA 


TGCTGGATTT 


GGCTATGAAG 


CCAACTTGAC 


1680 


AGAAGATGCG 


GATAAACCAG 


AATTGATGGA 


CCGTTTGAAA 


CCCTTCTTCC 


GTCCAGAATT 


1740 


CCTCAACCGC 


TTTAATGCAG 


TCATCGAGTT 


CTCACACTTG 


ACTAAGGAAG 


ACCTTTCtAA 


1800 


GATTGTAGAT 


TTGATGTTGG 


CTGAAGTTAA CCAAACCTTG 


GCTAAGAAAG 


ACATtGACTT 


1860 


GGTAGTCAGT 


CAAGCGGCTA 


AAGATTATAT 


CACAGAAGAA 


GGTTACGACG 


AAGTCATGGG 


1920 


GGTTCGTCCT 


CTCCGTCGCG 


TGGTTGAACA 


AGAAATTCGT 


GATAAGGTGA 


CAGACTTCCA 


1980 


CTTGGATCAT 


TTAGATGCTA 


AACATCTGGA 


AGCAGATATG 


GAAGATGGCG 


TTTTGGTTAT 


2040 


TCGTGAGAAA 


GTCTAAGACA 


GAATTTTGAG 


GATAAAAAAG 


AAGGAGCCAG 


CTGAAAAAAA 


2100 


CTGGTTCCTT 


TTTAGGTACG 


ACAGGCATGT 


CGTATAGTAG 


AAGTGTATTA 


TTCTAGTTTC 


2160 


AATATACTAT 


AGTAGCTCAG 


AAGTCGGTAC 


TTAAACGTGC 


TATATCAAAA 


CCAGTCCTGG 


2220 


AAAAACGTGG 


ACTGGTTTCG 


TGTTTGGATT 


ATTACCTTGA 


ACGACATGCG 


TTAAAAGTTA 


2280 


GTTGAACCGC 


CGTATGCCGA ATGGTACGTA CGGTGGTGTG 


AGAGGGGCTA 


GAGATTATCC 


2340 


CCTACTCGAT TTTAAATCAC ATGACGTTCA AAGGCATCAT CTGAAATCCC 


TTGTTCCAAG 


2400 


ATGAGTTTTG 


CCCATTCTTT 


AGCAGAGAAG 


AGGCTGTGGT 


CCTTGTAGTT 


TCCGCAAGAT 


2460 


TCGATGGTTG 


TCCCTGGGAC 


ATCTTCCCAA 


GTAGTAGTTT 


CAGCGATTTC 


CTTGAGCGAA 


2520 


TCCTTGATAA 


CAGCTGCGAT 


TTTAGCACTG 


GTGTGACGTC 


CCCACATAAT 


CATGTGGAAG 


2580 


CCTGTGCGGC 


AACCAAATGG 


TGAACAGTCA 


ATCATGCCGT 


CAATGCGGGT 


ACGGATGAGT 


2640 


TTGGCTAAGA 


GGTGCTCGAT 


AGTGTGAAGG 


CCGGCAGTAG 


GGATAGAGTC 


TTCGTTTGGT 


2700 


TGCACCAAGC 


GAATATCATA 


ATTGGAGATG 


ATGTCTCCTT 


TTGGTCCTGT 


TTCTTCCCCA 


2760 


ATCAAGCGAA 


CATAGGGTGC 


TTTGACAATG 


GTGTGGTCAA 


GTTCAAAACT 


TTCGACAATA 


2820 


ACTTCTTTTG ACATGGTAAA 


TCCTTTCAGT 


TTTCTTCTCT 


CATTATATCA 


TAAAGGTTGC 


2880 


TCCTGAGACA 


GAGAGAAAAC 


CTCTCCGAGG 


CTGGAGAGGT 


TGAAATCTTT 


ACTTACGATA 


2940 


TAAGCGGTCG 


TATTGGTAGT 


ATGGGTCAAA GGTTACGTTG 


ATACCCAGTT 


TACGAAGGAC 


3000 


ATTCTTGTCT 


TCATCAGTCA 


AGATGATGGT 


TGAGTGGGCT 


TCGCTTCCTT 


TGAGGTTGCC 


3060 



WO 98/18931 



PCTYUS97/19588 



679 

GAGTTCTTCC ATAGCGCGGG CAGCATCAGG ATTTTCTGTA GCTGTGATAG CAAGTGCAAT 3120 

CAGGATTTCA TTTGAATGAA GGCGTGGATT GCGGCTACCG AGATGATCGA TTTTAAGACC 3180 

TTGGATTGGC TTAACAACTT CAGGCTCGAT TAGTTTTACT TCTTTAGCGA TGTCAGCTGA 3240 

TTTTTTGATG GCGTTGATCA AGGCAGCGGC TGTAGGACCA AAGAGTTCTG AGTTCTTACC 3300 

AGTGATGATT TCCCCATTTG GCAATTCAAA GGCTAGGGCT GGTCCACCAG TTTCTTCTGC 3360 

"TTTTTGGCGC GCAACGACAG CAACCTTACG GTCTGCAGGT GTGATACCGA GGTCGTTCAT 3420 

GAGCAACTCA ATTTTCTTGA CGGCAGCTTC GCCAACTTTT TCAGCTTTGA AGTCAAGAAC 3480 

TGTTTGATAG TAACGGCGGA TGATTTCTTG TTTAGAAGCT TCGACAGCGG CCTCGTCATC 3540 

TGTAATAGCG AAACCAACCA TGTTGACACC CATATCTGTC GGTGAAGCGT ATGGTGATTT 3600 

TCCGAGAATA CGTTCCAACA TGCGTTTGAG CACTGGGAAG ATTTCGATAT CACGGTTGTA 3660 

GTTGACAGTG GTTTCTCCAT AGGTTTGAAG ATGGAAGGGG TCAATCATGT TGACATCATC 3720 

AAGGTCAGCT GTGGCAGCTT CATAAGCCAA GTTAACTGGA TGATGAAGGG GAAGATTCCA 3780 

AACAGGGAAG GTTTCAAATT TAGCGTAGCC AGATTTGATG CCATTGATTT GGTCGTGGTA 3840 

CATATTGGAC ATACACGTTG CCAATTTTCC AGAACCAGGT CCAGGAGCGG TTACGACAAT 3900 

CAAGTTGCGA CTGGTTTTGA TGTAGTCGTT TTTGCCCATG CCTTCTGGGG AAATGATGTG 3960 

ATCCATATCC GTCGGATATC CTTTGATTGG ATAATGAAGA TAAGAATCAA TTCCGTTTTT 4020 

CTCAAGTTGA TTGCGGAAGG CATCTGCAGC GGGTTGGCCA GCGTATTGTG TAATGACAAC 4080 

GGAACCAACA AAAATCCCTA ATTCATTGAA TTTATCAATC AAACGAAGAA CTTCTTGGTC 4140 

ATAAGAAATG CCTAAGTCGC CACGTGCTTT GGAATGTTCA ATGTTGCTAG CATTAATGGC 4200 

AATCACAACC TCAACCTGCT CTTTCAATTC TTGCAAGAGC TTGATTTTGT TGTCAGGTTC 4260 

ATAACCAGGA AGGACACGAG CAGCGTGGAA ATCTTCTAAC ATTTTACCGC CAAACTCTAA 4320 

GTAGAGCTTG CCGTCAAATT GGTTAATGCG CTCCAAAATA TGGTCGCGTT GTAAATTCAA 4380 

ATATTGTTCA GAACTAAAAG CTTGTTTTTT CATTTTTTTA CCTCTGGACT CTATTATAAT 4440 

AAAAAATTGG AAGTTAGGAA ACTACGGAGC TAAAAAAGAA ATTAAAAAGA TTAAGCAAAC 4500 

GCTTGCACAA AATTTTAAAA AGTGCTATCA TAGACTATAG ATTATGAAAA TAATGAGGTA 4560 

AACAGATGCA AGAAAAATGG TGGCACAATG CCGTAGTCTA TCAAGTCTAT CCAAAGAGTT 4620 

TTATGGATAG TAATGGAGAT GGAGTTGGTG ATTTGCCAGG TATTACCAGT AAGTTGGACT 4680 

ATCTAGCTAA GCTAGGAATC ACAGCAATTT GGCTTTCTCC CGTTTATGAC AGCCCTATGG 4740 

ATGATAATGG CTATGATATT GCTGATTATC AAGCGATTGC GGCTATTTTT GGAACCATGG 4800 
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AGGACATGGA 


TCAGCTGATT 


GCAGAAGCTA 


AGAAGCGTGA 


CATTCGTATC 


ATCATGGACT 


4860 


TGGTGGTCAA 


TCATACCTCA 


GATGAACATG 


CTTGGTTTGT 


CGAAGCCTGT 


GAAAATACTG 


4920 


ACAGCCCTGA 


GCGAGACTAC 


TATATCTGGC GCGATGAACC 


CAATGACCTA 


GATTCTATCT 


4980 


TTAGTGGGTC 


TGCTTGGGAA 


TACGATGAAA 


AGTCAGGTCA 


ATACTATCTC 


CACTTTTTCA 


5040 


GCAAGAAACA 


GCCGGATCTC 


AACTGGGAAA 


ATGAAAAACT 


TCGCCAGAAA 


ATTTATGAGA 


5100 


TGATGAACTT 


CTGGATTGAT 


AAAGGTATTG 


GTGGTTTCCG 


TATGGATGTT 


ATTGACATGA 


5160 


TTGGCAAAAT 


TCCTGACGAG 


AAGGTAGTCA 


ATAATGGTCC 


TATGCTCCAT 


CCCTATCTCA 


5220 


AGGAAATGAA 


TCAGGCGACC 


TTTGGAGATA 


AGGATCTCTT 


GACAGTAGGG 


GAGACTTGGG 


5280 


GAGCAACTCC 


AGAGATTGCC AAGTTCTACT CTGATCCAAA GGGGCAAGAA TTGTCTATGG 


5340 


TCTTCCAGTT 


TGAACATATC 


GGTCTTCAGT 


ATCAGGAAGG 


TCAGCCTAAA 


TGGCACTATC 


5400 


AAAAAGAGCT 


GAATATCGCT 


AAGTTAAAAG 


AAATCTTCAA 


CAAATGGCAG 


ACAGAGTTAG 


5460 


GAGTTGAGGA 


CGGCTGGAAT 


TCCCTCTTCT 


GGAACAACCA 


TGACCTCCCT 


CGTATTGTCT 


5520 


CAATCTGGGG 


AAATGACCAA 


GAATACCGCG 


AAAAATCTGC 


CAAAGCCTTT 


GCAATCTTAC 


5580 


TTCATCTCAT 


GAGAGGAACT 


CCTTATATCT 


ACCAAGGTGA 


GGAGATTGGG 


ATGACCAACT 


5640 


ATCCGTTTGA 


AACACTGGAT 


CAAGTAGAAG 


ATATTGAATC 


TCTCAACTAT 


GCGCGTGAGG 


5700 


CTCTTGAAAA 


AGGTGTTCCG 


ATTGAAGAAA 


TCATGGACAG 


TATCCGTGTT 


ATTGGACGTG 


5760 


ACAATGCCCG 


TACCCCTATG 


CAATGGGACG 


AGAGCAAAAA 


CGCTGGTTTC 


TCAACAGGTC 


5820 


AACCTTGGTT 


GGCGGTTAAT 


CCAAATTACG 


AGATGATCAA 


TGTCCAAGAA 


GCGCTGGCAA 


5880 


ATCCAGATTC 


TATTTTCTAT 


ACCTATCAGA 


AACTGGTCCA 


AATTCGCAAG 


GAGAATAGCT 


5940 


GGCTAGTTCG 


AGCTGACTTT 


GAATTGCTTG 


ATACGGCTGA 


TAAGGTCTTT 


GCTTATATAC 


6000 


GTAAGGATGG 


CGACCGTCGC 


TTCCTAGTTG 


TGGCTAACTT 


GTCCAATGAA 


GAGCAAGACT 


6060 


TGACAGTAGA AGGAAAAGTC 


AAATCTGTCT 


TGATTGAAAA 


CACTGCGGCT 


AAAGAAGTAC 


6120 


TTGAAAAACA GGTCTTGGCT 


CCATGGGATG 


CTTTCTGTGT 


GGAATTACTA 


TAAATATTTT 


6180 


TTGCAGAAAA 


ATTTAAAATT 


GAAATCGTAT 


AAAAACAAGG 


GAGGACTGTA 


TAAAAGACAG 


6240 


AAATCCTTTG 


TTTTTTATAA 


CCAAAGTTTA 


TAAACTTTCA 


TTCTTGAAAT 


TCAATTAACT 


6300 


TTACAAATTC 


CCACTATTAA 


GGAGAAAGAA 


GATGAACATA 


AAGAAGCGTG 


TCCTTAGTGC 


6360 


AGGCCTGACT 


TTTGCATCTG 


CTTTGCTTTT 


ACCCAAATCA 


TTCATACCTC 


TCTCAACTAG 


6420 


ATGTAACTTA 


CAAAACCCCT 


GACCTCATGA 


GCCACTTTCT 


TCCTCCTCAT 


GAGGTCAGTT 


6480 


TTACTTTCTG 


CTGTTCCAGT 


ATCGTTTTTC 


CTCGCTAGAT 


TTCCTCAAAA 


GGGCAGACTC 


6540 


CTCCCTTGGT 


GCGTCACACG 


ATTTTTTCAT 


CTCGACTGTT 


CTTTAATGCA 


TCATTAACGA 


6600 
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CGCTTTTCTT CTAGGTGGTT CATAAGGAAC AGGAAGATTC AGGTTGACTT TTCTAATCCT 6660 

AGAATAAAGT GCTGAAAACA ATTCGGAATA GGCATAGAGA CTAGACAATT TGAGGAGCTG 6720 

CTTGCGTCCT GTTCGAACAC ATTTTCCGG 6749 
(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1842 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

TCTACCCATG GACTTTGAGG CATTCATTGT TCCATCTTCT AGTGGCGAAT CTTTTGATAC 60 

AAACGATTCA ATTCACTTGG ATAGTGAAAC TCTCCCGCAA ACATTTTTCT GGTTAACTCA 120 

ATCCAGCTGA TATTTCTTTC AGCCAAAATA ATGGACAAGT TCTCCCAAAA TCGTTCAGCC 180 

ATATTGCTTC TCCTTTAGTT AGATAAATAA TGTGTTTGCG CCATGTAAAT CAATTGTTTC 240 

GTATCTCTTG GCAATAGAGC TCTAGCCTCT TCCAAATTCA GACTTGGATA AACTCGCTTA 300 

TTTGAAACCG CAAGAGGAAG TCTGATGGTT AGTTCAGGAT TTTTTAAAAT TATCTCAACG 360 

AAATCCGTTA ATCTTAGATT GTCACGGTTC TTAAATCGTA ATAAATTGGG AGATAAAAAC 420 

TCAAAACAAT CTGAAGAATA GCTCATCATC TCAATTAATT TGTCCTTTGT CATTTCAGAA 480 

ACTGAATGAC AAGATACCTC TATGCCATAG TTTTGGAAGA AATCTAAAAG AAGTTGATTT 540 

CTTTGTCTAT TTTTACTTAG ATAGAGATCA ATCATGGGAG ACCTCCCAAA GATTCGGTTC 600 

CATTTGATAT TCTGACACGA TTAAGGAATC TAATAAATTA AGGAATCTAA TAAATTTGCG 660 

AAGTTAATCG GTTTCTTGTC TTCATCATAA GCTTTTACAG TTACTTGGGT TGTAAGTATT 720 

CCCTCTTTTC CCTCGGCTCG ATAGCCTTGT CCATATAAAA CAAAAACGAG ATTTTGATGA 780 

TCATCTACAA AGGCATCAAC CCCATTCTTT ATGTCTTGAC TTTCAAGGAA TTCCATAACG 840 

TTTTGAAGAT AGGATTCGTA AAATAGTGGG TAGTTATGTT TTTTATGGTA ATCATCTAAA 900 

AATGTCACTT CAAACTCACA TGGAGAGTAA TTTTGACTTT GAACAGCCTA AAAGTGCCAT 960 

CAAATTTGAA TTGGAATAAA TCAAATAAAT AGCCCCATCC TCATCAATCC AACCTTTGCT 1020 

CAAAGACAAC TCCAACCGAT CTTTTAAAAC TGAGTAAACC ACCTTAACCT CCAGTTTCAT 1080 

ATTCTTATAC CGTTCACTCT CAAATAAAAG TTTGGGGAGC TTATAATAAC GCTCTGATGT 1140 

CTGATATTGA TTAGCGGTAA TACGCTTCAT TATTGTCCCT CCAAGACTAA AATTCCAACA 1200 
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TTTCCAAATT CATCAAATCG GATTAAACCT ACTTGTTCCA TTTCATCAAC TAACTGAGTT 1260 

GCTTTTACCC AAATCATTCA TACCTCTCTC AACTAGATGT AACTTACAAA ACCCCTGACC 1320 

TCATGAGCCA CTTTCTTCCT CCTCATGAGG TCAGTTTTAC TTTCTGCTGT TCCAGTATCG 1380 

TTTTTCCTCG CTAGATTTCC TCAAAAGGGC AGACTCCTCC CTTGGTGCGT CACACGATTT 1440 

TTTCATCTCG ACTGTTCTTT AATGCATCAT TAACGACGCT TTTCTTCTAG GTGGTTCATA 1500 

AGGAACAGGA AGATTCAGGT TGACTTTTCT AATCCTAGAA TAAAGTGCTG AAAACAATTC 1560 

GGAATAGGCA TAGAGACTAG ACAATTTGAG GAGCTGCTTG CGTCCTGTTC GAACACATTT 1620 

TCCCACCACG TGAAGAAAAA GATGGCGGAA GCGTTTGATT GTTAAAGTTT GGAAGTCACC 1680 

TCCAGCTAGA TGTTTGAGAA AAAGATAGAG ATTGTAGGCG ATACAGCTCA TCATCATACG 1740 

AACTTCGTTT TTGATTAAGG TTGAACTATC CGTTTTATCG CCAAAAAATC CCTCCTTCAT 1800 

CTCCTTGATG AAATTCTCGG CTTGACCACG TCCACGATAA AG i 8 42 
(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19390 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 
TCATCTTTAT CTCCTCGAAA TTTTCTAATA TAG CC ATT AT AACAGAATTT TGTGAAAATT 60 

CCTATTATAG TAAATCACTA TTTCAGTATA AAAAGAAAAA ACGAATCAGA CGATTCGCTC 120 

TTCTTAAAAT CTGAAAATAG CTTTCCAGAA AGGATTAGCC GATTTTTTGC AGATTGAGCA 180 

CTGCATCGTG ACTCATCAAG ACTTGACCAT ACTCTTGTAA GACTGAGCGA CTGATATCAC 240 

TATCGTCTGC AAACTCGCGC ATACGGGCCA ACAGCCAAGC TGGATATGGG CTTGGATGAT 300 

TTTCAATATC CACTAAAATG GTCAAATAAT AGCGCTCGTT CATTTTGTAG AGTTCAGAAG 360 

TTTCCATTTC AAAAGTCACT GTCTTGGCAA AAGCTACCAA GTCAGCCAAC TTAGCAAAAG 420 

AAAGGATGTA GTAGATGTAA GGTTCTTTCT TACTCTCAGC TTCTTGTTCA GCCTGCTCTT 480 

GCTCTTCTTC CTTGACTTCA ACTTGCTCAA GAGATTGAAT GGCTTCGATA TCATCCTTGG 540 

TTTTGTCTGC GATGCTTTTT TCCAGGGTTT TGATAAATTC ATCTGGAGAC ATTTGAGCCA 600 

ATTCTTCCAT ATCTGGCAAA TCCGATAAGT CTTCAAAATC TAGATTTTGG TCAATCTTTG 660 

ACTTGGTCAC AAAGACATCT ACCTTATCAG GTTTTGGAGT CACACGGAAG CTCAACATGC 720 

CTGTATCCAG AAAGCTATCA GGCATCTCTA GCTCATCCAA GATAGCATAA AAGAACTCTT 780 
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CTGTTTTTTC 


TTGAGGAACG 


AGAAAGTCAG 


CAATCTCCAT 


TCCACGATCC 


ATCAAATCCT 


840 


CTAAAGATAT 


CGTGATTTTT 


AAAGTTGTAT 


CACTAATTTG 


TTTCATTTTC 


ATTGCTAGTA 


900 


ACCTCATACT 


TTCAGTTCTA 


TCTATTATAC 


TAGATTTTTA 


CGATTTTATC 


AAAAGAAGGC 


960 


TCCTCTATAC 


GGATAGATTT 


TCCCTAGGGT 


CTTTCTATAG 


GAGACTCCAA 


AAGAAAATTT 


1020 


CTGCAGACAG 


ATAGAAAAAG 


CCTTCAAAAT 


CGGCTAAGAG 


CCGACTTTGA 


AGACCTTATA 


1080 


■ CATCAGAATA 


CTTATAATTT 


AAAGGTTGCT 


ACACCGAGGA 


TAGAACGATT 


TAAGTTTCTG 


1140 


AGAATTTGAA 


GACTTTGCTC 


AAATTTCTTA 


TAACGAGTCA 


CTCCGTACTC 


TTCAACAAGA 


1200 


AGGACTGTAT 


CTCTTTCCAA 


AAGAGATGAT 


ACATCCTGTA 


AATCTACAAA 


ATGCATTCCT 


1260 


TTTAAAGCTT 


CTTGACTCTG 


TTTCAATTTA TCTAAGATAG 


CTTTATTTGA 


GCTAACGATG 


1320 


GTCAATTCCT 


GTCCAGTATT 


TTTGTATGAC 


AAAACATCTG 


CTAGGTTAGC 


AATTGTTGTA 


1380 


ATCTCTGTTA 


CAAAATCAAT 


TTGATACTGA 


GAAAAATCAC 


CTACTCTATT 


GATTGTTGGA 


1440 


TTAAAGAGAT 


AAACTAACAC 


ATTTCCCATC 


ACAACCAAAA 


TCACACAAAC 


CACTCCAATA 


1500 


ACAACTAAAC 


GAAGAATCAG 


ATTTTTCACA 


TTTAAGCCAA 


GCGCTGTTTC 


ACCATTTGCG 


1560 


TTCAATTCTT 


TAGAGTTGAT 


GGTTTCCAGT 


TTTTCAATTT 


TCACATTTGC 


ATAGGCATGT 


1620 


TTAAATTTCT 


CAATCAACCC 


ATCAATTTTT 


TTCTCTAACA 


AGTTATTGGC 


ATCTTTACTT 


1680 


GATGTCAAAA 


TTTTCACACC 


AACCCCTGCA 


TCGTCAATCA 


TATAGTAGAC 


GGTCAATTTT 


1740 


TTCCACCAAT 


AGTCATTCGT 


TGAATTTTTC 


AAGGTTGTTT 


CTGTCGTGTC 


TAATTCACTG 


1800 


GCAATTTTTT 


TCAACTCACT 


GGGTTCTACA 


TCATTGAAAA 


GATAAGCTCC 


ATTCAAATTA 


1860 


CCATCAATCA 


ATTTCCCATA 


AAAATCACTA 


TAACCACCAA 


TTTGATGATT 


CAAAATCGTT 


1920 


TTGTCCGACT 


CTTTTGGAGG 


AGTGATTTTA 


TAGATAAGAT 


AAGTTGAATA 


ACTTGTTGTA 


1980 


TCTTTGACAG 


TGTTTTTATT 


CCTAACTGCT 


TTAATTGTAA 


ATGGTACAGC 


AATGAGAGCA 


2040 


AATAAAGCGA 


TGAGAGCTAA 


AATATTTGCT 


TTTCGCTTTT 


TATAAAGATT 


TGCAAACAAA 


2100 


TCAGCTACTG 


AATAATGTTC 


AAACATGATT 


TTTTTCTCCT 


TTGTTTAGTA 


GATAGTAGTT 


2160 


TTCCTTTGTA 


AGCATTTTTG 


CTACAAATAT 


AATCACAAGA 


ACAATTCCCC 


AGAATTGCAT 


2220 


TGTAAATAAA 


TTGAAGAAAC 


TTTCTGAAAA 


GCTGCTTCTT 


GGCATAAAGA 


ATAGATTATT 


2280 


CAAGATGAGT 


AGGGATAAAG 


CAAATAGGAT 


TGTCCTTGAG 


CGATAGGCTA 


CTTGCAGCAT 


2340 


GGCTATAAAT 


AATACGCCGA 


GTAAGAAACT 


AAGCAGAAAG 


ACTCCAATCA 


TACCATAGTC 


2400 


GGTATACAAC 


TCCATGATAT 


AACTACTTCC 


GATACCATGC 


CCTTTCAAGT 


ATTCCTTGTT 


2460 


CAAGACAAGA 


TAGGATAGAT 


TGTGGGCATA 


ACTATTACTA 


TCAATAGCTA 


GTTCCACACT 


2520 
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ATTGGTTGTA 


TGTTCAAAGG 


CTTTTCCTCC 


GAAAATGGCT 


CCCAAACTCC 


CCCTTGCAAA 


2580 


ATAATCAAGA 


ACAGGACCAA 


AAGTAAAATT 


ACGGAAATCT 


CGGTAAGGGA 


GGCTACTGTT 


2640 


AAATAGAAAA 


CCTCGAGCCA 


GAACACCAAA 


ACTAGTCCCT 


TGTTTATAGA 


TAAAGTCAAG 


2700 


TAAGATATCC 


CAGAAACCTG 


TATGGGAAAC 


TTGGACATTA 


TCCCGTACAT 


AATTGAGTAC 


2760 


TCCCATCGCT 


AACATGAGAA 


TAGGAGAACC 


TACAAAAATC 


GCTAACTTTT 


CTTTAAACCC 


2820 


AATCCATTTT 


CCTTTTTCAG 


TTTGCTCCCG 


CATAAAGTAA 


TAAACAAAAG 


CAAATmAAAT 


2880 


ACTTAAAATA 


AAGGGATTTC 


GTGTCCCAAT 


TGCCAAATGA 


ATAGTATTAG 


CTGCAATAAA 


2940 


GGAGACAAGC 


ACTGCTGTGG 


CCTGCAATTT 


CTTTGGCTTG 


GTTGCCAGAT 


ACATACACAT 


3000 


TGCATAGACC 


GTAAAGGTAG 


ACAAAATGTA 


GGTAAAATAA 


GGCAGTTTAC 


TTTCAAAATT 


3060 


TGCATAGTAG 


GCATAGTAGG 


AAGTCTGCAA 


ACGATACAAG 


AGCCGTTCAA 


ATAACCGAAT 


3120 


GAAATAGAAA 


GGATAAGTTA 


GAAGAAAAAC 


TCCTAGTGAT 


ACAAAGCGTA 


ACCGCTTGAT 


3180 


ATAAACCTCT 


TTTAGAGAAT 


TTCCTATATT 


TGCTACTTTT 


ATTTTCTTCC 


TAGCTATGAA 


3240 


GTAACGAGCC 


AGAATGCCTC 


CTGTGGTCAA 


GCCCAGAATC 


GAAATCATGA 


CAACTATAAA 


3300 


GGCAAAACGA 


TAGGCTATTG 


GATGATAGGT 


ATCCAAAGCA 


CCATCCCTAA 


AATAATCAAT 


3360 


GGTCGGTCTT 


GATACCAGAA 


ATACAAAAAT 


GGTTAAATAG 


AAAATAAAAT 


GGATTAAGTA 


3420 


ATACTTGATA 


TCATTCCAAC 


AAGCAATTAA 


GCTACTAACC 


AACAAGAACA 


ATAAAGTAGA 


3480 


AAGTAAGCTA 


ACATTATTAT 


TATTAAACAG 


ATACACAATT 


CCACTTACTA 


GCGTCAAGGC 


3540 


ATAACTGACT 


ATGGTCAAAC 


TAAATAATAA 


TCGTTTCCCA 


TCAATCACTT 


GGTCACCCCC 


3600 


GTTCTAATGT 


AATTTTTTAG 


ATTTTTCAAT 


ATTTTTCAGT 


AATAAGAATC 


GATATAAGGA 


3660 


AATATTTATG 


AATAGGGCCA 


AAGCACTAAT 


TCTTCTCCCC 


TTACGGAAAA 


TTGGATTCCT 


3720 


AGAAATAGCA 


AAGGCATGGC 


CTTTTAAAAA 


ACGATGAATC 


TGAGAATAGG 


CTTCAAACTG 


3780 


TTTATACTGA 


TCATCTAGCA 


ACATCTTATC 


CAGAATAAAG 


AAGTGGGCAT 


AGGCCAATCT 


3840 


GAAAAAAGCG 


ACCTCTTTCA AGTCAGGATA GTTTTTCACA ACTTCATTAT 


AAAACTTT TG 


3900 


GTAGATATCA 


ATATAGGCTA 


AATCCTTCTC 


TGCATAGGGT 


TTGGTCGTAA 


TACTATCCCC 


3960 


TCTATGGAAA 


TAGTAATAAT 


AGGGTTTAGT 


ATTAACCACA 


TACTTCTTGG 


CCAACTTGAT 


4020 


TAAATCAAAA 


TGGTAATAGG 


CATCTTCGTA 


AATCAACCCC 


TTAGGAAAGG 


ATAGGGCAGT 


4080 


TGCAATCTGT CTCTTGATTA GCTTATTGCA AATCGTCCCA GGTATTTTTT CACCTATGAG 


4140 


GTATTCCTTT 


AGAAATGTTT 


GAGAATCACA 


GACAAAATAG 


TCATCCTGAT 


TGGCTGACTG 


4200 


TGGGCTTTCA 


TCATTAGCAT 


AGACATTCAT 


GACACCACAG 


CTCGAAACAT 


CCGCATCTTC 


4260 


TTGAACTAAT 


TGCTCATATA 


AGCTCTGAAT 


CATTTCTGGA TGGATATAAT 


CATCTGAGTC 


4320 
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AATAAAAATC 


AGATAATCCC 


CGTGAGCCTG 


CTTCATCCCA 


TCATTTCGTG 


CTTGCGACAA 


4380 


TCCTTCGTTC 


TTTTTATGAA 


GCACTGACAC 


CCTGTCATCT 


TGTTCAGCGA 


TTGAATCACA 


4440 


CAAGCGACCA 


CTTTCATCTG 


TTGCACCATC 


ATCAACAAGA 


ATAATTTCCA 


GATTTTGATA 


4500 


GGTCTGCTTC 


TGAATGGAAG 


CTATCGATTT 


TTCTAGGTAC 


TGCGCCACAT 


TATAGACTGG 


4560 


CACAATCACA 


CTAATTAATG 


CAGTTTCCAT 


GCTACTCCTC 


TAATAGTTTT 


TCTACTTGTT 


4620 


" CGATTTGTTT 


TGTAATTGTA 


AATTGTTGAA 


TGAATTGGCT 


AGCCTCATCG 


ACATCAAAGT 


4680 


TTGAGGCAGA 


AGTCATGTAA 


TTAGTAATCG 


CCTGAGCTGC 


CTCTTGATTG 


CTCTCAATGA 


4740 


TTTGTCCAAA TCGTCCTTCT 


TGGGATAATT 


CCTCAGCCCC 


TCCAACGTCC 


GTAGAGATAA 


4800 


AAGGGAGTCC 


CAGACTCAAG 


GCCTCCACAT 


ACACTCCAGG 


AAAACCTTCT 


TGTTTAGACA 


4860 


TAGACAAAAG 


AACTTTCGTC 


TGAGATAGAT 


ACTGATAAGG 


ATTTTTTTGA 


TAACCAAGGA 


4920 


AATGTACATA 


GTCCTCAATC 


CCATACTCTT 


TGACTCGTTT 


TTTCAGTTCC 


TCTTCCATAT 


4980 


CACCAGCCCC 


GATAAAATAG 


AGATGATAGT 


TTTTTCCCTC 


TTGGTGTAAT 


AATCGTATCA 


5040 


CTTCCACTAC 


ACGGTCAGAA 


CCCTTATTTT 


CCTCAATCCG 


TCCGATAGTA 


CAGATACTTT 


5100 


GAGGAGCAAT 


CTCGATATCG 


ATCTTCTCTT 


GAGATTTTTC 


TAGAATAGTC 


TGAAAATCAT 


5160 


ATCCATTGTA 


GATTGTCTGT 


AATTTAGAAG 


TATAATCTGG 


ATAAACTTCC 


TTGATAGAAT 


5220 


TGCTGGTCTT 


TTTTGAAATC 


CCTACAATTG 


TATTCGCAGC 


ATCCAACTGG 


CTTCTATGTG 


5280 


ATTCTCTTTT 


AGAGCTATCC 


TTAAGAAGTT 


CTTCAATACT 


TCCATGAATC 


CAAGATATCT 


5340 


TCTTGACTTC 


TCTTCTTTTA 


GAGAACAACA 


GTGGTGGATT 


CATAATGGTA 


AAAGAAACTT 


5400 


CAACATCATA 


ATCATCTTTT 


ACAAGCAAAC 


GACGAGTCAG 


TCTTGGAAAA 


TAAATTCTCA 


5460 


TTCTCCACAA AAAAGCTCGT 


AACCATCTGG 


TTTGGCGATA 


ATCTTGAAGG 


GATTTTAAAA 


5520 


TGCGTACATG 


CTTTGGAACA 


GATTCATATC 


CCTTGTCAAA 


GTGCTCCATT 


TCAAGAATAT 


5580 


CAATATCATA 


CTTTTCTGGA 


TCCAGATTTG 


AAACAATGGT 


TGATAGAATC 


TTCTCTGCAC 


5640 


CACCTCCAAG 


AGAAAAAGAC 


CACATAAAAA 


ATAAGATTTT 


TTTCTTAGCC 


ACCATATTCT 


5700 


CCCTTGTATT 


CTGTATAAGA 


CTTATCCATA 


TCAGCGATGA 


CAGCATCATG 


ATGCGGTACC 


5760 


TGCTTGTCTG 


CTGGTGGAGG 


CGTCATATAA 


TCCCCAAAAG 


CAGTTCTGAG 


ATAGACATCA 


5820 


TAGCCGATTG 


GAATAGGCAT 


CTCTGTTCCT 


TCAAATGGCA 


AGAAAAGATT 


GTCTTCAAAA 


5880 


GATGTGATTG 


GGTACTTGTT 


TCTCATGTAG 


CCAGGACCTG 


AGCATAATTC 


TGTAATGCCA 


5940 


TCACAATCAG 


CCAAATCATA 


CTTAGTCATT 


TCTTTCTCAG 


CTTTTTTCCA 


GATGCGATAA 


6000 


CGGAGAGATT 


TTGGAGTCAA 


ACCCAGTAAA 


ATGCGACTTC 


CCCATTTCAT 


GAGATCACCA 


6060 
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TGCTTTTCTG GAATAGTTTG CGCACAAAAG AGTGAATAAA TCAAGGCCCA ACGAACCTGT 6120 

TTTTTCCGCT CAGCTGGATT TTTCGGATAA TAATCCAAAG GCAAAACATC CAAGGCCAGA 6180 

CCATGTGGCA AATCCAAATC CTGCTGATAA GGCTTGATAC AGGTGGTTTT CTTGTCACGA 6240 

ATGGTAATAA AAAGATTACG ATCAACAAAA TCCTTGTGAC TCTTTGACAA GAAATAACGT 6300 

TCATCTGCAT AACGAGGCCA TAATTCTGCT AATTTCTCAT AATCTTTACG AGGCATAAAA 6360 

AAGTCTAGGT CGTCGTCCCA AGGAATAAAT CCCTTGTTTC GAAGGGCACC AATAGCGCCT 6420 

CCGCCACAGA GATAACAGAG CAAATCATGT TCTTTACAAA AGGCCACAAA ATATTCAGCC 6480 

ATCTCCAGAC TACGAGCCTG AATTGCTTTT AAATCAGTCA TATTGTTCAT TATTCTTTCT 6540 

ATCGTATCGT TTCATTATAC CACAAACAAG GGGTGAAAAT CTATTGCAGA CTGTAAAAAA 6600 

TCAAAGCCTG ACTGCTATCC AAATAGCTAT CAAACTTTGA TTTTTCTGTC TTATACTCTT 6660 

CGAAAATCTC TTCAAACCAC GTCAGCTTCA CCTTGCCGTA GGTATAGGTA ACTGACTTCG 6720 

TCAGTCTTAT CTACAACCTC AAAACTGTGT TTTTAGCAGC CTGCGGCTAG CTTCCTAGTT 6780 

TGCACTTTGA TTTTCATTGA GTATTATCTT ATCTTAAGCC CATTTGAGCG AGCTTGGTTT 6840 

GATATTTGTT TTGATCAACC AGCAGGCCCA AGCCCCCATA AACATCATAG GCATCTACCC 6900 

AGTCACCCAG TTCTGGAATC GTCAATTTTT CAATACCATT TTTTGCTCCA TCCAAAACAG 6960 

ATAAACCGTT TGTTAGGAGG AAAGTATAGG GTACGTTGGT TGAGGTCATA GCAAAAACCT 7020 

TTCCAAGAGC TTCAGAACCA GTGAAAAGTT TAGTGGGATC TTTAATTTGC TCTAAAATTG 7080 

CTGTTAAAAC TTGTTGCTGT CTTTTTGTAC GGCCGTAATC TGCCTCATCA TCATCACGGA 7140 

AACGAGCATA ATTGAGCAGG GTCGAGCCAT TCATCTGCTG TTTTCCGACT TTAATGGTTT 7200 

GGGTTGGAGA CTCAGTCTCG GTAGCGTATA AATCATCTCC GACTGTAGCT TCTGTTAGGG 7260 

GACGCCCATT CAATGTTGAA AATTGAGCAT CAATCGTCAC CCCATCAGGG AAAAGCGTGT 7320 

CAATCGCTGT GGCAAAGGCC TGGAAATCAA CCAAGGCGTA GTACTTAATG TCCAAGTCAA 7380 

AATTATCTTT CAAGACTTGG CGAACCATTT CTGCCCCTTT TTGCCCCTCT TGTTCTCCTA 7440 

ACTCGTAGGC TACGTTTAAC TTGTTATCTG TCTGTTTTCT ACCATTAATC ACTTGACTAT 7500 

AACCATCTAT ATAGACCAAA TTATCACGCA TGAAACTGAC TAGCTTCATT TTCTTATCTG 7560 

AGCCCCCGAC ATTTAATACC ATAATAGAGT CAGTTCGTGT CTCAACACTG TTCTGGCCGA 7620 

TTCGACCATC AGTACCCATG ATTAAAATAT TAACTCCATC TCTAGTGTCC TGACCATTAA 7680 

AGACTTCTAC TTGAGCTGCC CGGGCATCAG CAGTTTTCTT TGCGCTAGCA TCTTGGTAAC 7740 

CACGCAAAAA CATGAATACC ATGGCCAAAG CCACACAGAC CAAAAGTGAA AAAATCACCA 7800 

TAAAAATTCG TTTAAGACGG AGCTTCCGTC TTTTCTTTTT TGGAGGGAAA GAGAGTGCTT 7860 
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GTGATTTGGA 
TTTCTTGTTC 
AGGCAAACTC 
AACGCAACTG 
GTCTGATATT 
" TAATTCTTTT 
AAAGTATTCA 
TCGCCAAGCA 
CAAGGCCAAA 
TAGAACCCAG 
CAATCCATAT 
TTCGGAAATA 
ATATTGTTCA 
TTCAATCTTA 
TGCTACTGCT 
CTCAATAGAA 
GATGACATCT 
AACGTGGTGT 
ATAATTTCCT 
CAAGCCTGTC 
GGTCGCTGAT 
AGGTAGATGT 
CCTCCTCCAG 
ACCTGACGAA 
CGATTCTCAA 
GATTTTCCAG 
ATCAAAGAAA 
ATCTGCAACC 
ATTGACTCTA 



TTGTGAGCGA 
CAAGCTAGAG 
ATTTTTTTCT 
CTCATGATGA 
GGATAAATAG 
TGGGCAAAGT 
CCCAGTGCTG 
AAGGTCGACA 
CTCATCTTTT 
AAACGTGTGA 
TCTTCAGCAG 
AAACGGGCCG 
TCGATGAATT 
GTATGGCCTG 
TGGATGTGAG 
TTTTCAACTG 
GTAATGTTGG 
GAAAATGATC 
CTGGGCTTAG 
TTTCTTGGAA 
TGTCCTTATC 
TATCTGTATT 
TTGACACGAC 
AGGCTGTTTC 
TCAGAGCATC 
CCCCCATAAA 
CTAGGATAGC 
AAGAGGGCTG 
GCACCGTGAA 



CTCCGGTTCG 
CTACTATTTC 
CTCTCATTGA 
CTTAAGGGAT 
GATAGGCACC 
GGACCAAGTC 
TCTTGAGTGG 
GGGCCTTATA 
CAGTTTGTGC 
AATTGGCTTC 
AACTTCTAGG 
CATAAGCTGT 
TCTTTCCTTG 
GAACCACCAT 
CCTGATGAAA 
GCACCACAGA 
CAAAAGCCTG 
CCTTGGGACC 
CTTGGTCACA 
AATAGCTACT 
AGCTGCGATA 
AGTCTTGAGT 
TTGGTCTGTT 
TCCCTTTTCA 
CATATCAAGG 
CCCTAATAAC 
TGGTATTGAT 
CGATAGCTGT 
GAGCTGATTT 



687 
CATAGCTTGG 
CCCTAGCAAG 
GATAGTGAAT 
TTTCTTTACT 
CAGAATTTTA 
CTTATCGGTA 
ACAACTTTCA 
AAGTGCACCT 
TTGCAAGGGA 
CATTTCCTGA 
TGCAACTGCT 
ACTAGCTGTT 
AGCCAAGGCC 
CAACTGCTGA 
AAGATAGTCC 
ATAGTTCACT 
CAATTCCTCA 
TAGATAAGCA 
TCCAAAACCC 
AGTTCTTCCT 
CGTTGGTAGA 
AAGTCACGAT 
TGTAGTAAAT 
GCGAAAAAAT 
TAATTAGGGT 
ACCTTAGCCA 
GGCTTCTGCA 
CATCATGCCG 
TCCTTTGATA 



TAAGTCAACC 
AGTTAGCTTT 
ATTTTTTAGC 
CATCTTCTCT 
TACTGGATTC 
TAATCCACAT 
ATTTTTGTCA 
GGAAGGTTGT 
ATACTAGGCT 
ATATCCTCGG 
GCAAAGGGCT 
ACCTCGATTT 
TGTGGATGTG 
TGAATAGGCT 
AAGGTTTCAT 
AATCCTTGCT 
TGAGGAAAAG 
ATTTTCATCT 
GACTAGCCAC 
TGCTATTATT 
GGGTTTCAAA 
TTCTCTGAGA 
CAGCTAGGAC 
TCGCAATGGA 
CCAAGCCTCT 
TGAATCAAGC 
CGGTCAAGCT 
ATACGGTGGT 
ATCATCCCAT 



TGCTCTTCTC 
TCTTGCAAAT 
AAATAATCAT 
CCTTTCCATG 
CAATCGCTTC 
CGATAATGAA 
AGTCAATTCC 
CAGGTAATGT 
TTTCAGCTCC 
CAATCAGTTC 
GGTCTGGATG 
GAGCCTCTGG 
AAAAAATCTT 
GAACGATTTC 
GAACACTACC 
CATAAGCCTT 
CTGTCTGCAC 
TAGTTCCTCT 
TTCCTCATAC 
TAGAAAAAGC 
ATCTGCTCTC 
AATAACCACT 
TTCTGATTCT 
CATACCTAGG 
TGCAATAGTC 
TCTCCAAATC 
CCACCTCTCC 
CACCAAACGT 
CTGCCGTAGG 



7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

6400 

8460 

8520 

8580 

8640 

8700 

8760 

8820 

8880 

8940 

9000 

9060 

9120 

9180 

9240 

9300 

9360 

9420 

9480 

9540 

9600 
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AGTAATATCT 
CTTGACCTTG 
CAGGGCAATA 
TCCTTTCAAG 
AGTTATTTCC 
GTTGATCCCC 
CCAAAAGGCT 
TGGCCCCTGG 
CATATCTTCA 
TTGTAAGGCT 
ATGAATAGGT 
TTGCCCTGAA 
TTTGGAAAGA 
TGAAATCAGG 
TAAGCCAGCC 
ACCAAGGTCA 
AACCTTGGTC 
GTCACCTGGG 
GGACCTCATA 
AATTCCTGAC 
CAATTTTTTC 
ACAAGAGGCT 
CATCTGGCAT 
CTGTAAAGAT 
CCTTGTATTC 
TTGACATGTT 
TGCTCAGCTC 
TCCTCACGAT 
AAAATCCGCA 
TCCACACCAC 



GCTCCCATAC 
AGCTCCTCAG 
ATGGGCAATT 
TCAGAAGACT 
AATTTTCCAC 
ACATTCTGCA 
GCACTGGAAA 
ACTGTGATTT 
GTATGATTAC 
GCAAACATCA 
CTTAGGTTTT 
ATGCTGACGC 
CTATCATCTC 
CGAATCGAGG 
ATGCCTACAC 
CGAAAAACCT 
TCACCCTCAG 
ACGCGGATAC 
CTTGCAATAC 
TTTAGGATCG 
AATATCAGAA 
ATTTCCTTGA 
TTGACCTGTC 
TTTGCTTGCC 
TGGTAATCCG 
CTTCTCCTTA 
TTTCCAAGTC 
TTTCCTCGTT 
GGATGACATC 
CACGCTTATG 



TATTTAAGGC 
CATCCTTGAT 
CATCAATCAA 
CAACAATCAA 
CCATGGCACG 
GCACTAGACG 
TATCTCCTGG 
TCTTACCATC 
GGGTGTACTC 
AGGCTGACTT 
TCGTCCCTTT 
CCATTTTTTT 
CAAACATCTC 
TGCCAGAATT 
CTTGAATGGT 
GCATGGTCGA 
CCAAACTTCC 
TACCATGTAA 
TTTTACCTAT 
TTCTTTTCTT 
AGGTAAATGG 
ATCTGTTTAC 
TGTGCTAGTT 
TCAGCATCTG 
CGTAGACCGC 
TTTGATGACG 
TTGAGCATTT 
GATGTGGATA 
TTCTTCATCA 
GATTTCCATG 



688 
GTCTGCCACA 



AACTGTTACA 
TCGTGGAATC 
GGTAGCAGAT 
AATGACATCA 
AGAATTTGGA 
TACGACCACC 
CACACTTAAA 
TTTTTCGATA 
GACTTGGGCA 
TAAGCGAAGG 
CAGTGGAAGG 
TACTTCGAAA 
TCCCATATTA 
AATAACCCCA 
AAGAACGTCT 
AAAGATAATG 
ATGGCGAATG 
TTTATCATAA 
ATTTCAGCAA 
CCAATTGTTG 
CAAAGCCTTC 
TTTGAATTTC 
CTGCAATCGC 
GACTGAGTTC 
ACTGTATAGT 
TTAAATGAAA 
TTAACCAAGG 
GGAACGTCAA 
GTCTGGCGTT 



ACCTGAATAC 
CCTTGGGCTT 
AAAGCGCCAC 
TTAGCGACTG 
ATAATACCGG 
GCAATCAAAC 
TTCTGTCCTG 
TGACCACCAA 
ATAACTGACT 
GAGGCAATTG 
GGAGGCAAGT 
GTCACACGGT 
TCTGCACCAG 
AGGGCATTTT 
TCTTTATCCT 
TCACCTCGCA 
GAACGGTGGC 
TTTGTTTTTA 
AAAGCCAGAA 
TTCTGAAACT 
TTGCTTGGTA 
CATCTTAGCT 
CTCTTGAAAG 
ATCTTTAGCT 
GTTTGCACTA 
CAGTATTTTC 
TTTGTAGGAT 
AAGTTCCACG 
CATAGAGGTC 
GTTCACGCGC 



GGTCTGTTTC 
GGGTCGCAAG 
CAATCTCTGT 
GATCGATTTC 
TGCGAGTTTC 
CTGCGACTAA 
TCAATTTTTG 
ATTGTTTCAA 
CCCCCTTAGC 
GCAACTCATA 
CTCGTTCAGT 
CCATAGGACG 
CAAGGACACC 
GTGGCGCTTT 
CAATTTCAAC 
GAATATCATA 
TGATAGACTT 
GTTTCATACT 
ATTCCTTAAA 
GGTTCAAAAA 
AAGAATTCTG 
TGGAAGGACG 
GCAAGATAAT 
GCTTTAACAG 
TCGTAAATAT 
TGTTATGAGA 
TCCGTGAATA 
TAGCAGTTCC 
GTAAGAGCTA 
TTGGTTAAAA 



9660 
9720 
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AAGTTCCAAA 


TTTGCTCTTC 


ATCTCCCTTA 


CTAATGGCCT 


GACCAATCGC 


TTCCAAACGT 


11460 


TCCTTGAAAT 


CCTCAATTCT 


ATCCAGAATG 


ATCTCGCTAT 


TGGACAAGAG 


AATGGAGGTC 


11520 


CACATTCCTG 


GCTCGCTTTC 


CGCAATTCGG 


GTCATATCTC 


GAAAACCACC 


TGCCGCAAAG 


11580 


CGCCTTGCCA 


TCTCATGCTC 


TTGAGCATAG 


ACCGCAGTCT 


GCTCCATGAG 


ACTAGAAGCC 


11640 


AAAATATGAG 


GAAAATGGCT 


AATCTGAGAA 


GTGACACGAT 


CATGCTCCTT 


GGCATCAATC 


11700 


- TCGATAAAAC 


GAGCATGAAG 


ACCTGAAAGC 


AGATCCTTCA 


TTTCCTTAAG 


CGTGTCCTGA 


11760 


CTTGTCAGGC 


TTGAAGGTGT 


AAAGATATAA 


TAGGCATTTT 


CAAAAAGATT 


GACATCTGCC 


11820 


GAAGCAGCCC 


CTGTCTTGTG 


ACTACCAGCC 


ATGGGATGGG 


CCCCGACAAA 


GCGAACAGAC 


11880 


TTGCCAGCCA 


AATACTGCTC 


CGCCGCATCC 


ACAATGGTTG 


ACTTGGTCGA 


ACCAGCATCT 


11940 


GAAATAATAA 


CGCCTTCTCG 


CAAATCCAAA 


TTGGCCAACT 


CCTTAATGAA 


AGCAATAGTT 


12000 


TGTTTGATTG 


GCAAGCTGAG 


GATAATGACA 


TCTGCCAAAG 


GAGCAAAACT 


AGCAAAATCA 


12060 


TCCGTTGCAC 


GGTCAATCAT 


ACCTTCTTTC 


AAGGCGATAT 


CTCTCGAAGC 


TTGACTACGA 


12120 


TTATAACCTA 


AAATTTCATA 


ATCTGGATGA 


TCGCGTTTGA 


TACCAAGTGC 


CATAGAGGCT 


12180 


CCAATCAACC 


CAAGACCTGC 


GATATAGATT 


GTTTTTGCCA 


TAGGAACTCC 


TTAATAGTTC 


12240 


TTTGTATAGT 


CTCGGTGTTT 


GGCTACCGCT 


TCTTTTAGTT 


CCTCAAGATT 


ATCTGATGAG 


12300 


AATTTTTCGA 


GGATTTCTTG 


CGCCAGAACC 


GTTGCTACAA 


CTGCTTCCAT 


GACCATTCCT 


12360 


GCAGCTGGAA 


GAGCAGTCGG 


ATCACTTCTC 


TCCACGGTTG 


CCTTGTAAGG 


TTCGTGGGTT 


12420 


TCGATATCCA 


CACTCATAAG 


AGGTTTATAA 


AGAGTAGGAA 


TGGGTTTCAT 


GACCCCACGA 


12480 


ACAACGATGG 


GTTGCCCATT 


AGTCATACCA 


CCTTCAAAAC 


CACCTAGATT 


ATTGGTACGG 


12540 


CGAGTATAAC 


CGTCTTCTTT 


AGACCAGAGA 


ATTTCATCCA 


TAACTTGGCT 


GCCTTTACGA 


12600 


TAACCAGCCT 


CAAAGCCAAG 


ACCAAATTCC 


ACCCCTTTAA 


AGGCATTGAT 


AGAGACAACA 


12660 


GCTTGAGCCA 


ATCTTGCATC 


CAATTTTCTA 


TCCCATTGGA 


CATAGGAACC 


AAGACCAACT 


12720 


GGAACGCCTC 


CGACGACTGT 


CTCCACAACC 


CCACCGATGG 


TATCACCATC 


ACGTTTGATT 


12780 


TGGTCAATAT 


AGTCCTTGAT 


TTCCTGTTCT 


CGTTCTTGGT 


TGACAATAGA 


AACTTCAGAC 


12840 


TGGGCAGCTC 


TTTGCTTAAT 


TTCAGCGACT 


GTCAGATTTT 


CAGGAACATC 


GATTTCCTTG 


12900 


CCACCAAAGA 


CCACGACATG 


GTTGGCAATC 


TCCATATCCA 


GCTCAGCCAA 


GAGGCGTTTG 


12960 


GCTACTGCAC 


CAACTGCCAC 


CCGCATGGTG 


GTTTCACGAG 


CTGATGAACG 


CTCCAAAGAA 


13020 


TTTCGCAAAT 


CATCAAAACG 


GTACTTAATC 


CCCCCAACCA 


AATCGGCATG 


ACCTGGGCGA 


13080 


GGATGAGTAA 


TTTTCCGCTT 


GCTTTTAAGG 


CGGTCTTCAA 


TGTCCTCCGC 


AGACATGATG 


13140 
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TCCAGCCATT TCTGGTGGTC CTTATTGATG ACATCCATAG TAATAGGCGC CCCTGTCGTC 13200 

TTCCCGTGGC GAACGCCCGA AGTAAAGACA ACCTGGTCAT TCTCAATCTT CATACGACCA 13260 

CCACGACCGT AGCCACCCTG ACGGCGTCTA AGGTCCTCAT TGATATCCTC AGCTGTCAAT 13320 

GGAAGTCCAG CTGGAATTCC CTCAATAATA GCTGTTAGAC GGGGGCCGTG TGATTCTCCT 13380 

GCAGTTAAAT ATCTCATACA CTCTCCTTAT TTTACCAAGT AGTCTTTCAT CTCTTCCAGA 13440 

GAAACTGGGT GAATGGTCGC TGAACCAAGC TCTGGCACCA AGACCAATTT CAAGGTGTTA 13500 

CCACGCGCTT TCTTGTCATG AGTAAGAGCC TGATAAAGCT TGCCAACTTC CCAATTTTCA 13560 

TAGTCAACAG GCAAACCGAA TTTCTGACAC ATCTCTGTGA TAGATTGGGT AATGCCAGCT 13620 

GGCATGAGGC CTTTTTCCTC AGCAACCTTG GAAATCTGTA CCATTCCCAT GGCAACAGCC 13680 

TCTCCATGCA TGACCTTGCC ATAACCGGCA GTCGCTTCGA TGGCATGGCC AATAGTGTGG 13740 

CCAAAATTGA GGTAAAGACG AATACCATTG TCCAACTCAT CTTCAACCAC CATCTTGCGC 13800 

TTCACCTGAC AAGAATGTTC AATCAAGGTC TCTGCATGTT CCAAAATACT CTCAACAGAA 13860 

CCATTCAGTC CCGTCAAGAG AGCCCACAGT TCTGGATCCT CAATCAAGCC ATACTTGATA 13920 

ACTTCACCCA TCCCTTCAAT CAACTCTCTT TTTCCGAGGG TTTCAAGAAC AAGTGGATCA 13980 

ATCAGAACCC CATCTGGTTG GGCAAAGGTC CCCACCATAT TTTTAGCAAA TGGTGTATTA 14040 

ACGCCTGTCT TTCCACCGAT AGAAGAATCA ACCTGAGCTG TCAAACTAGT CGGAATCTGA 14100 

ACAAAGTGAA TACCCCGCAT ATAGGTAGAG GCTACAAATC CAGCCAGGTC CCCAACAACG 14160 

CCACCACCAA GAGCAACGAT TCCATCGCTA CGAGTCAGAC CTTGCTTGAC TAGAAATTCA 14220 

TAGACTTTCT GAACAGTAGT TAAATTCTTT CTTTCTTCAC CTTCTAAGAA ATCAAAAACA 14280 

GCTACCTGAA AACCAGCATC TTCTAGGCTG AGCTTGACCT TCTCTGCATA GAGAGAGGCT 14340 

ACATGGTTAT CTGTCACAAT GACTACCTTT TGCGGTTGCC AGAGTTCTCG CAACCACTGA 14400 

CCAGCCTGGG CCATACAACC TTTTTCAATC TGAATATCAT AAGGATGGTG AGGAATATCG 14460 

ATTCTGATTT TCATAGGAGA GTCTCCCTTT CTTTATTGGT ATTTTTCTGT TAAAGACTGC 14520 

CAAATCTCTT CTGTCGGCAT TTCCTTGCCT GTCCACAGTT GAAAAGCTTC TGCAGCTTGA 14580 

TAGAGTAACA TTCCCAGACC ATTGACTGCT GGATTGCCCT GACTTCTAGC CCATTTCAAA 14640 

AACGGTGTTT CAAAGGGTTG GTATATGATA TCTGCAACTA AAAGAGTTTC TGGTAAGACT 14700 

ATGTTTTCAG GAACAGGAGA GGATTGGCCA TCCATGCCCA CACTGGTGGC ATTAACTAGC 14760 

AAATCCGACT CGGCAATCCT TGCTTGCAGT TCAGAAACAT ATTCTAAAGC ACACAAATCC 14820 

ACTTTAAAAC CTGTCTGCTC CTGTAACTTG TCTAGGTAAG GTCTTGTTTT TTCCATAGAA 14880 

ACGGAACGAA CAAAGACCGA AATCTGACTG ACGCCATCCA AAATAGCCTG TGCCAAGATT 14940 
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GATTTAGCCG CACCACCTGC ACCCAGCAGG GTCATCTTTT TACCTGAAAT TGTAAAAGAA 15000 

GGCAAGCACT TAAAAAATCC CTTGCCATCT GTATTATATC CAATTAAATT GCCATTCTCA 15060 

TTGACAACCG TATTAACCGC ACCAATCAAG CGCGCTTCAT CGCTCAGCTT ATCCAAATAA 15120 

GGAATCACCT GCTCCTTATA GGGCATGGAC AGATTGATGC CAAACATCTG GTAGCGACGA 15180 

ATATTGGCCA CTGTTTCTAC CAAGTCACTC GCTTCAATCT CCCAAGCCAC ATAAGCACCG 15240 

- TTGGTAGCTG TCGCCTCAAA GGCTCTATTG TGGATGAAGG GAGAAATAGA ATGCTTAATA 15300 

GGATTGGCAA CAACTGCAGC TAAACGTGTA TAGCCATCAA GCTTCATCCA AAATCTCCCT 15360 

GATTTTTTTC ATGCTAGCTA GAGAAATCTG CCCAGGGGCA CTAACCTCAT CCAGACTGGC 15420 

AAAAGACCAA CTCGAACCAG TCACATCCGC AGTGATACGA GAGACCTTGC CCACCTTACC 15480 

CATAGAAATG GTCACATATT CCTGTTCAGG ATTGAGGGTT TTAAAGCCTC GTGTATAGTT 15540 

CATCAAGTCT AAGACATCCT GCTCCGTGTG AGCCATCACC GCAACCTTAA CAAGTTTTGG 15600 

ATTTAGGATC GTCAACTCTG ACAAGATTTC CATCATGTTC TCAGGTGTTT CTTGGAAATT 15660 

ATGGTAACTC AAAACAAGAT TTGGGAAGTC CAGCATTTCC TCAAAAACAT CCTTGTAGCT 15720 

ATAGTACTCA AAATCAATAT AGTCTGGTTG ATAGAGTTGC GCAACTTCCT TGATTAGATG 15780 

GATATACTCT TCTGGAGAAA GGTCGATTTC TCCACCTTCG GAGCGAGTTC GTAGCGTGAA 15840 

AACCAACTCA CGGCCTGCGA ATTTTTCAAA AATGGCTGGA GCTACCTGCA AAATCGCTTC 15900 

TTTAGGCAGA TAGTCGGCAC GCCATTCAAT GATGTCGGCA TCCAGGTACC TCGTGGCATC 15960 

CAGAGCCTGA GCCTCCTCTA AACTTCTTGG CATTACTGAA ACGATTAATT TCATTTACTA 16020 

ACCTTCATAC TAATCACCTT GAGGTAATTA CTACTTTCAT CTTTTTTATT ATAGGCAAAA 16080 

TCTGCTGGAA GACCATATTT GTTTAAAATC TGGTAACTTC TTCCTGCAAA ACCTTTATCA 16140 

ATTTGTTCTG TAAATTTCTG ACGGGAAACA TTGGCAGCAT TGGTACTGGC AATGATAATC 16200 

CCTCCCGGAT TTAAAATCTC AAGACTCTGG GAAATCAACT TGTGATAATC CTTGGCCACA 16260 

GAGAAAGTTT GTTTTTTATT CCGAGCAAAG CTAGGCGGAT CTAGGACAAT CACATCGTAG 16320 

GTCAAGTCTT TGCGTTTGGC ATATTTGAAA TACTCAAAGA CATCCATGAC TATAAAACGA 16380 

TGCTCGTCTG TGCTGAGCCC ATTTGCCTGA AAATGCGCTT GAGACAATTC TCGTGAACGT 16440 

TTGGCTAGAT CAACAGAAGT TGTATGGCTA GCTCCTCCCA TGGCCGCAGC TACTGAAAAA 16500 

GCCGCTGTGT AGGAAAACAT ATTGAGTAAG GATTTACCCA TAGCCAAGCC GTCAACTAAA 16560 

CTACCGCGAA CCTCATGCTG GTCTAGGAAA ATTCCTGTCA TCAAGCCATC ATTCATAAAG 16620 

ACTTGATACA GGACACCATT TTCTAAAACA TTGAAAAAGT CAGGTGCTTC TTGACCATAA 16680 
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ACATGGGCAG ATTCATAGTC CAAACCCTTA AAGCGGATTT TCTCATAAGC TCCTAAAACC 16740 

TCAGGGAAAA CCTGTCTAAA GGCTTCTGAT ATAGTCTGAC GAATCTGATA AACATAAGAG 16800 

TTATACCAAG AAAAGACGGC GTAGTCGCCA TAAAGGTCCA CTGTCAGACC CCCAAAGCCA 16860 

TCTCCCTCTT GATTAAAGAG ACGAAAGGCA GTTGTCAAAT CATCTTGATA GTAGGCGTTT 16920 

CTCTTTTCTT TGGCTTTTCT AAACAACGTT TCAAAGAAAG CTTGATTGAA GGCCACCTTG 16980 

TCTTTGCTGA TAAACCAGCC CAAGCCCTTG TTTTGCTGAG AAAGGTAGGC AGTCCCAAGA 17040 

AAGTTTCCTT CCTGACCCTG CACCTCTACT TCCTGATCCT TAAGATTGAC ATTCTCAAGA 17100 

TCACTGGCTT CTAGTAAAAC TAGCCCCTTA GCAAGCTTCT TTTCAACCCT TTTGCTGACT 17160 

CTTATTCTAT TCATAACTAC CATTATATCA AACTTTTAGA CAATTCTCAA AAAAGAAACT 17220 

ACCCTTGCTT TTTTACTCTT CTTTTAAAAA ATGGTATACT AGACTTCCTG CAAAACTAGG 17280 

AAGTAAATGT GTAAGAATCA CAGTAAAAAA TGCTCTTCCG TCTTGGAGGA GCATTTCTTT 17340 

TTATCAACGA AAATCAAATA GCAAACTATG AAACTAGCCT CAGGTTAACT GTGAGATTAT 17400 

AGGTAGAGAG GTTGTATCAG CAATATGTGT CTGTCAAATT TAGTGACAAA GGTAGTAGAA 17460 

GAAAGATAAA GAAATAAATC AGCTTCAGTA GGTATCTGGA AAATTTGATT TTATAGAGAA 17520 

GCCTTTTGTT ACAAACTCAA TATACTATCA ATAAATAATA TTATAGAAGC AACAATAATT 17580 

ATAATTTCAC CTATCTGCAT CATTCTATTT CGAACTCTAA ATATATGTTC TATCAAAAAT 17640 

ACTTGGAACA CACACATTAT AGGAATTAAC GTTTTTGAAA TTGAAAAATA TCCAAATAAA 17700 

TAAACTATAA ACAACAAAAA TAGAACTATG TTATATTTCT TATTCAAAAC ATTCCTCCCT 17760 

ATATATTTTT GATTACCAAT CTTAATCATT TACAACTACA TTCTAACAAA CTATAAAAGC 17820 

GTTTGTCGAA TTGAATTTAT CAAGCAAGCG ACCAACCAGT TCATCTTTTT TCTATTTCTG 17880 

CCAATATGCG TGACAGGTAA TAATGATAGC CAAAAATAGC AAGAGCAAGC AAGACGATAA 17940 

GAGCTCCTAC TCCCAAGCTG ATGGCAAGGA TAGGGGAGAG AGACTGAACC AAGAATATGC 18000 

TCCCAATTAC AAGGGCCATC AGGATTGCAC TATAAATAAA CAATAAAACT ATGGCGACTA 18060 

TGCCATTTGA ACGATTCACC AGGTCCGTAA TGCTACTCCA ATTGGTTGAC AGATTTTTAA 18120 

CGTCCTTAAA GTAATGGTGG CAAGAAAGGA TGACACTGGC AATGATCCAG ACTACAAGAA 18180 

GGTAAATCAT CGAAATGATG GGCAAGCCTA GATATAGAGA AAGAGCAAGC AAAGTCAGAA 18240 

CTGGTAAAAA GGACTGGACA GCATATATAA TCCAAAATTT CACTTTCACA TAACGAGCAA 18300 

AGTCAAAGGG TAAACTCTTA AGAAAATCAA CATTTTCCCT CTCCAAGGAC AAGGCAATTG 18360 

AATGCAGGCT GGTGATATTG TTATTGACAA CTGCTATAAA GAG AG CT AT A AAAAACAAGG 18420 

GTAACCAGTA TGGAGGATGA ATGTCTGGAA CTATCTGAGA ATCTCGGATT TTGGAAATCA 18480 
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GACCGATCAT CATGAGATAA GGAAGGAAAG CACTTGTAAA 
TCCCCTGTCC CAAGAGGGTG AGGTGGTAGC GTAAAACCAT 
TGGTTGAAAT TCTCTCCTTG CTGCGACGTT CTTTTTTGAC 
GGATCACGTC ATAAAAACGA GGAAGGACCT TCTTTTTGGT 
TTAGTCCTAT CCAAGCGAGC AGACCCACTA AGGCTTCTGT 
- TTTTGTAAAA GATATGAAGA GGATAAAGGA GAAATGGAAT 
TACTTCCAAA AGTCGACTGA AGAAAGAAGA TAAATATTAA 
CAATCATCAC ATTCGAAAAA ATAGACTGAT ACTTTCTGAA 
AATGCACTGC CACTACCATC ACTAGAGCCA CAGAGACAAA 
GCATCAAAGG CAAACCCAGC CATAGAGAAG GAGCTAGCCT 
AAGCTAGGAT TGGTACAATT CCAGTTAGAG CTGGCAAAAG 
TTATAATCTC TGATTCTTTA AAGGCATAGG GCCTATACGA 
AAAAGACATT GTAAAAGGCC GTTAAAGAAG TTGAAAAGGC 
TCATCGAGCT AAAATAAATA GGTATTTCCT CAAAAGGAAA 
AACAGATGAT CATCAAGAGA CTGGAAAAAA TGTAAGAACT 
TTACTTTTTT 

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 18436 base pairs 
<B) TYPE: nucleic acid 
(CJ STRANDEDNESS: double 
(D) TOPOLOGY: linear 



AAGCACTGTA 
GCGAAAAAAT 
CTTCTCCTCA 
CAGATAAAGC 
CGAAAAAGGC 
GTCTCTAACT 
AGGTATGAGA 
GACCCTAGTT 
TAATAAGGTC 
AATGTAGAGG 
GACAGACAGT 
TACCAAATCC 
AATCACTAGT 
ATGAATGGCT 
TAAGACTCTA 



ATCACGCCAG 
CCCTTTTTAG 
CTATTAAGCA 
AGGAAGAGAG 
TCCACTGCTA 
TTGTCAACAA 
ACTCCTATCC 
TGAGCCAAGA 
AAGGACAGTA 
ACCAGAAAAT 
CCTTTAGCAA 
TTACTCTCAT 
AAAATAGCAA 
ATATTACTAA 
GCGGAAACAT 



18540 

18600 

18660 

18720 

18780 

18840 

18900 

18960 

19020 

19080 

19140 

19200 

19260 

19320 

19380 

19390 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 
CCGAGCGTCG TTACAGACTT TATCAAGATT GGACGCAAGA AGAAATTCAA 
AAAATATGGC ACAATCTCCA TGGCATACTC ATTACCATGT TGAGCCAAAA 
TCAACGACCC AAATGGCTTT TCTTACTTTG ATGGCAAGTG GATCCTCTTT 
TTCCTTTTGG TGCAGCCCAC GGTTTAAAAT CTTGGGCACA GCTAGAAAGT 
TTCACTTTAA AGAAACTGGA ATCAAAGTTT TACCAGATAC TCCATTAGAT 
CCTACTCTGG TTCTGCCATG CAATTTGGCG ATAACTTATT CCTATTTTAT 
TTCGCGATAA AAACTGGATC CGTCACCCAT ACCAGATCGG TGCTTTGATG 



CATATAAAGG 
ACAGGACTTC 
TACCAGAATT 
GATGATTTGA 
AGCCACGGTG 
ACAGGAAATG 
GACAAGGAGG 



60 
120 
180 
240 
300 
360 
420 
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GTAAP,ATTAP A A Jk.fi atwpp A P 


A APATPTTP A 

AAGAxV.1 IGA 


TTGACCAGCC 


AGCAGACTCT 


ACTGACCACT 


480 


TPPOPfiATPP ir An ATurvpfpri 


AAl~ 1 X It, AGG 


GTCAATATTA 


TGCCATTGTC 


GGCGGACAAG 


540 


n\— x i uununn AnAnu u 1 | H_ 


Gil (_G 1 L. 1 CT 


ACAAGGCTGT 


CAATAACGAC 


TACACAAACT 


600 


fSnr'A AP.PAnT iyi/spp a pptpt 


GAl, A L IxjCTA 


ACGACCGTAC 


TGCCTACATG 


ATGGAATGTC 


660 


1 /irt ill VjVj 1 LI i lu 1 Aunu 


GAAuAA^tTTG 


TCCTTCTCTA 


CTGTCCACAA 


GGATTGGATA 


720 


AlaAAAGl Iti AGALTACGAT 


AATATCTTTC 


CAAATATGTA 


TAAGATCGGG 


GCTTCCTTTG 


780 


A^-H- 1 ArtAAA luLLAAAA TG 


GTAGATGTGT 


CTCAACTTCA 


AAACATGGAT 


TACGGTTTCG 


840 




11 (J AAvJGCTC 


CTGATGGGCG 


TGCTCTAGCA 


GTTAGCTGGC 


900 


x luu 111 nuAlui l 1 v_ 1 


TACCCATCTG 


ACCGTTTTGA 


CCACCAAGGA 


ACCTTCTCTT 


960 


1 vjvj i UuiVnin Av- 1 l_ Av_ 1A1L 


AAAGACGACA 


AGCTCTACCA 


GTATCCAGTC 


GCTGCTATTA 


1020 


/\UVjrtv_L 1 1 L(j l\j<_ 1 IL/lvjAA 


GAAGCCTTCT 


CAAACCGTTC 


CCAAACCAAG 


AACACTTACG 


1080 


A APTT'P. A APT P A A pflW^ A A 
ftW- x luAAL 1 LAAL 1 i GGAA 


(jCIAATAGCC 


AGAGCGAGAT 


TGTCTTACTT 


GCTGATAAAG 


1140 


a/mju A nnuuu AL 1 1 1 v_/\Al l_ 


AACTTTGACC 


TTGTAAACGG 


TCAAGTAACA 


GTGGATCGTA 


1200 


u*_\»AGGt- AGAACAGTAT 


GCCCAAGAAT 


TTGGGACAAC 


TCGTTCTTGC 


CCTATCGAGA 


1260 


A A (~AGG(. TAC TACTGCTACA 


ATCTTCATCG 


ATAACTCTGT 


CTTTGAAATT 


TTCATCAATA 


1320 


AAGGAGAAAA AGTATTTTCT 


GGTCGTGTCT 


TCCCACATGC 


GGACCAAAAT 


GGTATCCTGA 


1380 


1 1 AAA 1 CTGG AAACCCAACT 


GGAACTTACT 


ATGAATTAGA 


TTATGGTCGC 


AAAACTAACT 


1440 


GATGTCGCCA aacttgcagg 


CGTCAGTCCT 


ACTACCGTTT 


CTCGGGTTAT 


CAATAAAAAA 


1500 


GGGTATCTAT CTGAGAAAAC 


CATCCAAAAA 


GTCAATGAAG 


CCATGCGAGA 


ATTGGGCTAT 


1560 


AAALLLAALA ACCTGGCTCG 


TAGTCTGCAA 


GGAAAATCAG 


CTAAGTTAAT 


CGGCTTGATT 


1620 


TTV^PPP imnnvi/< » » tns+fn 
I 1\.U.LAA1A I 1 IvLAATUT 


TTTCTATGCA 


GAATTGATTG 


ATAAATTGGA 


ACACCAACTC 


1680 


TTP A A A A ATVi fi*I"?APAAPAP 
1 1 v.nnn/Vrt oil ALAAuAV. 


CATCATCTGC 


AACAGTGAAC 


ATGATTCTGA 


GAAGGAACGC 


1740 


GAATACATCG AAATGTTGGA 


AGCCAATCAG 


GTGGACGGCA 


TCATTTCTGG 


TAGTCACAAC 


1800 


CTAGGAATCG AAGACTACAA 


TCGTGTGACA 


GCGCCGATTA 


TTTCCTTTGA 


CCGAAACCTA 


. 1860 


TCGCCAGACA TCCCTGTCGT 


CTCCTCTGAC AACTATGCTG 


GTGGGGTTCT 


TGCTGCCCAA 


1920 


ACCTTGGTCA AGACAGGTGC 


CCAGTCTATC 


ATCATGATTA 


CAGGGAATGA 


CAATTCTAAT 


1980 


TCGCCAACCG GACTGCGCCA 


CGCTGGTTTT 


GCATCCGTAC 


TCCCAAAAGC 


TCCTATTATC 


2040 


AATGTTTCCA GTGACTTTTC 


TCCCGTCAGA 


AAAGAAATGG 


AAATCAAGAA 


TATCTTGACC 


2100 


CGGGAAAAAC CAGATGCCAT 


TTTTGCTTCG 


GATGATTTGA 


CAGCTATTCT 


GGTCATTAAA 


2160 


ATCGCTCAAG AATTGGGCAT 


TTCTGTCCCA 


AAAGAGCTCA 


AGGTCATCGG 


CTATGATGGG 


2220 
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ACCTACTTTA 


TCGAAAATTA 


CTACCCTCAA 


TTGGCTACTA 


TCAAGCAACC 


TTTGGAAGAG 


2280 


ATTGCTTGTC 


TCACTATTGA 


TCTTCTCTTG 


CAAAAGATTG 


AAGGCAAGGA 


AGTCGCCACA 


2340 


ACTGGTTACT 


TCTTACCAGT 


TAOGCTATTA 


CCAGGAAAAA 


GTATTTAAAC 


ACAAGAAAAC 


2400 


TCAGACCGAT 


TCGTCTGAGT 


TTTTATGATC TTAAATTTTC 


GAGATAGCGC 


TGGGCTGTCT 


2460 


CTAGGTTAAA 


GGTTTTATCT 


GAGATGAGGC 


GCTCTACTAG 


GGGAGCAACT 


TCAGATTCAC 


2520 


- TAGCCCCAGC 


TAGGAGAGCT 


AGGGATTTGG 


CCTGTAGTTT 


CATGTGGCCT 


TGCTGGATGC 


2580 


CCGTACTTAC 


CAAGGCTTTG 


AGGGCTGCAA 


AATTTTGAGC 


AAGACCGATG 


GACACGATAA 


2640 


TCTGGGCTAA 


TTCTCTGGCA 


GAAGGATTTC 


CTAGTAGATC 


ATGACTGAGA 


ACTACACGTG 


2700 


GGTTGAGGCC 


GATAGAGCCA 


CCCTTAGTCG 


CTACAGGCAT 


GGGCAGGGTC 


ATCTCACCGA 


2760 


CCAATTCTTC 


TCTTTCAAGG 


TCCAGCGTCC 


AGCAGCTAAG 


ACCTTGATAG 


CGTCCATCTC 


2820 


GACTGGCAAA 


GGCATGGGCC 


CCAGCTTCGA 


TGGCACGCCA 


GTCATTACCA 


GTGGCAATCA 


2880 


AAATCGCATC 


AATACCATTA 


AAAATTCCTT 


TATTATGAGT 


AGCAGCTCGG 


TAAGGATCAG 


2940 


CCTGCGCAAA 


CTGACTAGCC 


AACGCAATTT 


TCTCCGCAAT 


CTCTCGTCCT 


TGATCCTTTT 


3000 


GGCGGCTCAA 


GTAGCGAAAG 


GCGATGCGAC 


AGCTTGCAGT 


CACCAGAGAA 


TCGGTCGCGT 


3060 


AGTTGGACAG 


GATTCCCATG 


AGACTCTGTC 


CCTGACTGAG 


TTCTTCTAAG 


ACTGGTTTCA 


3120 


AGGCTTCCAG 


CATGGTGTTG 


AGCATATTGG 


CACCCATGGC 


TTCCTGGGTA 


TCGACATGAA 


3180 


TATAAACAAC 


GAGAAAGTCT 


GGTTCGCCTT 


TTATCTGCTC 


GACATGCAGA 


TCACGCGCCC 


3240 


CACCTCCACG 


TTTAACGATA 


GAAGGATAGG 


CTTGATTGGC 


AAGCTCCAAG 


AGCTCCGCTT 


3300 


TCTTGCTGGC 


AATCTTCTCT 


TGCGCTAGTT 


TAGGATTAGC 


AACTTGATAA 


AGGGCTACCT 


3360 


GCCCAATCAT 


CTGTCGCTGA 


TGGACTTGTG 


CAGTAAAACC 


ACCTGCACGC 


TTGATGATTT 


3420 


TGCTGGCATA 


GCTGGCCGCC 


GCAACCACAG 


AGGGTTCTTC 


TGTCACATAG 


GGAACGGTGT 


3480 


ATTCCTGACC 


GTTGACAAGT 


ACCTCCGGAA 


CCAGTGAATA 


AGGCAGAGAA 


AAAGTTCCCA 


3540 


CTACATTCTC 


ACTCAGCTGG 


TCTGCCACAG 


TCACGCTCAT 


CTGTTCATCC 


TTCTCCAGAC 


3600 


TAGCTTGTCT 


CTCAGGACTA 


AGGAGCGCCT 


GAGCTTTTAA 


CAGCTCGAGG 


CGCTCTTGGT 


3660 


ATGATTTTTT 


AGAAAATCCA 


TTCCAACTTA 


TCTTCATTAT 


TTTTCAACCT 


TGCTATAACG 


3720 


GCGTTGGTGG 


TCGAGAATTT 


CAACCAAGGC 


AAAATCTTGA 


TTTTCATAGC 


CAGCAAACTG 


3780 


GGCAGAGTTA 


GTTTCATCCA 


AGTTTACTTC 


CTCAAAAAAG 


ACCTTTTCAT 


AGTCTGCAAC 


3840 


GGATAGGGCA 


GTTCGTTGGT 


TGAGCTTGTT 


CAAACGGTCT 


TTATCCAAAT 


AAGCTTCATA 


3900 


TCCTTCAACC 


AATTCACCAC 


TGAAGAACTC 


AGCCACAGCT 


CCACTTCCGT 


AACTATAAAG 


3960 
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GGCGATTTTA TCCCCAGCTT TCAAGCTATC TGTATTTTCC AAGAGAGACA AAAGTCCAAG 4020 

GAAAAGTGAA CCTGTGTAGA TATTCCCCAC CTTTTGACTG TAGAGAATAG ACTGGTCAAA 4080 

ATGCTTTTGT AAGAGGTCTT TTTTCTCTTG AGGCAGGCTC TTATCCATGA TTTTTTTCAA 4140 

GCCTTTTAGC GCTAATTTAG GATAAGGCAA GTGGAAACAA ACAGCCGCAA AATCATCCAA 4200 

AGTAAGCTGG TAGCGTTTTT GATATTCAAG CCAAGTCGTT TTCAAACTAT CCAAGTATTG 4260 

TTGGGTAGAA TAGACACCAT TTACATAAGG AGTTGTCGAG TAATTTGGTC GCCAGAAATC 4320 

CATGATGTCA CGGGTCTGAG CTACATTGTC ATTATTAAAG GCCATCATGC GTGGATTTTG 4380 

TGTAATCAAC ATAGCTACAC TTCCAGCACC TTGAGTTGGT TCTCCTGGAG TTTCAATACC 4440 

GTATTTGGCA ATATCACTGG CAATGACCAA GACCTTGGAC TCCGGAGAAT TTTCCACATG 4500 

CAATTTGGCA TAATGGAGGG CAGCAGTCGC TCCGTAGCAG GCTTCTTTAA TCTCGAAACT 4560 

ACGAGCAAAG GGCTGGATGC CCAGCAAGCC ATGCACAAAG ACGGCCGCAG CCTTACTCTG 4620 

GTCAATTCCT GACTCGGTCG CCACAATGAC CATGTCAACT TCTTGTCTTT CTTGCTCAGT 4680 

TAAAATAGAG TCACTAGCAC TGGCCGCCAA GGTCACGATA TCCTCAGTTA GGGGCGCAAT 4740 

ACTCAATTCC TTGAGTAAGA GTCCTTTACT TAATTTTTCA GGGTCAATTC CCCTCGCTTC 4800 

TGCTAAGTCT TGTAATTTCA AGACATATTG ACTGGTCGCA AAACCAATCT TATCAATACC 4860 

GATTGTCATA TTTACCTCTG TTTTATCATT CATGTAAAAA ATCGTTCTAT ACTATTTTAT 4920 

CACAAATGGC AGTAAAAGAG AGAAAAAAGA CTTGATTCAC CAAATCAAGC CTCTTATTGG 4980 

TCATCATTTT AAAGAATGAT TAGTTGCTAG AGAGTTCACC GATATAAGTA GCTTTATAAG 5040 

CTCCATTCAC AGTTATCAGC TCCTGGAGGA TCAAATTTCC TGAGTAAGTC CTTCCCATCT 5100 

CATCTACAAA TTTTTGATAA AACTGACTGG TCGGAATTTC TCTGACATCC TTATCAAATG 5160 

TCTTATCAAG TGTTTTACTA ACCTTCTCAG CAATCAATTG ATGCTCTTGC CATCCACTTT 5220 

GAAACTCTGA GCCCGAACTA GAAACCATGA CTGGGATAAA CAACAAGGTC AGTAGATTTA 5280 

CAGACAATAA GGAAAGTAGT AGACTTCCTG CAAAACTAGA ATCCTAGTTC ATGATTGATA 5340 

ATACCAGCAA TCAAATTCAT TCGTAATCCG AAGCGTTTAC GATGATTTCG ATAGGTTGTT 5400 

GAAAACATTT TAAACGTTTT TACTTTGGCA AAGATGTTCT CAACCTTGCT TCTCTCCTTA 5460 

GATAGCGCAT GGTTACAGGC TTTATCTTCA GCTGTTAGCG GCTTGAGTTT GCTGGATTTA 5520 

CGTGGAGTTT GTGCTTGAGG ATATATCTTC ATGAGCCCTT GATAATCACT GTCAGCCAAG 5580 

ATTTTACCAG CTTGTCCGAT ATTTCTGCAA CTCATTTTGA ACAACTTCAT ATCATGACTA 5640 

TAGTTCACAG CGATATCCAA AGAAACAATT CTCCCTTGAC TTGTGACAAT CGCTTGAGCC 5700 

TTCATAGCGT GAAATTTCTT TTTACCAGAA TCATTCGCTA ATTGTTTTTT AGGGCGATTG 5760 
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ATTTTTACTT 


CCGTCACATC 


AATCATTATC 


GTGTCCTCAA 


AGCTGAGAGG 


AGTTCTTGAA 


5820 


ATCGTAACAC 


CACTTTGAAC 


AAGAGTTACT 


TCAACCCATT 


GGCTCCGACG 


GATTAAGTTG 


5880 


CTTTCGTGGA 


TACCAAAATC 


AGCCGCAATT 


TCTTCATAAG 


TGCGGTATTC 


TCGCACATAT 


5940 


AGAAAGCGTT 


ATCAATTTAT 


TTATCTCATT 


TTTCAGAAAA 


TTCTTTTATT 


TCTGTAAAGT 


6000 


CTACGATACT 


CGATGTGTTT 


TTATATAATG 


ATAGAGTCTG 


AGAATCACTG 


TTCCGCTAGC 


6060 


CATTCCAATA 


GAGATTACCA 


AAGCCAACAT 


GACAACCAAG 


GTCGCACTTG 


CCAGTGCTTT 


6120 


ATTATAGTCC 


CCTGTCACAA 


AAAAGGCAGT 


TGTTCGGTAG 


GAGAGATAAC 


CTGGAACCAG 


6180 


CGGTGCCAAA 


ATGGCCAAGA 


TAAAGACCAC 


AGCAGGTGTC 


TTATAAAGAA 


TACTTAAAAT 


6240 


CTGGCTGACA 


CAAGAACCAA 


TAATGGCTGC 


AATGAAGGTA 


GCTACAATGA 


CATTGGTCGG 


6300 


TTCCTTGAGC 


AAGAGATAGA 


TTAGCCAGAC 


AGTCATGCCC 


AAAATCCCTC 


CAGGTAAGAG 


6360 


CATAGACCGT 


TGCACATTGA 


GTACGATTAA 


AAAAGTGATA 


ATGGCAAGAA 


AACTTGCTAC 


6420 


TGCTTGTAAT 


AAAAAGGTTG 


TTAGTGTCAT 


ATTAGTTCAT 


CAATACCAAG 


GCGACAGAAG 


6480 


TTCCTGCCCC 


TAAAGCGAGG 


GTAATGAGCA 


GGGATTCAAA 


CATCTTACTC 


ATACCAGAGT 


6540 


TTATGTGGTT 


GGTCATAATA 


TCACGGACCG 


CATTGGTCAA 


GGCAATACCT 


GGTACAAACG 


6600 


GCATGACCGC 


ACCAGCTATA 


ATCAAATCTG 


CCGTTGAAGG 


AAAACCTGTG 


TAGCGAGCCC 


6660 


AAAACTGGGC 


AATTATCCCA 


AAGACAAAGG 


CTCCAGCAAA 


GGCTGTCACA 


AAGGGAATTC 


6720 


GGATAAATTT 


TTCCACATAG 


AGGGAAAAGG 


CAAAACCAAA 


TAAGGTCGCC 


ACTCCTGCCC 


6780 


CAAGTGCGTC 


GTAGATATTT 


CCGCTAAACA 


TAACTGAAAA 


GAAAGGAGCA 


CTAAAGGTCG 


6840 


CAGCCAGAGT 


TACCTGCAAC 


TTAGTATAGG 


GAAGGGGTTT 


GGCTTGCAAG 


GCCGTCAATT 


6900 


GCTTAAAGGC 


TGTTTCTAAG 


TCAATCTGCC 


CCCCAACTAG 


CTGACGAGAA 


ATCTGGTTCA 


6960 


CATCGCAGAC 


TTTTTCGATG 


TTATAAGAAG 


AGGAGGTCAC 


GCGCTTCATG 


CGCAAATATT 


7020 


GGTATTTTCA 


ATAGAGAAAA 


AGATAGCGGC 


AGGCATGGCA 


AGGACATTGC 


AATCCACAAT 


7080 


CCCCTGCGAA 


TGCGCGATTC 


GAATCATGGT 


ATCTTCTACA 


CGATGGATTT 


CTGAGCCACT 


7140 


TTTAAGGAGA 


ATAGTCCCCG 


CTAGCATAAT 


CACATCAATG 


ACGGCATTTA 


ATTCTTTTGA 


7200 


TTCTTCCATG 


CTTTCCTCCT 


TTTATCAACT 


CCCTCTATTC 


TATCACAAAT 


CCGGACTCAA 


7260 


AAAAAATCTT 


TGCCATGAAA 


TCATGACAAA 


GATTGATTAC 


TCATTTTGAT 


TATCCATCTG 


7320 


CTTTTAAGGA 


GTAGCTGAAG 


TTGTTTTAGG 


TTTGTAGATT 


GAAATCTTGA 


CTCTAGTCTT 


7380 


ATTGAGGTCT 


ACCTTTTCAC 


CTGCTCTAGG 


ACTTTGTTCA 


ACAACCATGC 


CTTCTGCACT 


7440 


ACCTGCAGGC 


GCTGTCGTCA 


CTTCTACAAC 


TTCTATATTA 


GCTTCCTTAA 


TCCCAACAAT 


7500 
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TTGAATCAAA 


TTCTTf^TT Af3 


*p a A A i^wr a a 

1 nnAL luLAA 


GL I AGAAllA 


ATGTAACTlG 


GCATGGCAAC 


7560 


ACTTGTAACT 


TTTTT AdfT A 


Ll\3 1 wUUjnL 


AA1 1 JuAulA 


GGTTTACTCA 


lATCATAAGT 


7620 


CGTTCCGGCA 


V~V- 1 UVJAL Ill 


Oil 1LA1AA1 


LTj IT, LLTTGGT 


TCGCTTTCGC 


TGGACTCTTC 


7680 


TTCCTCTATC 


TTAATPAAAT 


1L1 LnuuAAL 


CTTCTTCTGC 


TTGAGTTCTG 


AGATTACTTC 


7740 


TGTAGAGTTC 


CGTCCAATAT 


nu 1 1 LLL 1 nn 


rnrpfil ITPPTtp 
1 1 unn 1 Lu 1a_ 


GTAGCTTTTT 


TAGCTACTGT 


7800 


CAAAACAATT 


x vxnvj i i V9L L l 


TV^PTPllPT'P 
luL 1 Liviu 1 L 


Al AviO 1 Lu 1 A 


CCTTCTGGTA 


GACTTTGCTT 


7860 


CAGGACCGTT 




i V- i un i i LuA 


p*p p*PT , p'P m pp 

L1L1 1L1 1 LL 


J L AATTTTAA 


TCAAATTATC 


7920 


TGGAACTTTT 


1 1 1 In 


AT*TVW2PAa m 

Al IVLuLAAl 


uALA 1 aJAuAu 


OA I ™rCCGAC 


CGACATAATT 


7980 




A A Afi ITTCPT 


tvspptp a <rf a 

luLL 1\jA 1 VsA 


GAlAAllAAA 


TTGATTTTCG 


TTCCTTCTTT 


8040 




L UiuLuLv.Au 


GA1 Llbl ALG 


GATAATCCGC 


CCTTCTTCCA 


CCTTTTCACT 


8100 




1 1L1 LL PL/VL 


LflA fLTCAAA 


ATTGGCTTTT 


TTGAGCGTTG 


CCTTGGCCTC 


8160 


1 wL/viL loJIL 


rp/-t » tTTPrTT* a 


CATCTGGAAT 


GGCAATGGTT 


GCAGGAGTTC 


TGGATAGTAT 


8220 


L Ln/ui 1 Anon 


uMuL rtjLLA 


L C AATAC AAG 


GCTGGCCAAC 


AAAATCAGGT 


AACGCATCTT 


8280 




TTTTTCGGTG 


CTTGTGGTTG 


GTAAGTTTCC 


TCTGTCACAG 


CCTGGCTTGG 


8340 


ullll 1 AiA 1 i 


GATTTGTGTT 


CTGTTTGCGC 


TTGAACCTTA 


GGAATAGATG 


TCAAGGTACT 


8400 


L lunuAnAL L 


TTCGGCAAGG 


TCTTGGTATC 


TGCCTTGCTC 


GTTTCATCAA 


AGATTAACTT 


8460 


nV. Ill Ln 111 


CTACGATTGT 


AGGACAAGCT 


ACTAGACAAG 


TCCACATACA 


TCTCTGAAAC 


8520 




CGATTGGTCA 


ACTTTTTAGC 


AGTTGCCTTG 


ATAATAACAT 


TTTCTAAAGC 


8580 


fTRAfifiTAPA 


GATGGATTTT 


CTGCAATAAC 


GGACGGCAGG 


GGTTTCTGGA 


AATGCTGGAG 


8640 




ACCGCGCTAT 


CCCCGTCATA 


AGGGATATGG 


CCTGTCAGCA 


TCTCATAGAA 


8700 


AATAATCCCC 


ATGGCATAGA 


TATCACTCTG 


/"» tv. ^ ta /**fYW"v/"' 


TTlGAAlCAl 


GCGCCTGCTC 


8760 


TGGTGACAAG 


TAATGAACTG AGCCCAACAT 


CGAGTTAGTC 


T/7PPTP jpip 
luw 1 LAviAL 


1 rGlLlLTGL 


8820 


AAAGGCTACA 


GCAATCCCAA 


AGTCTGTGAC 


CTTGGCAGTC 


CCATCTGGTG 


TCAAGAGGAT 


8880 


ATTTTGAGGT 


TTCAAGTCCC 


TGTGAACAAT 


TCCTCGAGTA 


TGGGCCAAGC 


GCATAGCCAA 


8940 


GAGAATTTGT 


CCCATGATAC 


GGACTGCTTC 


TTCATTAGAA 


AGAGGATAAT 


GTTCCTTGAT 


9000 


ATAGCGTTTG 


AGGTCCAGTC 


CAGCCACATA 


CTCCATAGCT 


AGGTACTGTT 


GACCGTCTTC 


9060 


CTCGCCAATA 


TCTGTTATCC 


GAACGATATG 


AGGATGGTCT 


AGATCTGCCA 


TAGCTCTCGC 


9120 


TTCACGCTGA 


AAACGAGCTA 


CAGCTATCGG 


GTCCGTCTGG 


TAGTTGGTCC 


TCAGAACCTT 


9180 


CACTGCCACT 


TCTTCCCCAT 


CTAAGATTAA GTCTTTGGCT 


AGGTAGACAT 


CCGCCATACC 


9240 


TCCTCGACCA ATCTGTTTGA 


CAATCCGATA 


GCGTCCGGCA 


AAAATCTTGC 


CGATTTGGAT 


9300 
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CATTCTGCAT 
GCATTGTTAG 
ACAATATCAC 
AGATAGTCAC 
CCAATAGACT 
" TGACCAGCCT 
TCTCCACGAA 
ATAATAGCAA 
TGGTGAATCT 
ACTGTATCGA 
GCGATATTCC 
GCTCTATTGA 
GTTAATAATG 
TGATGAAGAA 
TGATATCCTT 
AGGCCTTAAC 
TACCACCTTT 
AGGACGCGAA 
CGATTCCTGA 
CATGCACCTT 
GGGCATTTTC 
CTGTCGTAAG 
GCACTCGCTC 
GGATGGTAAT 
CCAGACCAGT 
TTCGACTTAG 
TGGCTTTTGC 
GAGAATAGGC 
CTTCACGCAA 



CCTCCTCGTT 
CAAAACGAAC 
GAATCTCACT 
CTGACTCAAG 
GGGTGATAAT 
TGAGCAATTC 
TCAAGCCGAT 
GGACTTCCAA 
TTTGATTTTC 
TCTGGGTATC 
CTGCGCGATG 
CATAGTGGTT 
AAATTTCCAT 
TCCATCACTT 
ACATTCATGT 
GACTTGAAAA 
GCCTAGTATT 
ATCTGCCGTT 
ACAAGGAGCA 
TCTGGCATCC 
TTGAATTAAA 
ATAAGAGGCT 
ATCACCTTGT 
GGCTCCATCC 
GGTTGCTAAA 
GTCTGTTACA 
TCTCTCCTCT 
AATGGAGTCA 
GATACGGCGA 



CATAGAAACA 
AAGTGTCTCC 
GCCTGAAATC 
GATAACTGTC 
ATTTTTTTGC 
ATTAACCAAG 
ACGCGAATCA 
AGTAGTTCCC 
AATTTCTAGG 
AACCCAAGCT 
ACCTCCCATC 
GACATAGTCT 
GTGTCAGTTC 
CCATACAATT 
TCTAGTTTTA 
TTCTCCTCTG 
TGACAAACAC 
TCTTTATTGT 
TCCACCAAAA 
AATTTTTGAG 
TCCAACTTGT 
ATATGGGCTG 
AAATCAAGCG 
GCAAACAAAT 
AGGGAATTAT 
CGAATACTGG 
CCGTATTCTT 
CGCTTGTTTT 
AGGACAGCGT 



699 
AGGGCAACCG 
GTTTTATCTG 
ATGTTGGTCA 
CCAAAATCAG 
GGATGAGCTT 
GAATGATCGC 
CCAATATGAG 
ATGCCTCTGT 
TAATGGGCGA 
ACACCCAGGT 
CCATCAGCTA 
TGGTTATTTG 
CTTCCTAATC 
CAGGTGTAAT 
CCTGCTCGAA 
AGACGATAGT 
TACCTAATAT 
ATTTGATATC 
TCTTATCAAA 
TTTGAACCCG 
GGTCGTACAA 
TTTTCCCACC 
TCGGAGCAAC 
TATGCCCTGC 
TCGCCTCCAA 
CTTTGTTTCG 
CCTTGAGTTT 
TTCGCTTGAT 
TGACCAATTT 



TAATGTTGTC 
CTAAAGGAAT 
AGCCGTCACT 
GCTGAATTTC 
CTGCCTCTTC 
TCGTCAACTG 
CATAGATAGC 
AAGCTTCATC 
ACCATTCACG 
CTGTGACCGC 
AAATAATCAT 
TTCGTTTCTG 
CGATATCTTG 
GAGGATACAG 
CTCGGGATGA 
GCAGGTGCTA 
TTCTAACTGA 
TGGTTTTCGG 
GGAATCCTGG 
ATCTTCAACT 
GTCCAGAGCA 
TGGAGCCGCA 
CAGCTGACTG 
AAAATGCCCC 
CAAGGCTTGG 
CACTAACAGG 
GGCAACTAGC 
GCTAGCAATA 
TTCACTGCCT 



TAAACCTCCT 
ATCACTGGTT 
ATTGAGCAAG 
ATCTTTTTGC 
TGGTGTCAAT 
ATGGTATTCT 
CTGATTATCA 
CTGACCAAGC 
CACTTCATTG 
CATTTCACTA 
GGTACGTCCA 
ACCAACATCT 
CGAAATTGAC 
CCGTCTTTCA 
CTCTCTAAAA 
TAAGTTATTA 
ATTTCCTGCA 
CGCAAAAGAC 
TCAAAAAACT 
CCCAGACGTT 
GTAACCTGAC 
CAGGCATCAA 
GACTCATCTT 
TGCTCCTTAA 
ATTTCCTCTT 
CTTTCAAAGA 
CAAACTGGGA 
TCTGGCCAGC 
TTTTTACGGA 



9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860 
10920 
10980 
11040 
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GTTTGGCCAA 


TTCCACTGCT 


TCATTAACCA 


CAGCATGATC 


TGGAATCTTG 


TCCAAATAGC 


11100 


GGAGTTGGTA 


GGCACTCATG 


AGAAGAAGGA 


CATAGAGCCA 


GCTGTCTAAC 


TGGTCTCTGT 


11160 


CTTCGATAAA 


GTGGGATAGG 


TACCATTCCA 


GAGTCAGTTT 


ACGGGCTACC 


GTTCCATAGA 


11220 


CCAGCTCGGT 


CACTAAGCCC 


TTGTCTGCTG 


CCAAAAGTTG 


ACTTCCCTTT 


AGATGCTTAT 


11280 


TTAAGGCGAT 


ATTTGAATAT 


GCTTGGTTCA 


CAAAAACATC 


CTCTAGCACT 


GCTAGAGCTA 


11340 


. AACTTCTAGC 


CGTTTCTACT 


TTAGTCACCA 


AATCGTTCTC 


CTACAGTCAA 


TGTACGTCCA 


11400 


ACTCCGTTGA 


GGAAGGAAGC 


AATGTCCATC 


TTAGGCTTAC 


CAGCTGGCTG 


CACTTGTTTG 


11460 


AGGGATAGAG 


CCCCTTCAGC 


CGTTGCGACA 


ATCAATTCTT 


TCTTGCCGAT 


AGAGAGAATC 


11520 


TCACCTGGAT 


TTCCCTGACC 


TTCTACTGGT 


AGGGCTTCAT 


AAATCTTAAA 


GCGGTCGCCC 


11580 


TTAAGGAAAG 


TATGGGCAAC 


AGGCCAGGGG 


TTCATTCCAC 


GAATTTGGTT 


AAAGAGTTGA 


11640 


CGATTGGTTT 


TGTTCCAGTC 


CAGTTTTTCT 


TCCTCTGGCT 


TTATATTTGG 


AGAGAAGGTA 


11700 


ACCTGACTCG 


TATCCTGCGG 


TTCAGGTTTG 


ATATCACCAG 


CAATATAGGC 


AGGCAGAGTG 


11760 


TCCAAAAGCA 


AATCACGACC 


AACTAGCGCC 


AATTTTTCAA 


ACAAGGTGCC 


AACATTGTCC 


11820 


TCATCTGTGA 


TCGGAATGCT 


GCGACGAGAA 


ATCATATCTC 


CTGCATCCAT 


TTCCTTAACC 


11880 


ATTTCCATGA 


TGGTCACACC 


AGCTTCCTCA 


TCCCCTTGAA 


TCAAGGCATA 


ATGGATAGGC 


11940 


GCACCACCAC 


GGTGTCTAGG 


AAGGAGGGAG 


GCATGAACGT 


TGACAGCAAA 


GTCCATGCTA 


12000 


TCAAGGAGTT 


TGCTTGGGAG 


AAACTGCCCA 


AAAGCAGCAG 


TCACAATTCC 


ATCTGCTCCT 


12060 


AGCTTCATAA 


GATCTTCCAT 


CTCTGGACTT 


CCAGATAATT 


TTTCAGGTTG 


GTAGATAGAT 


12120 


AGTCCTGCTT 


CCTTGGCAGC 


CTGCTTGACT 


GGGGTTTCTT 


GGATAACTTT 


TTTACGACCA 


12180 


ACAGCACGGT 


CTGGCTGGGT 


CACAACGGCT 


AGAATTTCGT 


AACGGTCATC 


TGTCAAAAGT 


12240 


CCTTTTAAGA 


CTGTTGCTGA 


AAAGTCGGGG 


GTCCCCATAA 


AGATTAGTTT 


TGTCATATCT 


12300 


TCTCCTTCTT 


ATAAAAATTG 


CTGCGGCTCA 


TGGTCAATGC 


TGAGACGGAG 


CTCACTATTT 


12360 


TCCCGTTCTT 


GAGTCAAGGC 


TAAAACCTGG 


TTGAGGGTCG 


ACCCCAGCTC 


ATCTTCTAAA 


12420 


CGGTATTTAA TTAAAATCTG 


GTAATGATAG 


AGGTTGTGGG 


TACGGGCAAT 


CGGTTTTGGC 


12480 


GTTGGCCCCA 


GAATGGGACT 


GGTCTCTGAC 


AAGCCTGACC 


GCAAAATGTT 


CATGACTTCA 


12540 


TAGGCACGTT 


TGAAAACCTC 


TTCTTCTTTC 


TTGTGAGAAA 


GGGTAATACC 


AATCGTGAAA 


12600 


TAGTAAGGTG 


GATAGCCGAG 


TTGTCGTCTG 


ATTCCCATTT 


CATAGGCATA 


AAAGCCTTCG 


12660 


TAATCTTGAT 


CCTTGGCAAA 


TCGAATAGCA 


TAGTGCTGCG 


GATTGTAGGA 


CTGTATCAAG 


12720 


ACTTGACCTG 


CCTTTTCAGC 


CCGACCTGAT 


CGACCTGCCA 


CCTGAGTCAA 


GAGCTGGAAG 


12780 


GTTCTCTCAG 


AAGAACGGAA 


ATCAGGCAGA 


TTCAAGGCCG 


TATCCGCATT 


TAGAACTCCG 


12840 
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ACTAGGGTAA 


CATTGGGAAA 


ATCCAAACCC 


TTTGCAATCA 


TCTGAGTACC 


AAGTAAAATA 


12900 


TCCGCTTCCC 


CTCGCCCAAA 


CTGGTCAAGC 


AAGGCTTGGT 


GACTGCCTTT 


CTTTCGAGTC 


12960 


GTATCCACAT 


CCATCCTCAG 


AATGCGAGCT 


TGGGGAAAGA 


GTTCTGCTAG 


CTCATCATAA 


13020 


GCCTTCTGAG 


TTCCCGTCCC 


ATAGTAACGA 


ATACTGCGGC 


TCTTACAGTT 


AGGACAGACC 


13080 


TGAGGAATAT 


CCTTCGAGAA 


ACCACAATAA 


TGGCAGTTCA 


TAGTCTTGGT 


ATCCATATGC 


13140 


AAGGTCAGAG 


AAATATCGCA GTTGGGACAA 


GTATCCACCG 


TCCCACACTC 


CCGACACATG 


13200 


ACAAAGCTAG 


AATAACCACG 


GCGATTGAGC 


ATGAGAACCA 


CCTGCTCTTT 


TTTAACCAGA 


13260 


CGGTCTTGGA 


TAGCCTCTAG 


CAAAGGAGGC 


GTAAAGTTTG 


ACGTCTCATT 


TTGTCCGATA 


13320 




AGTCAATCAC* 


TTGAACCTCA 


GGGATTGTAG 


CCAAAGGATT 


GGCACGTTGG 


13380 




AGTGTTGATA GACGCCTTTG 


CCAGCACGTG 


CCCGGCTCTC 


TAAGCTCGGC 


13440 




CAAGTACCAG 


AGTTGCTTGA 


TTATACTGAG 


CCCGTAAAAT 


AGCTACCTCT 


13500 




AACGGGGATT 


GCTGTCCTGC 


TTATAAGCCG 


CTTCATGCTC 


TTCATCAATA 


13560 




CCAGATTTTT 


CAGAGGAGCA 


AAGATAGCAG 


ATCTGGCACC 


AACAACAACT 


13620 




CACGCTCCAC 


CTTGCGCCAT 


TCATCATACT 


TTTCACCATT 


GGATAATCCT 


13680 


n AGTG AAG AA 


TGGCTACCTT 


GTCCCCAAAA 


CGTGCTATAA 


AACGCTCGGT 


CATCTGAGGA 


13740 


GTCAAGGAAA 


TCTCAGGTAC 


CAGCAAAATA 


GCTGTCTTGC 


CCTT AT CC AG 


GGCACCTTGG 


13800 


ATAATCTGCA 


AGTAAACCTC 


GGTCTTCCCA 


CTTCCTGTAA 


TCCCTTGAAG 


TAGAAAGGGA 


13860 


GGTTGAGAAC 


TGCCAATAGA 


ACTCACAACC 


GCATCACGCG 


CCTGTCTTTG 


TTCTGGATTT 


13920 


AACTCCAAAG 


GTCTACTTGC 


TTCAATTCCT 


TCAAAATAAG 


CAGCCGAGCG 


TTGAACTTCC 


13980 


TTTTGGACTA 


TGGTAACAGC 


ACCTTGATCC 


ACAAAGAAGT 


TGACTTGCTC 


TCGCGAGTAG 


14040 


GACTCTAACA 


AGCTAGCCAA GGAAGCGCTC 


TCTGGATGAG 






14100 


TCCAACTTTT 


TCTTGGCACG TGTAGAAATC TCAACACCTT 


CTAATTGAGC 


ATGGTCAACC 


14160 


TCATACCAAG 


ACTGGGTCTT 


GACCTTCTTT 


TGATCGACTC 


CCTGATATTC 


CAGACCAAGC 


14220 


AGGCCTTTTC 


TAGTCAAACG 


CATCATTTCA 


GCTTGCTTGG 


CAAGGTCTAG 


TGAAGAAAAG 


14280 


GCTAGCGAAT 


CTTCTGAACC 


AAACAGGCGC 


ACTCGTTCTT 


CCTGACTCAA 


GCCTTCCAGA 


14340 


GGATAGAGAA 


TCTTGTCATA 


GCTAGAATTC 


AGAAACCCTG 


GAAGCATGGC 


CTTGAGGATA 


14400 


GAGATTTTGT 


AGGAGAAGAC 


AGATTTGCGT 


AACTCCTCAG 


CCAGCCAGAG 


TTGTTCTGGC 


14460 


GTGAGAACAG 


GAGAAAAATC 


CAGCACCTCT 


GCAATATCTT 


TTAAATCTTG 


CTCCATCTCT 


14520 


TCTCCATCTG 


ATTGGGACTT 


CAAACCAAGA 


ACAATCCCTT 


GAATCAGGCG 


ATTACCCTTA 


14580 
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CCAAAAGGCA CATGAACCCG CATCCCAACT TCCAGCATTC CCTCAAATTC CTCCGGAATC 14640 

CTGTAACTAT AGGGCTGGTC CGTCTGCATC AAGGGCACAT CTACGATAAT CTTAGCTAGG 14700 

GCCATCTTCT CACCTCCTCC TTGTCAGTAC ATTCTTGCAA TAGAAAAAAT AAGATTGAGT 14760 

CCCCCCAACC TTAAATTTTT TCACCATCTT CTTTTTCTTT AGCAATTTGC TCTTTGATTT 14820 

TCTTTTCTTC TTCTTCTTTG CGGCGTTTTT CTTCTTCGAT ACGGCGACGC ACTGCTTCAC 14880 

GTTTTCCTTC TGGATCTGGG TGAATTGTAA CGTTTCCTGA TTCGATTTCT TCTAAAG CGC 14940 

GAAGAGTTGA TTTTTCAGAC TTGAAACCTT GAGTTGCTGG GGCACCTGCT TCCAATTCGT 15000 

GGGCACGTTT TGCTTCCAAG ATTACGAGTG AATATTTTGA AGGAACCTTG TCGAGCAAGG 15060 

TATCAATAGA GGGTTTTAAC ATCATTTGCT TGTACCTATT TTCTAAATTT TATCGGGTAG 15120 

TTGGAGATTT TGGTAACATC TCCTGATAGT GACCAATGAC ACGATCCACA CAGAAGTGTT 15180 

CTGCTTCAAT CACACATTTG ACACGTTCAG CAGCTAGGGG TACCTGATCG TTGACAATCG 15240 

CATAATCATA CTCACGCATG AGGGCAATTT CTTCCTTGGC CTTTTCGATT CGTTGGGCAA 15300 

TCACTTCTGC ACTATCTGTT CCACGACCTA CCAAGCGATC TTGCAATTCA TCCAAATCTG 15360 

GTGGTGTCAG GAAGATAAAG ACAGCATCTG GAACCTTTTT CTTGACCTGA AGAGCACCCT 15420 

GAACTTCAAT TTCAAGGAAA ACATCGATTC CCTTGTCCAA GGTTTCATTG ACATAGGTCA 15480 

GAGGAGTTCC ATAGTAGTTA CCGACATATT CTGCGTATTC CAACATCTGT CCTTGACGAA 15540 

TCAGCTCTTC AAATTCTTCA CGAGTACGGA AGAAATAGTC AACACCGTCC ACTTCTCCAG 15600 

GACGTTGTGC GCGTGTCGTC ATCGATACAG AATATTGAAA TTGGTTTTCA GAACTCTCAA 15660 

AAATCTCTCT TCTAACCGTT CCTTTTCCAA CCCCTGAAGG ACCAGAAAAA ACGATTAGTA 15720 

AGCCTCGGTC TGCCATTGTG TCTCCTTTTA GTCAATCTGT GAAATAACAT TTCTCTAGAA 15780 

TAATGGCAAA AAGCCAGATT ATCCTTTACA GTCTTTCTAT CTAGTGTAAC AAAAAAGCAG 15840 

TAATTTTTCA ACTGCTCTTT CTTATTTATT TAGCATAATC TACTGCACGA AGCTCGCGAA 15900 

TCACGGTTAC CTTGATATTT CCTGGATAAT CGAGATTGTT TTCAATTTTC TTACGAACTT 15960 

TGTGAGCCAA GATTGTGACT TTGTCGTCCT TGATTTTTCC TGGATTGACC ATGATACGAA 16020 

TTTCACGTCC TGCTTGAAGG GCAAAGCTAG TTTGCACTCC TTCAAAGCCG TTAGCAATTT 16080 

CTTCCAAATC ATGGAGACGC TTGATGTAGC TTTCAAGAGA CTCACTACGA GCACCTGGAC 16140 

GGGCTGCGCT CAAGGCATCT GCTGCAGCGA CGATAACTGC TATCACGCTC TCAGCTTCAA 16200 

CATCTCCGTG GTGACTAGCA ATCGTATTCA CCACAACTGG GGGTTCCTTG TACTTACGGG 16260 

CCAATTCCAT ACCGATTTCA ACGTGGCTAC CTTCAACCTC ATGGTCAATG GCTTTCCCGA 16320 

TATCGTGAAG GAATCCAGCA CGACGGGCAA GAGCCGCATT TTCACCAAGT TCGCTCGCCA 16380 
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TGATACCAGC 
TACGGAACTG 
TTTCATAGGC 
TCTCAACCAA 
TAGTCATACG 
, GTGTATCGTC 
CTTCACGACC 
TTGACTCCGC 
CCATTTTGTC 
CCCTGGTCAA 
GCGCACCAAT 
CACGCGCATC 
GTTCTTTACT 
GACTTTCGAT 
GGTATTTTCT 
CACGTTTGGC 
TTGCTTCTTG 
TAGCTGAGAT 
CAAGCGACAT 
CTTTTACATT 
GAATTTGGAA 
CTTGAGAAAA 
ACGCTATCTC 
GCTCCTTGGT 
TGAATATCCT 
TAGCAATAAT 
ACATGGACCT 
AGTTCCTTAT 
AGGACTACAA 



CAACTTAGCA 
CAAACGTCCC 
AGCAGCCTCA 
CTCTTCGATA 
GGCAATCTCA 
GATAATCACA 
AATAATGCGT 
TACATATTCA 
AGAACGTTCC 
GTTTTCCTCT 
ACGCTCTAGT 
AAGGTTTTTC 
CGTCAAATTG 
TTGTTTGAGT 
GGCTTCTTCT 
TTCATTAACA 
TTCAGCATTT 
GCTGACATAT 
GATTTCCATG 
CTACCATAAA 
CACATTTACC 
AGCCTACCTT 
CATCCATCAT 
AATCATATTT 
TGATTAGTTT 
CTGCCCAGGC 
GAGTACTTCC 
TTTGAGCAAT 
TGTCCTCATC 



ACCTCAATCG 
ATAATCTTCA 
CCGTATTCAC 
CGAGCTGGAT 
CGACGAATCG 
TCGACCCCTG 
CCCTTCATAG 
CCAGCGATAC 
TTGACCTCTT 
GTCTGAGCCA 
TCTGCTTCTT 
GCTCTATCAG 
TCGTCCTTAC 
TCTTGACGTT 
TTGGCCTCCA 
AGTAAATCCG 
AAAAGCATCA 
CCAATGACTA 
TTTTTACCTC 
AAAGTGATTT 
AAAATAAACT 
TCAATAGACT 
ATAAATCAAG 
GAGTGGTTCT 
ATTGATTCTT 
ATCTTCTGTA 
AGCACGAACT 
TCTCAGGGTT 
TGGATTTTTA 



703 
AATGGCGCAA 
TCAAGTCTGG 
GAATCTTATT 
GTATACGACC 
GATCAAATCC 
TCAAACTTTC 
TATCGTCTGG 
GTTGCATAGC 
GCTCAGCTTC 
AGATAATATC 
TTTGTCTTTC 
AAATACTTTG 
GGTCAAGGCT 
CTGATTTGAA 
ATAGTGCTTC 
CTTCACGCTC 
ACTCTGCAGC 
AACCAATGAT 
ATTTTATTGT 
TCACAAACCT 
TGTTGTTTAG 
TAGTAATGAT 
CGATTTTCTG 
GGTTTACCTA 
TTTAACGTTT 
AACTTGAGCA 
TGAGCCATTC 
TCTTGGATAC 
TTGACCACCG 



AACATTTTGT 
ATGAAGGTTT 
GTCAATCTCT 
ATCTTTGAGC 
TGACAAGGTC 
AAAGGTACGA 
CAGATGAACT 
TTGAACCAAG 
GCGAATGCGA 
TCGTGCTTCT 
GACTTCCTCT 
TTCTTTTTGT 
AGTAGCTCTC 
TTCAGCGTCC 
TTTTTTAAGA 
AGCTTGTCCA 
TTCCTGAGAT 
GACGGCAAAA 
TATTCCGAAT 
AAAATAGAAT 
AAATAGTAGT 
CTTTAAAGGA 
CATCAATACG 
TTCCTGTAAA 
TCTTATCCTG 
GCATTTCTTA 
CTCGCAAAAC 
TATCCCACTC 
TCAAAGGCTC 



CCATATGAAG 
GGCGCACCAA 
TGACGGTTTT 
AACATTTCCA 
ACCACTTCTG 
ATGTTACGAC 
GTTGAGTTTG 
ATGTCCTTGG 
CTGGCAATCT 
GCCTGAGACA 
AATTGCTCTT 
TCAAGTGTTT 
TCTGTCAAAC 
ACTTCTTCAC 
GACTTGCTTT 
CGTAAATTAG 
GATTTCATCT 
ACAGCAATCG 
GACATACATT 
ATGTTTTGAG 
TTAGTAGAGA 
CAAGAAAGCC 
CCGTGACCAG 
GGGATCACGT 
ATTTTGCCAG 
CTCCTCAATA 
CTTATCAGAA 
ACTCTTTGAA 
AAATTCATCA 



16440 

16500 

16560 

16620 

16680 

16740 

16800 

16860 

16920 

16980 

17040 

17100 

17160 

17220 

17280 

17340 

17400 

17460 

17520 

17580 

17640 

1770O 

17760 

17820 

17880 

17940 

18000 

18060 

18120 
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TTTACCTTCT 


704 

TCATGTAGTC CTTTAAATGA TTTCGGAATG 


TTGAGTAAAG GACTGCTTCC 


18180 


ATAACCATAC 


CTCGTTTTAG CTCTTTTCCA CTATTATACA 


CGAAAAGAAA 


GAAATTGTCA 


18240 


GGAACTTGTA 


CAAGATTTTC TTTTCTATCT ATTTATACTC 


AATGAAAATC 


AAAGAGCAAA 


18300 


CTAGGAAACT 


AGCCGCAGGC TGTACTTGAG TACGGCAAGG 


CGACGTTGAC 


GCGATTTGAA 


18360 


TTTGATTTTC 


GAAGAGTATT ATTCGTAAAA AATCTCAAAA 


AGCCTACCTT 


TCGGTAGACT 


18420 
18436 


TAGTTTGTTT 


CTATTC 






(2) INFORMATION FOR SEQ ID NO: 88; 









(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 7001 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 
ACGTAGAAAA ACTATTTCTA TCACAGATAA TATTCCGTAT GTTGTTGGAG 
AAACGTCCTA GGTATCTTTC TCAGTCTATG TGACTTACAA GGGAAAACTC 
AGAAATTTTG AATGAAGATT ATCCTATTTC AGAAATCAAT TCCACCATTA 
AAAAACAGCT ATAGAGTACG TCCCTTTGGA AACAAAATTA CTTGGATTTG 
ACCTGGACAT TATAACAAAG ACTCCGGAAG TATCATTACA AACAACCCCA 
TTTTAATTTA TTAAATGTAA TTAAAAGATT CAATTTTCCT TTTATTGTAA 
CGATTGTATG GCTATAGGAC AATACCTTTT TAATCCACAC AATACCCCCG 
TTTCCTACAC GCTGGATTAG GTATTTACAC TTCCTTTTTC ACAAAAGAAA 
CTCTAAAAAT CCTTATATCG GAGAAATTGG ACACACCATT GTCGAATTGA 
TTGTGAATGC GGAAAAAAAG GTTGTTTACA AACATATATT TCGGATGCTT 
ACACGCCCAA TTATTATTTA AAAATTCCCA ACTAACTGTA CTAAAAAGCC 
TGAAAAAGAC ATTCATTTAG ACACCCTTTT AACGGCTTAT AATTTAGGCG 
ACGTCAACAA ATTGATAAAG GAGTCAATTT ATTAGCCACT TCTATTGCAA 
CATCAATCCT GCTGATAAAA TCTATATCAA CAGTCAATTG CTTAATTATC 
TCATGAAGTC AGGGATAAAA TCCAAGACCA GCTCCACTTC GTTCCCTTTA 
AGAAATTGAA ATTTTACCTT ACAACAAACA TCGTGGAAGT ATAGGAGCTT 
TATCGTCGCT TTTTTCATAG AACATAGCAA TGTATTACAA GATATTATTT 
TATTAGAAAT CTATAGACCT GTTTAAATCA ACTATAACCT GTAGTAGATA 



GTATTGAAAT 
TTTTCGAGAC 
CCAATATGAT 
GCTTATCAAT 
TATGGGAATC 
AAAATAATAT 
ATAACTTTAT 
AAATAGGAGC 
ATGGGCAATA 
GGTTAATCAA 
TTGTAAAGAC 
ACTCCGCTTT 
ATCTCCTCCT 
AACCTTTCAC 
CTCGTAATAT 
GTGCATTAGC 
CACCTTAATA 
TCTCGTATTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
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AGACAATATG AAAACAAGAC GACTTCCATA TAGGAAACCG CCTTCTCGCT ATGTTGAGTG 1140 

ATTTATATTA AAATAACTTT TCTTCTAGCT GCATTTTATT ATTATAAAAA CATTCATCAT 1200 

AACCCCCAGA ACTTAAATAA CAATTTTTAT TCAAGATACA TACTCCTAGA ATAAACTTTA 1260 

TATGAAATTC TCATTTTTGT TTTTACAATT CTCCTTAGTT AAATCTTGTT TAATATATGT 1320 

TTTACATATA GTATTTAGCG CCACATAGTA CTGAACTCTC TCCAAAAACG GTTATTCCTC 1380 

TTTGAATAGG GCGTTATCAC AAGAAAAGCA TCTCCACGTT TCAACTTCAT ATGGCTCAAA 1440 

AACAATCAAT TGATGCTAAA ACCTGTACCT AGATGTTTCG GTTCATAAAA CCATGAAACT 1500 

GTAAAAGTGG ATGAAATTGA TAGCGATAGT CAAATCAAGA GGCATCATAA CTCTAAAAAG 1560 

TCACAATATA TAAGTTCATC CTCGGAAAAA TATCATTCTA ATTGTTGAAA TGCCTACATG 1620 

AAAAGAAACG TCAAATGCTC ATGAAACAAC GAATACAGGT ATCAAAACTA TGACAAAACA 1680 

AATCCCTAAA TTTACTAAAG ACACTGCTCA ACTTTACACC TGTAAATGGT TGTTGTATAA 1740 

TAAAGTTACA AAGATGTACG ACCACACTGT TGTAAATCAT AGTGTTCGCG AATATATTAC 1800 

TGATAGCATT TCTACAAATA CAAGTAAAGA GAGCGGATGA GATTCAAACG AAATATGTCA 1860 

GTGCTTTGGC ATTCCTAGCC TT CAT AT CAT TTAAAGAATT CTATAGACAA AATTTTTTCC 1920 

AATACAGACA CTCGTAACAA CTGCTTCATT TTTCTACCAA CATATTTAGG AACAGGATAA 1980 

GATACAAGAG TATTAATCCA TAGCTCAGTT CTATACCAAT CTAAGACAAA TAAGCTAAAA 2040 

AAACGATTGA TAATAAGCAA ATAGATTCCA AATTTTCTCT ATCTGCTCAT TTTAATAAAC 2100 

AATACTAGTG TAACTATCCT TCCAGTCAGA AGCTTGTCAA ATCACACCGA AAATTCTTCT 2160 

AAAATTTATC TCGTTAGGCA ATCAAGCAAA AACTCGACGA TAGTACAAAC ATTATCATAC 2220 

AGGATTGACT TCCTAAATTA TATACTTTAG TAAGGTTTTC GGATAAGAAA AAAGGTTCAT 2280 

TTTACATTTC TAAACATTCT TTTCTAAGAT GAAAAACAGA ATTTTTCGAT TGTGATTTAA 2340 

AGCAACAAGA AGATTTTCAG TATCATCCTA TAGATACGAG CTAATTAAGA AAAACTACAT 2400 

TTTTGAATAT AAACTACAAT AATATAAACT AAATTTTATA GGAGGAAGAC AATGGATTGG 2460 

TACGATTATA TGATACAGGC ATCCAAACAA TCACAATTCA ACGCAAGCCA TTGGTTTCGC 2520 

TATTTGCGAA AAGTTATTTT TGAAGACTAT TCTTATTTAA CAAACCAAGA TGTAGAAAAG 2580 

TTGCTAGACT CCAAAGAACT AACCCGTTTT CAAAAAATTA GCTTGAAGTA TGCCTTTCAA 2640 

GAG CAT ACT C CAACTCATAA ATATGTGATT TCATTAAATA AACCTGCTAA GTTAACCAAT 2700 

GTTCAAAAAT TGATGGAGAA ATACAAACAT GGATAAAATG AAACCGGTCT TCCAAGCCCT 2760 

AAATAAGGAA TTAATTCAGG AAAATCTGAC TTTAACAATT ATCTGTGTCG GTGGTTATGT 2820 
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CTTAGAATAT CATGGTTTAC GTGCCACACA AGATGTTGAT GCTTTTATGG CTCTATAATA 2880 

TTTGTAGTGG GTAAATCCCC TATGGATATT ATGGAGCCTA TTTTTGTGTA GAAAAAAAGT 2940 

CCCATATGAC CTATAATGAA AAGCGACAAA ACAACTCATT AGAAAGAATC ATATGGAACA 3000 

ATTACATTTT ATCACAAAAT TACTAGACAT TAAAGACCCT AATATCCAGA TTTTAGACAT 3060 

CATCAATAAG GATACACACA AGGAAATCAT CGCCAAACTG GACTACGACG CCCCATCTTG 3120 

CCCTGAGTGC GGAAACCAAT TGAAGAAATA TGACTTTCAA AAACCGTCTA AGATCCCTTA 3180 

CCTCGAAACA ACTGGTATGC CTTCTAGAAT TCTCCTTAGA AAACGCCGTT TCAAGTGCTA 3240 

TCACTGTTCA AAAATGATGG TCGCTGAAAC TTCTATCGTC AAGAAGAATC ATCAAATTCC 3300 

TCGTATTATC AACCAAAAAA TTGCGCAAAA GTTGATTGAG AAGATTTCTA TGACCGATAT 3360 

TGCTCATCAG CTGGCCATTT CAACTTCAAC TGTCATTCGC AAGCTCAATG ATTCTCACTT 3420 

TGAGCATGAT TTTTCGCGTC TTCCTGAGAT TATGTCCTGG GACGTTGAAA CAGTCCGGGG 3480 

AGTGACTGTT TCAATCGGGA GATGGAGATG AGCTTTATTG CGCAAGATTT TGAAAAGCTC 3 540 

GATATCATCA CTGTTCTTGA AGGTAGAACA CAAGCTGTCA TCCGAGATCA CTTTCTTAAA 3 600 

TATGATAGAG CCGTCCGATG TCGCGTCAAA ATT ATT ACTA TGGATATGTT TAGTCCTTAC 3660 

TATGACTTAG CTAGACAACT TTTCCCGTGT GCTAAAATCG TTCTTGATCG CTTTCACATT 3720 

GTACAACATC TTAGCCGTGC TATGAGTCGT GTGCGTGTCC AAATCATGAA TCAGTTTCAT 3780 

CGAAAATCCC ATGAATACAA GGCTATCAAG CGCTACTGGA AACTCATTCA ACAGGATAGC 3840 

CGTAAACTCA GCGATAAACA TTTTTATCGC CCTACTTTTC GTATGCATTT AACCAATAAA 3900 

GAGATTTTAG ACAAGCTTTT GAGCTATTCA CAAGACTTGA AACATCACTA TCAGCTCTAT 3960 

CAACTCTTGC TGTTTCACTT TCAGAATAAG GAACCGGAGA AATTTTTCGA ACTTATCGAG 4020 

GACAATCTTA AGCAGGTTCA TCCTATTTTT CAGACTGTCT TTAAAACCTT CCTCAAAGAT 4080 

AAAGAAAAGG TTATCAACGC CCTTCAACTA CACTATTCTA ATGCCAAACT GGAAGCGACC 4140 

AATAATCTCA TCAAACTTAT CAAGCGCAAT GCCTTTGGTT TTCGAAACTT TGAAAACTTC 4200 

AAAAAACGGA TTTTTATCGC TCTGAATATC AAAAAAGAAA GGACAAAATT TGTCCTTTCT 4260 

CGAGCTTAGC TTTTTTTCAA CCCACTACAG TTGACAAAGA GCCGGAAAAA GGAACAGCCT 4320 

TAGCTTTCCT TTCATTTCTT TTTATTTCCC TCGTAGTAAA CGTGCTAGCT TCCACAAAAC 4380 

AAACAGGATT CCCAGAAATG CCAGTACCAC TAGCCCACGG TACAACCATT GAGAGGTTGC 4440 

AACACGCGAT ACAGATTGTC CTTCTTTCGT AAAAGCAACC CTCGCAACTG CAGCTGTTTG 4500 

TGGATCTGAT TTTTGATAAA CAGCGACTCG TTCAAAATTC ACTAATAAGC GTTTATTAAA 4560 

GGTAGGAATC GGATCGCAGG TTATCAAGGT CATGATATTT TTAGAGCTAA CCGATTCTAA 4620 
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TTTTTCCCAT TCCGACGGTA AAATAATCTC 


TGTGTCCATC 


ATCTGATATT 


CTACAATTTC 


4680 


CTGGCCATTA TCATAATAAA 


GAGCATCTCC 


AACTTTTAGC 


TGATCCAAAT 


GGCGGAAAAA 


4740 


GACATGGCTT GGCTCTGCAC 


GGTGCCCAGC 


AATCACTGAG 


CGAATCCCTG 


TACCATCCAG 


4800 


AGGCAGCGGT GTACCATCCA 


CATGAGCCAA 


GCCCATCCCT 


AAATGATGAT 


AATCTGCTCC 


4860 


CAAATAAACC GGCTCCATGA TTTCCAAACT TGGAATAGAC AAGTAACCAT AGACTGCATC 


4920 


AC3GGTCGTCA GACACTTGGT 


AATTGACCTC 


ATATCCCTCC 


GCCAAAAAAG 


GATCTACAAT 


4980 


GCGATTTTGC GAAGCCAAGC 


GTTGATTGTA 


GGCGAGAGAA 


TGGTTCTGTT 


GTTCTTGGTA 


5040 


CATTTCAGTT GTCATGGATT 


TCACAAATGT 


AGCATGACCT 


TTCACCTGTC 


CAAGAGACTG 


5100 


CAACACCATC TGTCCAAAAC 


AATAAATAGG 


AATCAAACAG 


GCTACCAACA 


TCAACAAGTA 


5160 


TCCCAATAAG GCTCGTAGTT 


TAGTCCTTGA 


CATGACGCCC 


CTCCAATTGC 


TTTTCTAGTC 


5220 


CTTTGACAAT CCGTCGATTA 


CGATACACGC 


GATACAGCAA 


GAGAAGGATG 


ACCGCCATCG 


5280 


CTCCTAGTAA TAACCACAAC 


CAGAATTGCC 


CACGCTCTCT 


CACCGCTCGA 


TTCCGCTCTG 


5340 


CAATTGGTGC CGTATACGGA ATCCGCTTCC 


CACGTACCAA 


CAGACGATGA 


CTGTTAATCA 


5400 


TATACGGTGT ACAAGTCAAC 


AAGGTCGCAT 


AATCTTCCCC 


ATGTTGAATC 


AAGACAGGCT 


54 60 


CAAAGTCATT CGGCTCCACC 


GTCACTATCT 


GATCCACTTG 


GTAGGCCAAC 


ACCTGATCTA 


5520 


AAACGTGAAG ATAAAAGATA 


TCCCCTTTTT 


TCATCTTATC 


CAATTGACTG 


AACAATTCTG 


5580 


CCGTTGGCAA TCCTCTGTGA GCAGTGATCA 


CTGTATGGGT 


ATTTTCACCT 


CCAACAGGCA 


5640 


GCGAAGCCCC TTCTAACAGC 


CCTGCCCCTT 


TCTGAAGAAT 


GTCCTCACTC 


GTTCCGACAT 


5700 


ACATCGGAAT TTCCTGATCA 


ATCGCAGGAA 


TTTCCACATA 


GCCAATCCGC 


TCATGGACCT 


5760 


TTAGCATATT GGCATATTCT 


GAGACGCCTT 


TCTTTTTCTC 


TTGCTCTGTA 


AAAGGATCAA 


5820 


GAATTTCAGA TGGTTTCAAG 


GTCGCATTGA 


AGGCTTGAGC 


CAAGCGCCAA 


CGCTCCTCAA 


5880 


GTTCTGCCTT ATCCATCTGG 


GAAACCGTCT 


CATCAAACTC 


TTTAATAACC 


TCGTTTGACT 


5940 


CAATACGATA ATAATAACGA 


GACACCAATG 


GATATATCGC 


AACGGCGAAT 


CCTACTAAGA 


6000 


AAATCAGAAG AAGGATCAGC 


GGATGTTTCT 


TCTTTTTTGT 


GCCTTTTTTT 


CGTGAACGTC 


6060 


TACTGTTGTC CATCCTCCAC 


CTTCACTTCC 


TTCCTTGCTG 


CTTTCAGCGC 


CTTCAAAGCC 


6120 


TTTTCCGGTT GTTTTTTCTT 


CTTGCGCAAG 


CGTCGAATAA 


TCCATAAAAG 


AATCACAATC 


6180 


AAACCAACTG CCACATAAAA 


CAGGTAGCGA 


TAGAGATGAC 


TGAGTTTGTT 


TGCTGCAATA 


6240 


AATTCTTCCT CAACCTCTGC 


TACGTACGGT 


ATCCGATGCC 


CCCGAACCAA 


TAGACGATGG 


6300 


GTATTGATCA TGTATGGCGT 


ACAAGTCAGC 


AAGGTCACAT 


AATCATGACC 


TGGTACAATC 


6360 
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AATAAATCAT CAAAGTTCGT CGGCTCAATC ACCTTTACTT GATCCACTTG ATAGGCCATC 6420 

ACTTCCTTGA TATTGTGCAC ATAAAACTTA TCCCCAACTT TAAGTTTGGT CAAATCCGTA 6480 

AACATCTTAG CTGTTGGCAA ACCTGTATGT GCCGTAATCA CCGCATGGGT CGAATTGCCT 6540 

CCGATCGGCA GAGAAGTTCC CTCTAGATGC CCAGCCCCTT GCTGCAATAC CTCTTCAGCA 6600 

GTACCAGCAT AAACCGGCAA ATCCACGTCA ATAACGGGGA TTTCCACATG CCCCATCCGC 6660 

TCATGGATTT CTAACATACG TGCATACTCT GCTCGCCCTT TTTTCTTCAT TTCTTCCGAC 6720 

CAAGGATCGC CACTCACTAC ATTATTCAAA GAGTCATTGA AGGCTTGTGC CAATTTCATT 6780 

CGTTCATCAA TGTCAGCCTC ATCCAACGTT GCTTTTTCCT TATCAAAGTC AGCAATTTGT 6840 

TGATTTGATT CCACTCGATA ATACAAGCGA GACACCAGCG GATACGCCAT TACCGCCATT 6900 

CCAATGAAAA ATACCACTCC TAATAGGAGA TTATTTCGTT TTTGCTTTTT TGTTTTTACC 6960 

ATTTTTATCA GCATCCCTTT ATCTTCAAAC TTCAGGGTAT C 7001 



(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10411 base pairs 

(B) TYPE: nucleic acid 
{C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 



GAGGGAGCTT 


AAGAAGTTAC 


CACCGTCCTC 


TAGCGCCTTA 


TCCGCATCAA 


AGTTAAGGTT 


60 


GATATTTTTA 


AAACTGTCGC 


CAGCTTGTGA 


TACGATGCTT 


TGTTTAAGGT 


CATTTAGGGT 


120 


TTTAGTGAAA 


TCTGCATTGC 


TGAGGATATC 


ACTCTTTGAG 


AGATTCAAGG 


CAAAATTGAT 


180 


GATGATATTG 


ATCTGGTTTC 


CTGTTATGAC 


CTGATCAAGT 


TTGTAATTTT 


TTAAGGTATC 


240 


TTCAACAATC 


TTGCGGATAT 


CTTCTTCTGT 


CAGATTTCCC 


TTACTTTCTT 


TAGCTTTGGC 


300 


GAGTCCTGAC 


TTGATATCAG 


CTAGGGCAAC 


GTTTAATTTA 


TTAGCATCAT 


AGCCTGA1-7T 


360 


GTCCTTGTTT 


TCAGCATTGA 


TATCTGACAA 


AGCTTTTAGC 


TCTTCTTGAG 


CCAAATCTTT 


420 


ATTAGCTTGT 


GGCACCTTGG 


CTCCATTAGC 


CTCTAGCGAA 


TAGTAAATCC 


CTGCTAAAGC 


480 


ACTTTCTCCT 


GTAACTGGAA 


TAGGGGCTGC 


TACAGTGATT 


TTGGCATGTT 


CCATACCCAG 


540 


CGTTACTGCT 


GCGTTTCGGT 


ACATATCCTG 


AGTCACCTTA 


GTGATATTTT 


CTGGTGTTTC 


600 


AATCTTGACC 


TCAAGTGGCG 


ATTTGTCACC 


TAGCTTTTGA 


ATCTTGGCTG 


ATGAATACAA 


660 


CTGTAAGCTA 


GAGTCATTGG 


CCACATTCAT 


GATTTTAGAA 


TAAACATCAG 


GTGTCATGGT 


720 


CTTGAGTTCT 


TTGGTATCTG 


TTGAGGCATT 


GTAGCCCAGT 


TTTTTAAGAG 


TTTGATTTTT 


780 
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TTGGTCTTCA 


GATAGGGAGG 


AACCTAGGAC 


ATATTCAGGT 


TGGACATAGG 


TTTCATCGAT 


840 


AACTTTTTGA 


ACATCTGTTG 


CTGCATGGAC 


GCTATTCATA 


GCTGTTACTG 


CCCACAAGAT 


900 


CGCAGCGCTA 


GTCAGAAAGA 


GTTTCTTTCT 


CATAGGGAAT 


TTCCTCCTTT 


ACTTCTTTAG 


960 


AGTAATATAT 


CTATCTTAAA 


GAAAACTTAT 


AACAAAAACA 


CCTGGTCTAG 


CCAGATGTTG 


1020 


AAAAGAGAGT 


GAAACATTTG 


ATGATGTAAA 


GGTTAAGTCG 


TACCTGTCTA 


GAATAATAAT 


1080 


. AGTTTCCTCC 


ATTTACATAG 


AGTTCAGCAC 


CGTGAAAAAT 


GGAAATGGGG 


TGAATATAAC 


1140 


TATAAGTCTT 


TCCAGTCCTA 


TTACCAAGCA 


AGGGGGCAAC 


AGTCTCACGA 


GAGTACTGTT 


1200 


TGGCTAGAGC 


CAGGGTATTT 


TCCTTGCCAT 


TTTGGGCGAT 


AAAATCGATA 


TAGGCAGGTC 


1260 


CAAAATTATA 


GGCTTGAACA 


GCTGTCCAGA 


TATCTACCCC 


CTTCTTCTGC 


GCCAGATAGA 


1320 


GATTGCCTGT 


CAGAGTTTGA 


ATGCCTTGCC 


GAATGCTAGA 


GGCATTATCA 


TTGATGGTGT 


1380 


TGGTGGAACC 


ACTTGCAGAC 


TCACTAGACT 


GCATAACATC 


GCCTTCTTTT 


CCTTTTGTTT 


1440 


CAGTATAAAT 


CATAGCAAGC 


ACAAGCTCTT 


CGTTTGCTGG 


GGTGTCTTGT 


TCACTCAATA 


1500 


TTTCTCGCAC 


CATGGGTTGA 


TAGGTCATGA 


CTTGTTTGAC 


ATCTTGATGA 


ACGCGGTAAG 


1560 


CTTTATAGCC AGCAAAAAGG 


AAGACTGCTA 


GTACAAGCAC 


TCTTCGAATT 


CGTTTAAACA 


1620 


TTATTTACTT 


TGGATATCCT 


CGATATTTTT 


GATTAAGATA 


GAGTAGGTTC 


CATTTTCGTT 


1680 


TTGGATAAAC 


TCAACAGACT 


CGGCGTCTTG 


ATAGACGTTA 


TTGGGAACGA 


TGAGCTCAAT 


1740 


TCCATTTGAT 


AAGGAGAGTT 


TTTGGTTTTC 


AAATTTCTTT 


AATTGGCGAC 


TGGCATCAAT 


1800 


TTCATCAAAT 


TGAACAGGTT 


CTGGTACGGC 


TTCTTTGACT 


TGGTCAATAA 


AGCTCAAACG 


1860 


AGCCGTCAGA 


TTGTTGTCAA 


AAAGGTCATT 


AGCCAATTTC 


TCAGGTGACA 


ATTCATTGCT 


1920 


TTCTTCTAGG 


TTGTTGAAAA 


TAGCTGATTT 


GACCTTGGAT 


TGAAATTGAA 


AATCATCTGT 


1980 


GTTAAAAGAT 


TTAGCAATTC 


TCTGGGCTGT 


TTTTTCCAGT 


TCCTTGATAG 


ATTTTTTAGG 


2040 


AGAAATCTTA GGAGCGACAG 


CAAGAAGATT 


ATCTGAAAAA 


TAGTTCAAAA 


AAGTCCCGTT 


2100 


GTACTTGATT 


CGTTTTTCAA 


TCAGGTGATA 


CTTGCTACTC 


TGAAGATTGA 


CCACCAAGGC 


2160 


CTCATCAGCT 


CCTGTTCCAA 


ATCCAGGCAG 


GTTATTCTGA 


GTTAGCTTGA 


TTGGATTATC 


2220 . 


AACTTCTCCT 


CCGAGGTGGG 


TCAAGGTCTC 


CCGCAGGGCA 


ATTCGCAAGA 


AAGCGAAATG 


2280 


TTCTACACCT 


TCTTTAGAAA 


ATTGCACAAA 


AATCAAGTCA 


TTGGTCTTGA 


GATTTTCAGA 


2340 


AATGCTAAAC 


TCCTCTTTCC 


AGAGATTAGC 


CAGCGTTACT 


GATGTCTCCA 


ACAAATCGTC 


2400 


TGTAATATGA 


TTGAAGAAGG 


GATTTTCTTC 


TTCGAAAATC 


CCAGTCTTGG 


CTTCATCTGA 


2460 


ATACACATGT 


TCAATTTTTT 


TACGCAGGTA 


TTCTTCGATT 


TTTGGAGTAA 


TATTGAGAAA 


2520 
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CTTATCTGCT AAGAACAGTT CGGTATCATC CGGACTGAAC TGGTGAATAA TGGCTTTCTT 
AATATAAATG TCCATAAAAG TTTTAGTCCT CGTATAATGG GAAGGCATCT GTCAATTCTT 
TGACTGCACT TCTCACTTCT TCTAATACAG CCTCATTTTC TGAATTCTTA AGGGTTTTAA 
TGATGAGTTC AGCCACTTTG CGACTTTCTT CTTCACCAAA TCCACGTGCA GTAATGGCTG 
CTGCTCCGAT ACGAATCCCA CTTGTCTTGA ATGGTGACAA GCTTTCGTAA GGGATTGAGT 
TTTTATTTAA GGTAATATTG ACTTCATCCA ACAAGTTTTG AGCAACTTTG CCGTTTTCTA 
CAACTTTAGT CACATCAACA AGGAAGAGAT GGTTTTCAGT TCCACCTGAA ATAATACGGA 
AATCAGGGTC TTGCAAGAAG ACATCTGCCA TAGCCTTGCT GTTCTTAATT ACATTGGCAG 
CATATTCCTT GAAGGCTGGA TCCAAAACTT CTTTGAAGGA AACTGCCTTA GCCGCCACAA 
CATGCTCTAA AGGACCGCCC TGAATACCTG GGAAAATAGC TGAATTGATT TTTTTAGCAA 
GTTCTTCGTC ATTGGTCAAA ATCAAACCAC CACGAGGTCC ACGAAGGGTT TTGTGGGTCG 
TTGTTGTTGT GATATGAGCG TATGGAACTG GGCTTGGATG AAGGCCAGCC GCAACCAAGC 
CAGCGATATG GGCCATGTCC ACCATGAGCT TCGCACCGAC AGCATCTGCG ATTTCACGGA 
ATTTTGAAAA ATCGATAATT TGAGAATAGG CTGAAGCACC AGCTACAATC AGTTTTGGTT 
TTACTTCTTG GGCTTGTTTC AAGATAGCAT CAAAGTCTAA GAGTTCCGTT TTAGGATCAA 
CACTATAAGA AACAAAGTTG TAGGTTTGAC CAGAGAAGCT AACAGGAGCC CCATGAGTCA 
AATGACCACC TGATGCCAAA TCCATTCCCA TAACCGTATC ACCTGGCTCA ATCAAGGACA 
TGTAAGCCGC ACAGTTAGCT TGGCTTCCTG AATGTGGTTG AACATTGGCA AATTTAGCAC 
CGAAAATTTC TTTTGCGCGT TCAATAGCAA GAGTCTCTAC AACGTCTACT ACATCAGTTC 
CACCATAATA ACGGCGTCCT GGGTAACCCT CGGCATATTT ATTTGTCAAG ATAGACCCTT 
GAGCTGCCAT AACAGCCTTG GAAACTACGT TTTCCGAAGC AATTAACTCG ATATTATTTT 
GTTGGCGTTC TTCTTCTTTG GCAATAGCAT TCCAGAGATC AGCATCATAT GCTTTAAAAT 
CATCTTTGTC AAAAATCATA GGTCTTCTCC TTTATTGTGT GACTAGTCCA TTAGTTTGAT 
TTTACAATAA GAAAATCAAA CTAACAGATG CGAATAAACC GTTTCTGCAT TTTATCACAA 
GTATAGCCAA CTTTTTCATA AAATGCATGA GCACCCAGAC GATGATTGGC AGAATTTAAG 
CGGATAAACC CATAACCACA TCTTTTTGCT TCTTCTTCCA ACCCTTGTAG TAAACTTTTA 
CCAATACCTT GACCTTGCGC TTGAGGTGAA ACTGCTAAAG CTAAGATATT AAATCCTGCT 
TTGGAATAGA GTGATTCGTA AACTTCAGCG TGGACATATC CAAGTAAGAC ATGATTAGCT 
GCATCCTCAT AGCCAAGTAG GAAATGATGG GAATCCTGAG ACAGTCTAGC TAGTTGGCTA 
GCCGTTTCCT CTGGACTAAA AGTATAACCC AAAGCCTCTT GGTTGATGTC ACATATAGCT 
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TTCACATCAG 


TTTCTCTTAA 


ATCTCTTAGC 


ATCTCATTCC 


TCCTCAAAAG 


AAATCTTTGG 


4380 


CAACCGAGCA 


AGAATATCTT 


CTCGCTTAAT 


GGCCCCTTGA 


CGTAAGATTT 


TCACCTTGTC 


4440 


TCCCGACAAA 


TTCAAAATAG 


TTGAATCCTG 


TCCAGTTAGA 


AAAGCATCGT 


CTTCCAGACC 


4500 


CAGAACCTCT 


TGGTCAAAAT 


CCTCTAGAAT 


TTGATTAAAG 


GTCACTCCAC 


TCGCCTGACC 


4560 


TGAGATATTG 


GCAGACGGCC 


CAATCAAGGG 


ACCTGTCTCT 


CGAATCAAAT 


CAAGGGTAAT 


4620 


GGGATGACT A 


GGCATCCGAA 


ATCCAACAGT 


TGCAAGGCCA 


GAATTGACCC 


AATAGGGAAC 


4680 


TCGGTCATTA 


GCTTCGAGAA 


TAATGGTCAA 


GGGACCTGGT 


AAAAAGATCT 


CTACAAGTTT 


4740 


TTGAAGATAA 


GTTGGCTGAT 


TCTTTGAAAA 


GTACAAGATG 


TCCTCTAAAG 


AGGCAACATT 


4800 


GAGATTGAGC 


GCCTTGTCTC 


TACGTCGACG 


TTTAAGCTGG 


TAAACATGGT 


CAACTGCTTT 


4860 


TTCGTCTAGC 


GCCTTAGCAA 


AGAGACCGTA AACTGTCTCT 


GTAGGCAAAA 


CGACAGCTCC 


4920 


ACCATTTTCC 


AACTCTTGTC 


TAATCCTGTC 


CATCATCAAC 


GACAACCATC 


CTATCTTGAC 


4980 


CAAATTGGTC 


CTTGAGTGTT 


CGTACTCGCT 


TTTCAGGAAG 


ATGTTTCCTA 


AAAAGTTCAG 


5040 


GAACACTTTG 


ACCTTGCTTG 


TATCCAATTT 


CAAGGTAAAT 


CTTACCACCA 


TCTTTGAGAT 


5100 


AGTCTTTTGC 


ATCTTCCGCA 


ATTCTACGGT 


AAATAGCTAG 


GCCATCCTCA 


TCTGCAAAGA 


5160 


GAGCTAGATG 


AGGCTCCGAA 


TACAAGACAT 


TCAAGCCTAC 


CTCTGACTCA 


TCTTCACGAG 


5220 


AGATATAGGG 


TGGATTGGAA 


ACAATTATAT 


CATATTTTTC 


AGAAATTTCT 


GTAAAACAGT 


5280 


CAGATTTTTT 


TAAAAATATT 


TGAAGATTTT 


GATTTTTAGC 


ATTTTCGCTA 


GCTACATCTA 


5340 


AAGCATCTTG 


GGAAATATCT 


GCTGCCGTCA 


CTGACCAATC 


TGGTCTGTTT 


TTTGCTAGAG 


5400 


CGAGAGCAAT 


AGCTCCACTA 


CCTGTTCCGA TATCTAGGAC 


CATAAGATTT 


TTCACAGGAT 


5460 


TTTCAGCCAG 


GATAAGCTCC 


ACCAACTCCT 


CTGTTTCTGG 


ACGAGGAATC 


AAAACCCGTT 


5520 


CATCCACCTT 


TAAATGCATT 


CCATAAAAAT 


CTGCCTGTCC 


AATGATGTAC 


TGAGCTGGCT 


5580 


TGTGAGCTGC 


TAGTTGCTGG 


TAAATATCTT 


CTACAAATTG 


TTTTTCTTCC 


TCTGTTGTCA 


5640 


CCTCCTGCTG 


GAGGGCAAAA 


ATAAAGTCTG 


TAAAAGATAG 


ATTTTTCAGA 


CTACGATAGA 


5700 


CAAAAGAGAG 


GCTTTCCGCT 


TCCTCTCCTT 


GTCTTATCAA 


CTCTTCTTCA 


AAATTTGAAA 


5760 


ATAATTGAGC 


TAATTTCATT 


ATTTGTTTAA 


TTCTTCTAGT 


TTTTGTGTTT 


GGTCATAAAG 


5820 


CACCAAGGCA 


TCCACAACTT 


CGTCCAATTT 


ACCAGACAAA 


ATCGTATCTA 


GTTTTTGGAG 


5880 


GGTCAAGCCG 


ATACGGTGGT 


CTGTGACACG 


GTTTTGTGGG 


AAGTTATAAG 


TTCGGATCCG 


5940 


TTCTGAACGG 


TCACCAGTAC 


CGATTGTCGA 


CTTACGCTCA 


GCGTCCTGCT 


CATCTTGAGC 


6000 


AATCTGAGCA 


AAGTGGTCAG 


CAACACGGGC 


ACGGATGATT 


TTCATGGCCT 


TCTCACGGTT 


6060 
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CTTCTGCTGG 
ACGAACGGCA 
GTCGACACGA 
AAGAACTGTC 
CACACGGTGG 
AGCAACCACT 
CCAACCTTGG 
CGCTTCGTCT 
ATCCTTTGGA 
ATCTTTGAGT 
CATCTCTTCG 
GGTGTCACGA 
GACATCAGGG 
TTGATCATAG 
TACTTCATTC 
TGCCTGTTCA 
TGTTCGAACA 
ATTCTTATCA 
ATCATAAGCT 
TACAAAATCT 
AATGAATTGT 
TGTAACATCT 
CAAAATAATG 
CTCCATCGTA 
AATACTGACA 
AACCTTCGAA 
CTAACTCGTC 
CATCGACCAA 
TTGTTTCCTC 
CGCCGTCACG 



GTACGTTCTT 
GTCGCAACCT 
AGGTCTTTTG 
GCTGTCGAAG 
GCACCTGATT 
TCTTTAAAAC 
GCTTCCGCAT 
CCACCAGCTG 
AGGAGCAAAA 
TCTTGCTTGG 
GCATCGACGA 
TTGGAAGCTT 
TCACTCAGCA 
ATGTTCATTT 
TCGGATATTT 
TTTTCTGGTT 
TAGTCTAGTA 
ATCATAAAAC 
ATCATCACAA 
CCATTTTTAG 
TTTTGATATT 
CTTAGTTCAA 
TTTACGGCAA 
AACGGGCAGT 
GATGGTCTTG 
CAATTCATTG 
CACAACACGA 
AACACAGTAA 
AATCGCAAgG 
CGTATCCAGA 



CCTGCATCTC 
TATTGACGTT 
GATCAATGTC 
TATGAACACG 
CATACTTAAG 
CACCGACACC 
ACTTTTGGTA 
CTCCACGGAT 
TTTTCAGTTT 
CCAATTCTTC 
TATTTTGAAG 
CTTCTTTTGA 
ATTCTCCTAA 
TTTCTCCTTA 
CCCCAGTTTC 
CATArGACAA 
CTTTATCCAT 
GAAATAGTAA 
AACCTATAAT 
TGTAGACGTA 
CCTTGACATC 
TTGTCATAGT 
ACTGAGATAT 
CCATCCTGTG 
ATTTCGTCAA 
CGAAAGTCAT 
GCTAGGTCGT 
GGTTTTTCTG 
GCAGGGCGTT 
GCCGAGGTCA 



712 
AACCTTGATA 



CTGTCCACCA 
GTATTCAACC 
GCCTTGGCTT 
CTTAGAGTAT 
ATTCATAGAG 
CATAGTTAGC 
TTCAAGGATG 
TTCTTCATAT 
CAAGTCCGCA 
GACTTGTTTA 
AAGCTCCATA 
TTCTTCATAA 
TTTCTCAATT 
TTTAAATCCA 
CCAAAGTTTA 
AATTGGTTTA 
ATAATTTCCA 
TGCATCATTA 
TGCTTCAGCT 
CAAATTTAAA 
TTTGCTCCTT 
AGGTTTCGTT 
TTCGCAACAC 
TCTTGTCTGC 
TTTTCAAGCC 
AAACATGGTG 
GTAGGTCTCG 
TCATGCCAAT 
TAATCACAAC 



TTGGTTGGCA 
GCACCAGAGG 
TCTTCAACTT 
TCTGTCACAG 
ACAGACTGAC 
GCTTCCATGA 
AAATCTCCAG 
ATATTCTTGT 
TCTTCTTTTT 
TCTCCGCCTG 
TACTCACGGT 
AAACGCTTGG 
CGGTCTTCTA 
GTTAAATCAT 
TAACTGAGGT 
TTGCTTAAAC 
AAATATCCTT 
CTACTAATTC 
TCATAAACTG 
AAACTAATTG 
ACATCAAAAT 
GTTAGAGGTT 
ACCACCAATC 
CATGGTCGCC 
TAAAAGCAAG 
AAAAGCCATG 
GCGTTTGAGA 
GATATAGCCA 
TCGACTCGAC 
ACCTTTTCCT 



AGTGAACGAT 
CGTGATAGAT 
CTGGCATAAC 
GAACACGTTG 
CTGAAACCAT 
CTTCAAAGCG 
CGAAAAGTGC 
CATCGTTTGG 
CAGCCTTGGC 
ATTCCTTAAT 
AGGCTATTAC 
TGTCTGAAAC 
CAACTTGTAG 
AGATTGCTAC 
AACAAAATCT 
CTGCTGGCGC 
GATTTTGAAA 
CGATCTTTTT 
CCAATGGAGC 
CGTTGGTTGC 
AATTTTCCAT 
ATCATTGGCG 
TGGATCTC-T 
TTTTTCTTGC 
AGATATTTGG 
ACGGGTATGT 
AACTGGGCTT 
AAGATATCCG 
ACATAGCCAA 
TGCTCCTCGT 
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AGTTATAGGC CACTTTGAGA ATCTCAATCG TTTTACCAGA GTTCATGGTC CCATAACGAT 
AGTACAACTG TGCCATGTTT CTTGCTTCAC GTCCATTTCT AAATTTTTGC TACATTCTAG 
TATATCATAA TTTTCTTAAG CTTTAAACGG CAAAATGTGG TAAAATAGAA GAAATCAAAA 
ACTAGTGGAG GAAGCTATTA TGCCATTTGT ACGCATCGAT TTATTTGAAG GACGCACGCT 
CGAGCAAAAG AAAGCTCTTG CTAAGGAAGT AACGGAAGCA GTTGTCCGCA ACACTGGAGC 
GCCTCAATCT GCTGTCCATG TCATCATCAA CGACATGCCA GAAGGAACTT ACTTCCCACA 
AGGGGAAATG CGTACTAAAT AAGCTAGCTT AAGCAGAATT GCTTAGGCTT TTTCAATCTC 
CAAGTAGCAT TCATTGAAGA AATATCCTAA ATTTGTTACA ATTTGAAAAG AAACTTGGAG 
AATTTCCAAG AAAAGAGCTA TTAATTAAAG GAAACATTAT GATTACACGT GAATTTGATA 
CCATCGCTCC TATCTCTACT CCACTAGGTG AAGGGGCTAT TGGTATTGTC CGCCTGAGCG 

GTAACAGACAG ttttgctatt gggcaaaaga t^taaagg aaaagacttg aacaaggptg 

CCAGCCACAC TCTCAACTAC GGTCACATTA TTGATCCTCT GACTGGTAAA GTCATGGACG 

aggttatggt tggggctatg aagtctccaa agaccttcac tcgtgaggat attatcgaga 

TTAACACCCA CGGTGGGATT GCGGTGACCA ATGAAATTCT CCAGCTAGCT ATTCGTGAAG 
GGGCTCGGTT GGCAGAACCT GGTGAATTTA CCAAACGTGC TTTTTTAAAC GGTCGCGTAG 
ACTTGACACA GGCAGAGGCT GTGATGGATA TCATCCGTGC CAAGACTGAC AAGGCCATGA 
ACATTGCGGT CAAACAATTA GACGGCTCCC TTTCTGACCT CATTAACAAT ACCCGTCAAG 
AAATCCTCAA TACACTTGCC CAAGTTGAGG TCAATATCGA CTATCCTGAG TATGACGATG 
TTGAGGAAGC CACTACTGCT GTTGTCCGAG AGAAGACAAT GGAGTTTGAG CAATTACTAA 
CCAAACTCCT TAGGACAGCA CGTCGTGGTA AAATCCTTCG TGAAGGAATT TCAACGGCTA 
TCATTGGACG TCCCAACGTT GGGAAATCAA GCCTTCTCAA CAACCTCTTG CGTGAGGACA 
AGGCTATCGT AACAGATATC GCTGGGACAA CACGAGATGT CATCGAAGAG TACGTCAACA 
TCAATGGTGT ACCTCTCAAA TTGATTGATA CAGCCGGTAT TCGTGAAACG GATGATATCG 
TTGAACAAAT TGGAGTTGAG CGTTCGAAAA AAGCTCTTAA GGAAGCTGAC CTAGTTCTGC 
TAGTACTAAA CGCTAGTGAA CCACTAACCG CCCAAGATCG CCAACTCCTA GAAATCAGTC 
AGGAGACTAA TCGCATTATT CTTCTTAACA AAACTGACCT GCCTGAAACG ATTGAAACTT 
CGGAACTACC TGAAGATGTC ATCCGCATTT CAGTTCTTAA AAATCAAAAC ATCGATAAAA 
TCGAAGAGAG AATCAACAAC CTCTTCTTTG AAAATGCTGG TTTGGTTGAG CAAGATGCTA 
CCTACTTGTC AAACGCCCGT CACATTTCCT TGATTGAGAA GGCCGTTGAA AGCCTACAAG 
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CTGTTAACCA AGGTCTTGAA CTAGGGATGC CAGTTGACTT GCTTCAAGTT GACTTGACCC 9660 

GTACTTGGGA AATTCTAGGA GAAATCACTG GAGATGCTGC TCCAGATGAA CTCATCACCC 9720 

AACTCTTTAG CCAATTCTGT TTAGGAAAAT AAGAAAAATC CATGATCCTT CATTCGGTCA 9780 

TGGATTTTAG GTTCTATAAT ATTTGTAGTG GGTAAATCCA CTATAGATAT TATGGAGCCT 9840 

ATTTTATTGT AGAAAAAAAG TCCCATATGA CCTATAATGA AAAGCGACAA AACAACTCAT 9900 

TAGAAAGAAT CATATGGAAC AATTACATTT TATCACAAAA TTACTAGACA TTAAAGACCC 9960 

TAATATCCAG ATTTTAGACA TCATCAATAA GGATACACAC AAGGAAATCA TCGCCAAACT 10020 

GGACTACGAC GCCCCATCTT GCCCTGAGTG CGGAAACCAA TTGAAGAAAT ATGACTTTCA 10080 

AAAAACCTTC TAAAATTCCT TATCTTGAAA CGACTGGTAT GCCCACTAGA ATTCTCCTTA 10140 

GAAAGCGTCG ATTCAAGTGC TATCACTGTT CAAAAATGAT GGTCGCTGAA ACTTCTATCG 10200 

TCAAGAAGAA TCACCAAATC CCTCGTATCA TCAACCAAAA GATTGCTCAA AAGTTAATTG 10260 

AAAAGATTTC TATGACTGAT ATTGCCCATC AGCTTTCCAT CTCAACTTCA ACTGTTATTC 10320 

GTAAGCTCAA TGACTTTCAC TTTAAACATG ATTTTTCTTG TCTTCCTGAG ATTATGTCTT 10380 

GGGATGAGTA TGCTTTTACA AAAGGGAAGA T 10411 
(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2393 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY; linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

GTTTTGGGTT CTGGAAATTA TCAGATGGTT GGAAAAGCCG TCCACATCAA GATAGTGTTC 60 

GGAGATTTAA GTTTAAATTG AAGAAACTAA CACAGAGGAA ATGGAGTATA GACCTAACAA 120 

GACGTATTGA GCAACTGAAT TTGTCTATTC GAGGATGGAT AAACTATTGC TCATTGGGAA 180 

ATATGAAAAG TATAGTCGCC AGCATAGATG AGCGCTTGCG TACTCGCCTA CGAGTGATTA 240 

TCTGGAAGCA ATGGAAGAAG AAATCGAGAC GATTATGGGG ATTGCTTAAG TTAGGAGTTC 300 

CTAAATGGAT AGCAGATAAG GTATCTGGCT GGGGCGACCA TTATCAATTA GTAGCTCAGA 360 

AGTCGGTACT TAAACGTGCT ATATCAAAAC CAGTCCTGGA AAAACGTGGA CTGGTTTCGT 420 

GTTTGGATTA TTACCTTGAA CGACATGCGT TAAAAGTTAG TTGAACCGCC GTATGCCAAA 480 

CGGCACGTAC GGTGGTGTGA GAGGGGCTAG AGATTATCCC CTACTCGATT AACTCCCCTG 540 

AAATTTATTT TAATTATGCA AATTTCACGT ATTTTTGATG CTGAGACGAC GATCCTGGGA 600 
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ACTTTTCAGA TATTTTTTTG ACTATCTAAA TCTATCATTA GAAAAGCTTA GAGCGCCAAA 660 

GGATTTGAGC GTTTTTCTGA TTTTTAAGAC TTTTTCCAGT CTCTTTTTCG ATTGAAGATG 720 

TAATTATTCT ACTAACTAAC TAACTTCTTA GTACTAGCCA ACAACGATAA TCATAATTCC 780 

TCCTAAAATT AGGAATAATA AAGGCAATAG TTTTTGTTTT TTCATGTAAA AAACCTCACT 840 

TTTGTTTTCT GCTATTTTAT GCTAAAATAT TAAAAATCAA ATTTAATTCC AAAGTTTGTA 900 

ACTAAAGGGG GAGCGCTACA TGTCTAATTC ATTTGTCAAG TTGTTAGTCT CTCAATTATT 960 

TGCAAATTTA GCAGATATTT TCTTTAGAGT AACAATCATT GCTAACATAT ACATTATTTC 1020 

AAAATCAGTA ATTGCCACAT CACTAGTTCC TATCTTAATA GGAATATCCT CTTTTGTTGC 1080 

GAGTCTTTTA GTTCCGTTGG TTACTAAAAG GTTAGCGCTA AATAGGGTTT TATCTTTATC 1140 

TCAATTTGGA AAGACTATAT TATTGGCGAT ACTGGTAGGA ATGTTTACCG TAATGCAATC 1200 

CGTAGCGCCT TTGGTGACCT ATCTATTTGT TGTTGCAATT TCCATACTAG ATGGTTTTGC 1260 

AGCACCCGTT TCCTATGCTA TTGTGCCACG CTATGCGACC GATTTGGGTA AGGCTAATTC 1320 

AGCCTTATCA ATGACTGGTG AAGCTGTTCA ATTGATAGGT TGGGGATTAG GTGGACTCTT 1380 

GTTTGCAACA ATTGGTCTGT TACCTACCAC GTGTATCAAT TTAGTCTTGT ATATCATTTC 1440 

TAGCTTTCTG ATGTTATTTC TTCCTAACGC TGAAGTGGAG GTGTTAGAGT CAGAAACTAA 1500 

TCTTGAAATT TTGCTCAAAG GTTGGAAGTT AGTTGCTAGA AATCCTAGAT TAAGACTTTT 1560 

TGTATCAGCA AATTTATTGG AAATTTTTTC AAATACGATT TGGGTTTCTT CCATTATACT 1620 

TGTTTTTGTA ACGGAGTTAT TAAATAAAAC GGAAAGTTAC TGGGGATATT CTAATACAGC 1680 

ATACTCTATT GGTATTATAA TTAGTGGCTT AATTGCTTTT AGGCTATCTG AAAAGTTCCT ' 1740 

TGCTGCTAAA TGGGAACCCC AATTATTCAC CCCAAATCTA AAAACCATCC AGAATCCTTG 1800 

CCTTAGCTTA GATCCTGGAT GGTTTCTTTT TTCACCCAAT GGGTGTTTTT TACTAGACAA I860 

AAAAGAGTTT CCCCTTTATG GTATAAGTGT AGAAAAAAAC ACAAAAAGAA AGGAAACTCA 1920 

CATGAACAGT TTACCAAATC ATCACTTCCA AAACAAGTCT TTTTACCAAC TATCTTTCGA 1980 

TGGAGGTCAT TTAACCCAGT ATGGTGGTCT TATCTTTTTT CAGGAACTTT TTTCCCAGTT 2040 

GAAACTAAAA GAGCGGATTT CTAAGTATTT AGTAACGAAT GACCAACGCC GCTACTGTCG 2100 

TTATTCGGAT TCAGATATCC TTGTCCAGTT CCTCTTTCAA CTGTTAACAG GTTATGGAAC 2160 

GGACTATGCT TGTAAAGAAT TGTCAGCTGA TGCCTACTTT CCAAAATTGT TGGAAGGAGG 2220 

GCAGCTTGCT TCACAGCCAA CCTTATCCCG TTTTCTTTCC AGAACTGACG AGGAAACAGT 2280 

CCATAGTTTG CGATGCCTCA ACCTTGAATT GGTCGAATTC TTTTTACAGT TTCACCAGCT 2340 
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AAACCAACTC ATTGTAGATA ACGATTCTAC CCATTTCACA ACTTATGGCA AGC 
(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4762 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

TTTGTATCTT TTTAGGTCTC TTTCAATCCA AACCCTTTAA ACTATACGTC ATTTCGGTTC 60 

CTGCAAGTCT TGTGGTAATT TTAGGTTTGA TTTTACTTTT CTTTTCACAA GAGCCTCTGC 120 

ACGCTTCTTA TTTGATGGTC GTCTTCCCTG TTTTCCTACT TTTATTGGTA ACCAATATTA 180 

AGAGTCAACA GAGGGGGCGT AGTGCTAGAA GAAGCCGAAG AGAAACGCCA TTATGCCTAT 240 

GGAGTCGTTT CTTCAAAGGA AATCTATATC TGCTAGTTTT TGGGTTTGTC TATCTTTTGT 300 

CTGTTCCTTT TTTGATGAAG TTTGTCCTTT ATCCAGTACC TTATCAAGAA CGTAATCGTC 3 60 

TTGCTGATTT GGTAAAAGAG GAGACAAATA CGGAAGATGC TATCTCATGC ATGGGATGAT 420 

ACTGCGACTC TTTATCGTAA GAGTGAGCGC TTGTCCCATC GGCGATTTTG TCCCCGTTGC 480 

AC TAT AC AGC AACTGAGGAA AATCGTAATA AGTTACTTAA TGACTTGAAA GAAAAACAAC 540 

CTAAGGTGAT TGTGGTAAAT GATAAGGTGG TAGTCTGGTC TGAAGTGGAA ACACTCTTAA 600 

AAGAAAATTA CCAACAAGTA AAGACTGATT ACTCAGAGTT TAAAGTCTAT AAAATTAAAT 660 

AACCAAATCA ATATCTTGTG TATTTTTAAA AATTTTAGGA TTTTTAACAC AAGATATTGA 720 

TTTTTCTTTT TAGAGTGGTA TAATACTTTT TAGAAAGAAC ATTTTAGAAA AGAGCATGCA 780 

TATGATTGCA CTAGAAGAAA AAATTACAAT TTTGCCAACT CTCTTCGTCG AGAAACGAGA 840 

TGGGAGACGT GTTGTATTTG ATGTGGACAA GATTGACAAG GCTCTCCACA AGGCGGCTGA 900 

CAAGGTTATG GATGTGACAC CCCTGGTTGA AAAATGCCTC AATGATCTGA CTGAGCGAAT 960 

TATTACAGAA ATTCATAGTC GCTTTCCACA GGGAATTAAG ATTTACGAAA TTCAAAATAT 1020 

CGTAGAACAT GAACTCCTTG AAGCCAAAGA ATATGCGCTG GCTGAGGAGT ATATTACTTA 1080 

TCGGACACAG AGGGATTTTG AGCGCTCAAA AGCGACGGAT ATCAACTTTA GTATTCATAA 1140 

ACTTCTCAAC AAAGACCAGA CAGTTGTCAA TGAAAACGCT AATAAAGACA GTGATGTCTT 1200 

TAACACTCAG CGTGATTTGA CAGCAGGGAT TGTTGGGAAA TCAATCGGAC TGCAAATGCT 1260 

TCCTAAGCAC GTAGCCAATG CCCACCAAAA GGGGGATATC CACTATCACG ATTTGGACTA 1320 

CAGTCCCTAT ACCCCTATGA CCAACTGCTG TTTGATTGAT TTTAAGGGTA TGTTGGAAAA 1380 
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TGGTTTTAAG 
ACAGATTTCT 
CCGTATCGAT 
TGCAGAAGAG 
GGACATCTAC 
. TGGACAAACA 
AATTCAAAAA 
TATCTTTCCT 
CAACTATGAC 
CTTGTCTTAT 
TTCTTTCCTT 
TCTGGGTGTT 
TAAGTTCTGG 
TGTCGAACGC 
TTTTGGCCAT 
GACCGTTTCG 
CTGGGAAAGT 
CCGTGTAGAA 
CGAAAGTCTG 
TATCACAGAC 
ACCGTTTGAA 
CATCCATTAT 
GGATTATGCT 
CAAGTGTGAC 
TGGCAATAGC 
TCCTCAAGCA 
TATGAATGGT 
ATGGGAAAAT 
CACTCTAAAG 



ATTGGAAATG 
CAAATCATTG 
GAAATTTTGG 
TGGGTATTGC 
GATGCCATGC 
CCTTTTACTT 
GCTATTTTAA 
AAACTTATCT 
ATCAAGCAGT 
GATAAGATTG 
CAAGGGTGGA 
GTGACGGTTA 
GAAATCTTCA 
ACTAAAGAGG 
CGTCTAGGTA 
CTGGGCTATA 
AATCCAGATG 
GAGTGGTCAG 
ACAGACCGTT 
AAGGAATACT 
AAATTGGACT 
TGTGAGTATC 
TATGACCGTG 
TTTGAAGGGG 
GACCCTAAAA 
AGACCGATGG 
TCAACGATTA 
ATCAACTAGA 
GTGGAGCTGG 



CAGAGGTAGA 
CCAACGTTGC 
CGCCTTATGC 
CTGAAAAACA 
AATCTCTTGA 
CGTTAGGTTT 
ACATTCGCAT 
TTACGCTTAA 
TGGCTCTAGA 
TTGATTTGAC 
AGGATGAAAA 
ATCTGCCTCG 
ACGAGCGAAT 
CGACACCAGC 
AAGAAGAAAG 
TCGGCTTGTA 
CTAAGGAATT 
ACCAATATGG 
TCTGCCGACT 
ACACCAACTC 
TTGAGAAAGT 
CAGTCCTTCA 
TAGGCTATCT 
ATTTTGAACC 
CAGTAGATGT 
TCAACGGGCG 
AAATAGCTGG 
CGATAAGGGG 
TAAGAAAGAA 
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GAGTCCCAAG 
TTCTAGCCAG 
AGAGAAGAAT 
GGAAGATTAC 
GTATGAAATC 
TGGTCTGGGA 
CAAGGGTCTT 
AAGAGGCCTC 
GTGTGCAACC 
AGGTTCTTTC 
TGGTGTAGAA 
TATTGCTCTT 
GAATATCGCA 
GAATGCTCCT 
TGTTGACCAG 
TGAAGTAGCG 
CACGCTAGAC 
CTACCATTTC 
AGATATAGAC 
TTTCCACTAC 
CTATCCGGAA 
GCAAAATCCA 
AGGCACCAAT 
AACTGAGAGA 
GGTGAAACGA 
TCACAAGGAA 
GCATCAAGTA 
CGCGCACAAG 
CGCTTGCTTA 



TCTATCCAGA 
TACGGTGGCT 
TATCAAAAAC 
GCTTGGAAGA 
AATACTCTCT 
ACCAGTCGTT 
GGTTCAGAAC 
AACTTAGAGG 
AAGCGGATGT 
AAGGTGCCTA 
GTCAATTCAG 
GAGTCTGAAG 
GAAGATGCTC 
ATTCTTTATC 
CTCTTTAAGA 
ACAGTTTTCT 
ATCATTCACG 
TCTATCTACT 
AAGTTTGGCT 
GATGTTCGTA 
GCAGGTGCGT 
AAGGCCTTGG 
ACTCCGATTG 
GGGTTTGCTT 
ACTTGTGGCT 
ATCGCTGCGC 
ACAAATTAGA 
TGACCCGTTA 
GCTTCAGAGA 



CTGCGACAGC 
GTTCAGCTGA 
ATCTCAAAGA 
AAGCGCAAAA 
TCACTTCAAA 
TTGAACGAGA 
ACCGTACGGC 
AAGGAACTCC 
ATCCAGACGT 
TGGGCTGCCG 
GTCGCATGAA 
GTGATATGAA 
TTGTTTACCG 
AGTACGGTGC 
ATCGTCGTGC 
TTGGTAACAG 
ATATGAAACG 
CAACACCATC 
CTATTCCTGA 
AAAATCCAAC 
CAGGTGGTTT 
AAGCTGTCTG 
ACCGTTGCTA 
GTCCAAACTG 
ACCTAGGTAA 
GTGTCAAACA 
AAGAAATGAA 
TCACGAGAAA 
ACAATTTTTA 



1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 



WO 98/18931 



PCT/US97/19588 



718 



AACAAGAACA 


AGAAAAAATA AAAGTGAGAG CCAGCTCTCG 


CTTTTCTCAT 


AGTGGGAGGT 


3180 


AAGGATGGAA 


TTACGCAGAC CAAGATTAGC GGATAAGAAA 


GCTGTTTTAG 


ATATGATGAC 


3240 


AGAGTTTGAA 


AAATTTCAGT CGCCTCACGA CGGCGGTTTC TGGGATACAG AGAACTTTGT 


3300 


GTATGAAGAC 


TGGTTAGAAA GCAATCAGGA ACAGGAAATG 


GGGATTAATC 


TGCCTGAAGG 


3360 


ATGGGTTTCT 


GCAATTCAGT TAGTGGCTTT TTCTGAGAAA 


GGTCAAGCAG 


TTGGATTTCT 


3420 


TAATCTCCGG 


TTGCGCCTCA GTAACTTTCT ACTAGAAGAA 


GGTGGCCACA 


TTGGCTACTC 


3480 


CATTCGTCCA 


TCTGAAAGAG GCAAGGGTTA TGCAAAAGAG 


ACTCTCCGTC 


AGGGCTTGCA 


3540 


AGTTGCTAAG 


GAAAAGAACA TCAAGAAAGC TCTGGTGACC 


TGTAGTGTGA 


ATAATCCTGC 


3600 


TAGCAGAGCA 


GTCATTCTAG CAAATGGTGG AATATTTGAG 


GATGCTCGCA 


ATGGAGTCGA 


3660 


GCGTTATTGG 


ATAGAGGTAG CGAATGAATA ATCCAAAACC 


ACAAGAATGG 


AAAAGCGAGG 


3720 


AACTTAGTCA 


AGGTCGTATC ATTGACTACA AGGCCTTTAA 


CTTTGTGGAC 


GGCGAAGGCG 


3780 


TGCGCAACTC 


TCTCTATGTA TCAGGCTGTA TGTTTCACTG 


CGAGGGATGT 


TATAATGTTG 


3840 


CGACTTGGTC 


TTTTAATGCT GGCATTCCCT ATACAGCAGA 


ATTAGAAGAG 


CAGATTATGG 


3900 


CAGACCTTGC 


CCAACCCTAT GTTCAAGGCT TGACTTTGCT 


GGGAGGGGAG 


CCTTTTCTCA 


3960 


ATACTGGGAT 


TCTCTTGCCA CTTGTTAAGC GGATTCGGAA 


GGAATTGCCA 


GACAAGGACA 


4020 


TCTGGTCCTG 


GACCGGCTAC ACTTGGGAAG AAATGATGTT 


GGAAACTCCA 


GATAAACTGG 


4080 


AATTCTTGTC 


ACTGATTGAC ATTCTTGTCG ATGGAAGATA 


TGATCGAACT 


AAGAGAAATC 


4140 


TTATGCTCCA 


GTTTCGAGGT TCATCTAACC AACGAATTAT 


CGATGTGCAA 


AAATCGCTCA 


4200 


AAAGTGGGCA AGTAGTGATT TGGGACAAGC TCAATGACGG AAAAGAAAGC TATGAACAGG 


4260 


TGAAGAGAGA ATGAAGAAAA AGGACTTAGT AGACCAACTA GTCTCAGAGA TCGAGACGGG 


4320 


GAAAGTCAGG 


ACACTGGGAA TATACGGTCA TGGAGCTTCA 


GGTAAATCAA 


CCTTTGCACA 


4380 


GGAATTGTAC 


CAAGCTTTAG ATTCTACTAC AGTAAATTTG 


CTAGAGACAG 


ATCCTTATAT 


4440 


CACCTCAGGA 


CGCCATCTGG TAGTACCCAA GGACGCGCCG 


AATCAAAAGG 


TGACAGCCAG 


4500 


TCTGCCAGTG 


GCGCATGAAC TGGAGAGTTT GCAGAGAGAT 


ATCCTTGCTT 


GCAGGCGGGT 


4560 


ATGGATGTCT 


TGACAATTGA AGAACCTTGG AAGGCTAGTG 


AGGTCTTGTC 


TGGAGCCAAA 


4620 


CCAATTTTGA 


TTGTCGAAGG GATGTCTGTT GGCTTTCTAC 


CCAAGGAACT 


CTTTGAAAAA 


4680 


ACCATCTGTT 


TCTACACGGA TGAGGAGACC GAATTAAAGC 


GACGCCTTGC 


TAGAGATACG 


4740 


ACTGTGAGAA 


ATCGCGATGC GG 






4762 


(2) INFORMATION FOR SEQ ID NO: 92: 









(i) SEQUENCE CHARACTERISTICS: 
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A LENGTH : 3832 base pairs 
B TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO- 92- 
GATGCAGGTT ^GACCCACA TATTCCAGAA AATTACTTTA AAGATGATGA 
■ CTACC^TC TTTGTTGGTC TTCATCTGCA GCCC^ TCAGTAATTG 
GCGGTCTATC AGGAGACGCC TTTTGATTGG AGAAAGATAG AAGATGATGC 
GGGTATTTAT AAGAGGAATT ATGACATATT TAGACGCTTT TAAATCAGGT 
TACCGAGTGC CCTGCTCTTG CATTTTAAGG AACTC^C T^AGCGAC 
TTTGGCAATT TTTCTATTTG CAAAATACGA CAGGCTTAGA AGAAATGTCG 
TTGCTGAAAG 6AWGGCAAG GAAATTTCGG ATGTCAACCA GTCCATTTCT 
AAAGGGGACT GCTCCAGTAT CGTACTATCG AATTAAATGG CGAAATTGAA 
ATGCTAGTTT GGCCTTGGAA CGTTTGGATG ACC^TT^G AGCAGTTCAT 
ACCAGCTAAC ACCTCAAAAC CAGCTCAAGG ATTTGGTGGA AACCTTCCAG 
GACGATTGTT GACGCCTTTT GAGATTGAGG ATTTGACCAA GACACTAAAG 
CCAGTGCTGA C^AAG GAGGCTCTTC GTGAAGCTGT TTTGAATGGA 
GGAAGTACAT TCAGGCGATT TTGAGAAACT GGCGCCATGA AGGAATCAAG 
AAATTGAGGC CAAGAGAGCA GAAAGAGAAG CAAGCAATCC TCAGTTGACA 
CAGATTTCAT AAATGCCATG GATCTCTGGA AGGATTAATC CATGCAAGTA 
CCGAGTAAGA TTTGCAAGCT GTGTATAATT GTGATAGAAT AAATAGAAAA 
AAAGAGGTAT GTGAAATGTC ACGTAAACCA TTTATCGCTG GTAACTGGAA 
AATCCAGAAG AAGCTAAAGC ATTCGTTGAA GCAGTTGCAT CAAAACTTCC 
CTTGTTGAAG CAGGTATCGC TGCTCCAGCT CTTGATTTGA CAACTGTTCT 
AAAGGCTCAA ACCTTAAAGT TGCTGCTCAA AACTGCTACT TTGAAAATGC 
ACTGGTGAAA CTAGCCCACA AGTTTTGAAA GAAATCGGTA CTGACTACGT 
CACTCAGAAC GCCGTGACTA CTTCCATGAA ACTGATGAAG ATATCAACAA 
GCAAT^ CGAACGGTAT GC^CCAATC AWWnW0 GTGAATCACT 
GAAGCTGGTA AAGCTGCTGA ATTCGTAGGT GCTCAAGTAT CTGCTGCATT 
ACTGCTGAAC AAGTTGCTGC CTCAGTTATC GC^GC CAATCTGGGC 



TGTTAATCAG 
GGTAGACCAT 
ATCTGCATAT 
ACCTTGGTTT 
GATTTTCTGG 
CCAAGCCAGA 
AATCTGACGG 
TTGCTCTTTG 
TCAAGTTCAG 
CAGGAGTTGG 
GAAGATGGAA 
AAACCAAACT 
AGTGTGGCTC 
CAGGTATCTG 
GGCTTGAAAT 
TAAATTGAAA 
AATGAACAAA 
TTCATCAGAT 
TGCTGTTGCA 
AGGTGCTTTC 
TGTTATCGGT 
AAAAGCAAAA 
TGAAACTTAC 
GGCTGGATTG 
TATCGGTACT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
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GGTAAATCAG CTTCACAAGA CGATGCACAA AAAATGTGTA AAGTTGTTCG TGACGTTGTA 1560 

GCTGCTGACT TTGGTCAAGA AGTCGCAGAC AAAGTTCGTG TTCAATACGG TGGTTCTGTT 1620 

AAACCTGAAA ATGTTGCTTC ATACATGGCT TGCCCAGACG TTGACGGTGC CCTTGTAGGT 1680 

GGTGCGTCAC TTGAAGCTGA AAGCTTCTTG GCTTTGCTTG ACTTTGTAAA ATAATCAGTA 1740 

AGTAGCAAAA GCTAGGTGGA ACAGCATTCA GATGTCTGTT ACATTTTTTA TAGGAGAGAA 1800 

AGATTGAAAA CAAAAATTGG ATTAGCAAGT ATCTGTTTAC TAGGCTTGGC AACTAGTCAT 1860 

GTCGCTGCAA ATGAAACTGA AGTAGCAAAA ACTTCGCAGG ATACAACGAC AGCTTCAAGT 1920 

AGTTCAGAGC AAAATCAGTC TTCTAATAAA ACGCAAACGA GCGCAGAAGT ACAGACTAAT 1980 

GCTGCTGCCC ACTGGGATGG GGATTATTAT GTAAAGGATG ATGGTTCTAA AGCTCAAAGT 2040 

GAATGGATTT TTGACAACTA CTATAAGGCT TGGTTTTATA TTAATTCAGA TGGTCGTTAC 2100 

TCGCAGAATG AATGGCATGG AAATTACTAC CTGAAATCAG GTGGATATAT GGCCCAAAAC 2160 

GAGTGGATCT ATGACAGTAA TTACAAGAGT TGGTTTTATC TCAAGTCAGA TGGGGCTTAT 2220 

GCTCATCAAG AATGGCAATT GATTGGAAAT AAGTGGTACT ACTTCAAGAA GTGGGGTTAC 2280 

ATGGCTAAAA GCCAATGGCA AGGAAGTTAT TTCTTGAATG GTCAAGGAGC TATGATGCAA 2340 

AATGAATGGC TCTATGATCC AGCCTATTCT GCTTATTTTT ATCTAAAATC CGATGGAACT 2400 

TATGCTAACC AAGAGTGGCA AAAAGTGGGC GGCAAATGGT ACTATTTCAA GAAGTGGGGC 2460 

TATATGGCTC GGAATGAGTG GCAAGGCAAC TACTATTTGA CTGGAAGTGG TGCCATGGCG 2520 

ACTGACGAAG TGATTATGGA TGGTACTCGC TATATCTTTG CGGCCTCTGG TGAGCTCAAA 2580 

GAAAAAAAAG ATTTGAATGT CGGCTGGGTT CACAGAGATG GTAAGCGCTA TTTCTTTAAT 2640 

AATAGAGAAG AACAAGTGGG AACCGAACAT GCTAAGAAAG TCATTGATAT TAGTGAGCAC 2700 

AATGGTCGTA TCAATGATTG GAAAAAGGTT ATTGATGAGA ACGAAGTGGA TGGTGTCATT 2760 

GTTCGTCTAG GTTATAGCGG TAAAGAAGAC AAGGAATTGG CGCATAACAT TAAGGAGTTA 2820 

AACCGTCTGG GAATTCCTTA TGGTGTCTAT CTCTATACCT ATGCTGAAAA TGAGACCGAT 2880 

GCTGAGAGTG ACGCTAAACA GACCATTGAA CTTATAAAGA AATACAATAT GAACCTGTCT 2940 

TACCCTATCT ATTATGATGT TGAGAATTGG GAATATGTAA ATAAGAGCAA GAGAGCTCCA 3000 

AGTGATACAG GCACTTGGGT TAAAATCATC AACAAGTACA TGGACACGAT GAAGCAGGCG 3060 

GGTTATCAAA ATGTGTATGT CTATAGCTAT CGTAGTTTAT TACAGACGCG TTTAAAACAC 3120 

CCAGATATTT TAAAACATGT AAACTGGGTA GCGGCCTATA CGAATGCTTT AGAATGGGAA 3180 

AACCCTCATT ATTCAGGAAA AAAAGGTTGG CAATATACCT CTTCTGAATA CATGAAAGGA 3240 

ATCCAAGGGC GCGTAGATGT CAGCGTTTGG TATTAAGCGA TGATTTGAAA GAGGGATGTG 3300 
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ATAGTAGCAC CCTCTTTTTC TTTGTTTTAT GATAGTTCAT CCTCGAGTAA ATTCAAGTTC 3360 

TTGCTCGGAA ATGAAGCTTA TATAGTAGAT TGAATATAGA CAAATACCTT GTGATTGGTA 3420 

AAACATTTTA GAAATTCATT TACCTTTCCT AATCGACTTG GTTTCATCTT ATTTCAATCT 3480 

ATTATAGTAT TGGGGAATTT CTTCAAACCA CATCAGCTTG GTCAGTTCTA CCTGCGACCT 3540 

CAAAACTTGT GCTTTGGTCA AGCTGGGTTT AGTTTCCTAG TTTGCTGATG GATTTCCATT 3600 

. GACTATAAGC ATCCAACCCT CTTTTTGTCT TCTAAAGAAT TCTTAAATTA TCAGTCTATT 3660 

GCAACTTTTC TCATATAAGT TCTTTGTCTT GCTATTGGTT TTCCTTAGTA GTATACTAAG 3720 

GTAGTAATCA TTAAGAAGTG GTTACAAAAA ATAATGAATG AGGTAAAGAA AATGGTAGAA 3780 

TTGAAAAAAG AAGCAGTAAA AGACGTAACA TCATTGACAA AAGCAGCGCC GG 3832 
(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10690 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

TGAAAAAATC CTCATGAACC TGGCGCCAAT AGACAAGTGT CTTGTTTCCC TCACCTTCCT 60 

TATAGGCATG GTCAGCTGAC ACTCGATTGA AGGGTTTAAC AGAAACCTTT GTAATTTCGA 120 

CAATGCAGAC AGCCTGATTT TGACTATCTA AAATGACATC GAAGGTCCCT ACTTGGGGAA 180 

GTGGTTCGTC TTCTAGCACA TAGAGGTCAT AGGCTGATGC TGTTGCTGTC TTTTCTCCTT 240 

TAAACACCAA ATCCGCTAAA AGGTCTGGTT CAACTCCAAA AGCCCAGGCA TCGATTTCAT 300 

CTCCGATCAA AGGATTGATT TGCTTGTATT TATTCCACAT TTCTTGCGGT ATCATGGGTG 360 

CTCCTTTGTA ATTTTTTACT TTCTTCTTTT ATGTGTTTAA GATGATCTGG ATGGTCAATC 420 

TCTAAATCAA AAATCTCTGG AATAGAACTG TAGTGGATAA TGCACTTGAT ACCCAACTGA 480 

TTCATTTTTT GTATGAAAGA AGTATTCAGA TAGCCTGCTA CAGCAAAATC AATCTTGTTC 540 

TTTCTTGCTT TATCCTGCAT ATCTCTTAGC ATATCTAACA TTATTGGACT TTCCATATCA 600 

TGCCATTGAC TGTTTCTCAT AGTCGCAAAA ACAAAGGAAG TCAAATCATT CATTCCAACT 660 
ACAATCTTTG AAATGCCCGT TTCCAGTATA CTAGATAAGT CAAAATACGC TGACGGTAAT • 720 

TCAATCATCG TTCCGACTTT CCCAGTAAAA CCCTGCTGAC GCAATACTGT AATAGCTTGT 780 

TTTAATTGGT CGG CATC ATT GACAAAAGGA AAGATAACAG ATAGATTGGG GTTGGTTTGA 840 
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TAAACTTCTG 


TAACGACATG 


TGCTTCAGCC 


TGAAATTCAT 


CCAAACACGC 


CAGTAAACGC 


900 


CTAGTTCCTC 


TATAGCCAAA 


CAAGGGATGC 


CCTTCGTCAA 


AAAACTCTTT 


AGTCCCCACT 


960 


AAACAATTGG 


CTTCTGTATT 


CGTTAATTCA 


GTAAAACGAT 


ACCAAACTTC 


CTTACCTAAG 


1020 


TAAAAGGAGC 


AAATAGTATC 


AAGATAATCT 


TTCACAAATT 


CCTGACAACT 


TTGTAATAGT 


1080 


ATATTTTGAT 


TGAGCTCTCT 


CAATAAGTAT 


TCCCCACGAA 


TCATGCCGAC 


GTGGTGAAAT 


1140 


AGTTGAGGAT 


AAATTTTTTC 


AAGAATTTTT 


TCGCCACTAA 


GGGCAAGTTG 


ATTTCTCATC 


1200 


ATTCACCTTC 


CAATTCATGT 


AAGAAGTCTT 


GTCCAGTTCT 


GGAAATCCTA 


ATAATTCAGA 


1260 


CTTAACCTTC 


AAGACTAATG 


GCGATGCATT 


TTCTTCTGTA 


ATCTCTTGAA 


TATCCATCCA 


1320 


AATATATCCA 


AGTGAATCAT 


TCGCACCATC 


AGACACAGCT 


TCCGAAATCG 


TAACTTGAGG 


1380 


TGCACTCTCA 


TTCATTTCAA 


CATCATACAA 


GGCTATGACA 


TGGTGAACCA 


TAAAATTTTT 


1440 


TAACTCTTCC 


CTGACGAAAA 


CATCGTAGAT 


TCGAGGATTA GAGTAGCTTC 


TAACAGTAAA 


1500 


TCCCGTCTCT 


TCCATAACTT 


CTCTAGTCAG 


CGTTTCCGTC 


AGTCCTTCAC 


CAAGTTGCTG 


1560 


ACTGCCTCCA 


GGTAGATCAT 


ACCGATGTTG 


ATAAGGGCCT 


CTCGTTTTTT 


CAATGCAAAG 


1620 


TAACTTTCCA 


TTTTCAAAGC 


AAACACAGTA 


GACCCCAAAG 


TGATTTTTGA 


TTTCCATCCA 


1680 


ACTCCTCCTA 


CTTCAAAGAC 


CAGCCACCAT 


CTATTGTCAA 


GATTTGTCCT 


TGCATGGCGC 


1740 


TCGCTTTTCC 


ACTTGCTAAA 


AAAAGACTAA 


GCTCTGCTAT 


TTCCTCTGGC 


TCAATCCAGC 


1800 


GCTTGATTGG 


GGTTTCACTA 


GCCACCCAGT 


CAGCCAAACC 


ACCTGGTTCA 


AAATCCGCAG 


1860 


CGGTCATAGC 


TGTCTTGACT 


GCTCCTGGAG 


CGATACCAAA 


GACCTGAATC 


CCAGCTTCAG 


1920 


CATAGTCTAG 


AGCCAACTGC 


TTGGTGAAGC 


CAGCCAAGGC 


ATGCTTGGAT 


GAAGTATAGG 


1980 


CGTGACCACC 


TCCACCTGCT 


AGGCTAGAAG 


CAATGGAACA 


CATATTGATG 


ATGATTCCCT 


2040 


TTTTATTTTC 


CAGCATTTGT 


GTCAAATAAT 


ACCGAGTCAA 


CTCTACTGGA 


ATAATGTAGT 


2100 


TGATTTCAAA 


AATCTCTTGA ATGTCCTGCG 


CCGTTTGTTC 


CAACAGTGGT 


TTGTAATCAT 


2160 


CCAAAACTCC 


AGCAGTATTA 


CACAAAACAT 


CCACCTGAGG 


GCACCAGTCA 


AAAATAGGTT 


2220 


CCAAGTCCAA 


GGTCAAATCT 


CTCTGTAAAA AGCGAAAATC 


ACCCTCTAAG 


AGTGGCTTTT 


2280 


CACCTTGGTC 


AACTCCATAA ACTTGATAGC 


CCTTCTCTAA 


AAAGAGGCGA 


GCTTGAGCCA 


2340 


ATCCGATCCC 


TGAACTCACT 


CCTGTAATGA 


GTACACGTTT 


AGTCATGCAC 


TTCTACCCAA 


2400 


TCCGTTGCCA 


AAACATCACA 


AACTGTCGGG 


CTCCACATGG 


AAAAACCTTC 


TCCTTCGCCA 


2460 


GAAACGTTGA 


TTAGGAAATA 


AGGTGTCATT 


TCAAGTGCAA 


GCCCATTTTG 


CTCGATGGTA 


2520 


TCAAAGAGTT 


GGACATAGTT 


TTCCGCACCT 


CCCCAACCAG 


TTCGTACATA 


TTTTCTCTTA 


2580 


GCCTTTAACC 


CAGGCAGGAT 


CTCTTCAAAT 


GTCATGTTTT 


TCTCCTTTAA 


TTCTACATTC 


2640 
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TTCATTTAAT TATAGCAAAA AACCGCTTTA TACGGCTTTT 


TGAATGTGAG 


! TTATTCAAAC 


2700 


CTGCTACTAC TTACGGCAAA TTATTCCCTG CAGCAAGATA 


AATTTCATAC 


CATTCTTTTC 


2760 


TTGTTAAGCT AAAGTTTGCC GCTCGGCTAA CTTCTCTCAA 


GTGCTTAGGA 


TTTGTTGTAC 


2820 


CTACGACTGC CTGCATTTTT GCTGGATAAC GCAATATCCA 


AGAAATGGCA 


ATAGTTGAAG 


2880 


AGGTTACTCC ATATTTAATA GCTAAACGAT CAAGTACTTG 


ATTTAAAGCT 


TGAAATTTCT 


2940 


CATTTCCAAC AAAATTCCCT TTAAAATACC CGAATTGTAA 


GACAGACCAT 


GCTTGAATGA 


3000 


CCACATCGTG TAATTGGCAA TATTCAAAAA TGCTGCCATC 


TCGCATAGCT 


GCTTGACTAT 


3060 


CTTCCATATT AACATGAAAA GCTGATTCAA ATCCTGGAGT 


AAAAGCCGCA 


CTCAATTGTA 


3120 


GCTGATTAAC AGCTAACGGC TGCTTGACAT CTTTTTTAAG 


CAACTCCATC 


ATCATAGGAT 


3180 


TTTGATTAGA AACTCCAAAA TCTCGAACTT TACCTTGTTT ATAAAGGAGA TTAAAGGCTT 


3240 


CTGCTACTTG GTCAGATTCC ATCAAAGCAT CTGGTCGATG 


AAGGAGCAAG 


CTATCTAGAT 


3300 


GATCAATCTT CAATCTTTGC AAAATACCGT CTACTGATTT 


TATAATATAG 


TCCTTAGAAA 


3360 


AATCAAAATA GGTAAATTCT TCAATGCGAA TGCCACATTT 


GGACTGAATC 


CACATCTTTT 


3420 


CTCTTAAATC TGGACGATTT TTTAGGACAA GACCTAACAG 


TTCTTCACAA 


CGACCACGAC 


3480 


CATAAATATC AGCCAAGTCG AAGGCATTGA TTCCAACAGA 


AAGTGCTGTT 


TCTACAAGCT 


3540 


CTTCAACTTC TTTTACAGAT TTATCTTTTA TTCTCATCAT 


TCCGAGAACA 


ATTTCTGATA 


3600 


ATTCTTTGTC ATCTTGACCA AGAGTTATGT ATCTCATCAA 


ATTTTTCTCC 


TTTAATTTCT 


3660 


AACATTCTTC CCTTCATTAT AACAAAAAAC CGCTTTGCAA 


CGACTTTTTG 


ACTATACTTC 


3720 


ACTCCATTTT ATCTTCTTAA ACCCACGGAA CAAGACAAAG 


ATTCCAATAA 


AGAGGACAGC 


3780 


TAAAGGAATA ACTTTTGTAA GGAAAACATT TGAAATTCCC 


ATCCACTCAT 


AATAACGGAG 


3840 


CAGAGAACCC ACCACAAGAT GGGCAATAAT CATACTGACA 


AATGGACGAA 


AGACCGCTTC 


3900 


TTTCCAATTC CAAATACCGA TAACTAGCGA AATCGTAAAG 


ACAGACAAAC 


TATCCCAGGG 


3960 


AGCCGGAATA TAAAAGGCTC CTTCTTGTAT GAAGCTTGCC ATTCCTACAT 


ATCCTAAAAC 


4020 


AACTAGAAGA ACTATAGTCC CAACAACAAT GTAAGTGCCA ATTTTCATTT 


TAGGAGAATC 


4080 


TTGGACTAAA CTTCTTCGTA AAATTGTGGC CACAAGTCCA 


AATCCAATCA 


GAAAAATAAG 


4140 


AAGTTGCCCT AAAAATGTGA GCAAATTGAC TGTTAAGAGA 


GGACCTTTAG 


AAAAATCACT 


4200 


TAGTAGTTGA TAATAACGTA ATACCGCCAG GACAAGAATT 


GGCGTCAAAA 


GGGACTCTTT 


4260. 


GATAGAACTG CGAGGTGCTC CCTTGAGAAT CTCTTTCATT 


ATTTTTTTAG 


GATTCTTACC 


4320 


TAGATAATCC TCTGCACTCA TGCCATCTCG TTCTGCTTCT 


GAGAAATCTA 


GCATCATCAA 


4380 



WO 98/18931 



PCT/US97/19588 



ATAGATCTGC TCTCTGAGAT AGTCTTCATC ATAgIgAAAT CCAGCAAGAT TAAAACPTTC 
CCACAACTCC TCAAAATACT TTTGATTCTC CTCAGAAAAC TCATGTAGCA AAGCGCTTGT 

rrcrrcGTAA tacttcattt tcttcatggt ttaaccccca ttcttaatcc cttctac™ 

TTGACTCAAA TCGTCCCATT GTTGCCAAAA GACTGAGACA CGCTCTTCTC CTTCTTTCAT 
TAATGAAAAA tacttccgat CTGGACCATC tggcgacggg CGCATGTCGC CTCTTATCCA 
TTGATTTTTT TCTAACTTTT GCAACAAAGG ATAAATAGTT CCTGGAACGA TAGTATCAAA 
" TCCAGCCTCT CGCAAAGTCT GAACCAACTC ATAACCATAC CGCTCTTTTT GACCAATCAT 
ATCCAAGACA CAACCTTCAA GAACACCTTT TAATAGCTGA GTTTCTTTCA TCACTTCTCC 
CTTCTAATCT A ™ TAAT ACCTACTAGT GACTTCACCT ATAGTATATC ACTTCTACAC 
TAGTTTGTAA AGCATAATAG TTAATACTCT TCGAAAATCT CTTCAAACCA CGTCAGCGTC 
GCCCTACCGT ATGTATGGTT ACTGACTTCG TCAGTTTCAT CTACAACCTC AAAAACATGT 
TTTGAGCTGA CTTCGTCAGT TTCATCTACA ACCTCAAAAC AGTGTTTTGA GCTGACTTCG 
TCAGTTTCAT CTACAACCTC AAAACAGTGT TTTGAGCTGA CTTCGTCAGT TTCATCTACA 
ACCTCAAAAA CATGTTTTGA GCTGACTTCG TCAGTTTCGT CTACAACCTC AAAACAGTGT 
TTTGAGCAAC CTGCGGCTAG CTTCCTAGTT TGCTCTTTGA TTTTCATTGA GTATAAATAA 
AAAAACAGAA CTAGCCTGAA CTAGTCCTGT CTACTTTTAC CCAATCACAC TTCCATTTGG 
TACAGCTGGA TCAACTGTGA GAAGGGTTAA TTTGCCATCA TGTTCAGCTG AGAGAATCAT 
ACCCTGGCTG ACATATTTTT TCATCATTTT ACGTGGTTTG AGGTTAGCAA CGATTTGAAC 
TTTCTTGCCG ACCAATTCTT GTTCATTTGG ATAGTATTTT GCAATTCCTG AAAGAATCTG 
ACGATCTTCT CCATCACCAG CATCCAAGCG GAATTGAAGC AACTTATCTG AACCTTCTAC 
TTTAGACACT TCTTTGACTT CTGCGACACG GATTTCAACC TTGTCAAAGT CTTCAAACTT 
GATTTCATCC TTGTTTAGTT TGAGCTCAAC TTCGTCCGGA TTCCATTCTT TTTCGACTGC 
TGGTTTATTG CCTTCCA^TT GTTCCTTGAT ATAGGCGATT TCTTCTTCCA TATTTAGACG 
TGGAAAGATA GGTGTTCCTT TGGCAACTAC AGTCACATCT GCTGGGAAGT CAGCCAAACT 
CAAGTTTTCA AGACTAGAAA CTTCTTCCAA ACCAAGTTGA GTCAAAACTG CACGACTAGT 
TTCCATCATA AATGGTTCAA TCAAGTGAGC AACTACACGA ATGCTGGCTG CCAAGTGGCT 
CATGACACTT GCCAATTGGT CACGAAGAGC TTCATCCTTG GCCAAGACCC ATGGTGCGGT 
CTCATCGATG TATTTATTGG TACGAGAGAT CAGAGTCCAG ACTGCTTCAA GCGCACGTGG 
ATAGTCAACT GCTTCCATGT GTGTATGGAA GTCTGCGATT GATTGTwCTG CAACCTCAGC 
AAGAACATGA TCATATTCAG TCACACCTTC TACATAGGCA GGGATTTGTC CATCAAAGTA 
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CTTATTAATC ATGGAAACCG TACGGTTAAG GAGGTTCCCA AGGTCATTAG CCAATTCATA 
GTTGATACGG CCGACATAGT CTTCAGGAGT AAAGGTTCCG TCTGAACCAA CTGGAAGGTT 
ACGCATGAGG TAGTAACGAA GTGGATCTAG TCCATAACGC TCTACCAACA TTTCAGGGTA 
AACGACATTC CCTTTTGACT TAGACATTTT TCCGTCTTTC ATGACAAACC AACCATGGGC 
AATCAAACGA TCAGGTAATT TAACATCCAA CATCATAAGA AGGATTGGCC AGTAGATAGA 
. GTGGAAGCGA AGGATATCTT TTCCTACCAT ATGGAAGACT GTTCCATTCC AGAACTTGTC 
AAAGTTACCA TGTTCGTCTT GAGCGTAGCC AAGAGCTGTC GCATAGTTAA GAAGGGCATC 
AATCCAAACG TAGACAACGT GTTTTGGATT TGATGGGACA GGCACTCCCC ATGTAAAGGT 
TGTACGAGAT ACCGCCAAAT CTTCCAAGCC TGGCTCGATG AAGTTGCGTA GCATTTCATT 
AAGGCGACCA TCTGGCGTGA TAAATTCAGG ATGAGCTTTG AAAAATTCGA CCAAACGGTC 
TTGGTATTTG CTAAGGCGAA GGAAGTATGA TTCTTCAGAA AcCCATTCAA CCTCATGACC 
TGATGGAGCA ATACCACCAG TCACATTTCC AGCTTCATCA CGGAAAACTT CTGCCAGCTG 
GCTTTCTGTA AAGAATTCTT CGTCTGATAC TGAATACCAA CCAGAGTATT CACCCAAGTA 
GATATCATCT TGAGCAAGTA AGCGTTCAAA GACTTGTGCG ACAACTTTTT CATGGTAGTC 
ATCAGTTGTA CGGATAAATT TATCGTATGA GATATCTAGT AATTGCCAGA GTTCTTTAAC 
TCCAACCGCC ATTCCATCAA CATAGGCTTG AGGTGTAATA CCAGCTTCTT CCGCTTTCTG 
CTGGATTTTC TGACCATGTT CATCAAGACC TGTCAGATAA AATACATCGT AGCCCATCAG 
GCGTTTGTAA CGTGCTAGGA CATCACATGC GATAGTTGTG TAGGCAGAAC CGATATGAAG 
TTTCCCAGAT GGATAGTAAA TCGGCGTTGT AATATAAAAA TTTTTTTCAG ACATAATTTT 
TCCTTTCCAG GCAAATGAAA CCTGTTTTTC TAACACTTCA TTATATCACA TTTTTAATGA 
ATTTCAATAG GGAAATCCAT ACAAAAACAA GATAGACGAG TGTCCATCTT GTTGATCTCA 
TTCATAACGA AGGGCTTCAA TTGGATCAAG TTTCGATGCC TTGTTGGCTG GCAAGACTCC 
AAAAATCATA CCAACACTAG CCGAAACTGC AAGACTAAAT AGGGCGACTG GGATTGATAC 
TCCAACTTCT ATACCTTCTA TTAAACCTTG CAGTAACAAA CCTGCTAAGg CAGTTAAACC 
ACTTGCAATT GTCAAGCCAA TTAAGCCACC TAACAAGGTC AAAATCATGG ATTCAATCAA 
AAACTGAATT AAAATATTGG CACGTGTTGC ACCCAAAGCC TTACGAAGAC CAATCTCACG 
AGTGCGCTCT GTCACCGAAA CCAGCATGAT GTTCATGACA CCAGTTCCTC CAACAAAGAG 
AGAAATCCCT GCGATGGAAC TAATAATCGT CGTCATAAAA CTAAACGATT GTTGAATTTC 
TGCAAATACA ACGGACTCAT CTGCCAGCTG GTATTCTCCC TGTTGTAAGC CTGCAAGCTC 
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TGTCATTTTT 
AAAGACAATA 
AATAGGCAAG 
CCCAATGACC 
AGATTCAAAT 
GAAATCTTGC 
AGTTCTGTTT 
GGCATTCGTT 
GACCCAGGAT 
CGTAAAAGCT 
GACGCTAATA 
ACCCAAAGCC 
CAAAGAACGC 
AGTTTTCCTC 
TCTGACGTTT 
CTTCTTTATT 
CTCCTGTCGG 
CTACACGTTG 
ATTCAACCTT 
CGTAAATCAA 
GCTGAAAGAC 
CAGCCACTTC 
TCGTATTCAT 
CCTCATTCAC 
AACTTCTGAA 
TCTTCCAAGG 
ATTTCTTGAT 
TGTTCATCCA 
ACAAGAATAG 
TCACCAACCT 



CGTGCCAGTT 
TTAGCTATTT 
CCACCAAACC 
CGGTAACTAA 
AAACTAATGG 
TCTCTCAGAC 
CCACCTGTCA 
GAATTGGTTA 
TCTTGCGGTT 
GATTGTTTCT 
TTTTTCTGAG 
ATAATCACAA 
ATCTTGTGAG 
CTTTCCTAAC 
GGCATAGGCA 
CAAATCAACC 
TTCATCCGCT 
CTTTTGACCA 
GTCTAAATAT 
GGGCAATTCT 
AAAACCGATT 
TTGACCTTCA 
CAGAGTGGAC 
TTCTAGATTG 
GATATTTTTT 
AAGATGTTGG 
TTTCTGCGTC 
CAATCCAGAC 
CCTTAGTTTT 
CGCCTGTCAC 



CTGGACCCAG 
CATCTACATT 
CATATATTTT 
ATCCATTGAC 
ACAATTCCTC 
TACGACCTGC 
AATTAGCATT 
CATAGTAACT 
TTGGCGGTTC 
GAGTAAAAGA 
ATTTAGTCAT 
CAACTGATGA 
CCATGATAGA 
TGAGCACTGT 
GCAATCTCAG 
AATAATTGCA 
AGGATAATAG 
CCAGATAATT 
TCCTCAGCCA 
ACATTTTGCA 
TGTTGGTTAC 
AGATAATATT 
TTACCAGACC 
ATATTTTTGA 
AGACTAATTA 
ATTACTGATG 
AGCATTTCCC 
ATAATTTTTA 
GCTTTTAACC 
ATCAATAGTA 



726 
AGTTGGGGTT 



AAAATTCGCA 
TGATCTTTTA 
TTCTACAACC 
ATCTAGCAAA 
AATAATTTCA 
CTCAACCTTT 
ATCCACTCCC 
AACAGGAACT 
CCCGTCTTTA 
ATCTTTATTG 
AACACCGATA 
TGAAAAGGCA 
CAGACGAAAT 
GCTCATGCGT 
TAATTTGGTT 
AAGGATTGTT 
CTGAAGGTAA 
ACTTGCGACG 
GAGCATTGAG 
GGACCTTAGC 
CTCCACTGGT 
CAGATGGTCC 
GAACCTGCAG 
GTTGCTTCAT 
ACCTTAGCAC 
AATGAAACCT 
CTATCATCCA 
TCAATGTTGA 
TAAGGGTATT 



AAACTGGTAT 
GCAAGGGAGA 
GCCTCCGGAC 
TTGTTAATAG 
ATGACACTTG 
TTCTTAACAG 
TTATCTTGAT 
TTCAGTTTAG 
TCCTCTTCCT 
CTTTTTTTAG 
ACTTGACGAG 
ATAATCCCAA 
AATTTCAGAT 
GACCCCATCC 
TACCATGATA 
ACCTGTTTTG 
TACCAAGGCA 
ATGGTGACTA 
TTTTGAAGAC 
CTTCGATAGA 
TAGTTGTTTT 
TGGTGTATCC 
CATGATGGCT 
TTCTTGGTCA 
CAGCCTTCAC 
CGTTCGTTAA 
CAACTTTTTT 
TTACTAGACT 
CAGAAAAACC 
TAGAACCTGT 



CATTCACTCG 
TATTGGTAGT 
TAGTATAAAC 
CCTCTTGAGG 
CAAACTCTTT 
CGTCCATGTA 
AGGTCAAGAT 
CTGCCTCTTG 
TTCCAGAAAC 
GAGAGAAAAA 
ATAGGGAATC 
TCATAGTAAG 
TCTGCATCTT 
CGAATGACAA 
ATGGTTTTTC 
GTATCCAAGG 
CGCGCAATGG 
CGTTCTGTCA 
GAAACTCCTG 
AGAAAGAACT 
TCACCAAGCC 
AACATGCCAA 
ACAAATTCAC 
CCATTACGGT 
CTCTTTTCCT 
ACCAGAAGTG 
AGCCTTTTGT 
GCTAACAGGA 
TTGTTTCAAA 
ATTATTCCCG 
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GCTGCTGGAC TAGCTGCTTC ACCATTGTTT TTAGGATAGT CAGAAATATA GCTTAATTTC 
CCAGTCCATT TTTTATCAGG ATACACTTTA GAAGTAAAGC TTACTTCTTG ACCTACAGAA 
AGGTTGGCTA GATTGTACTC AGACAATTCT CCCTTGACTT GTAAATTTTC ATTGCTGACA 
ATATGAACCA TAACTTGACT CGCCCCTGTT GGAGATTTAG AAACATTGCT ATTGACTTCG 
ACCACAGTTC CCTCTAGGGT ACTGAGAACA GTTGTTGCAT CCAATTGACT TTGAGCCTTG 
. CTTAATTGCG CCGCAGCATC TGCACGCGCA TCACGGGCAT CACCCAATTG AGCGTCAATA 
GAAGCAACAG AATTTCCAGC CACTGGAGTT GGGCTTTGCA CCGTTGCATC TTCTCCTCCT 
ACTGGCGCTG GTAACTGTGG AGCCGGAGCT GAAGCGGCTT CATTTCGTGC TTGATTGAGT 
TCATTGATAT GACGATCTGC CCTAGCTACT GCTCGACTAG CTGAATCATA GGCCGCCTGC 
GCTTCTGAAC TACTGTACTT GACTAAAGCC TGCCCTTCGC TGACCTTATC GCCCACAGAA 
ACAAGGATTT CATCTAAATC ACCCTTACTA GCATCAAAAT AAACATATTG TTCATTTTTT 
GCTGTTACTG TCCCTGACAA TAAAACAGAG GAGGCCACGC TTCCTTCCTT GGCAACAACA 
AGATGAGTAG GCTCATCTTT TAGAGCAGTC TGAGAAGGTT GTCTAAAGAG TAAAATCCCC 
CCAGCACCCA ATACAACTAC ACTCGCAGCA CCGATTGCTG CATACAGTTG CCACTTTTTA 
GCTTTACCAT TCTTTTTCTT CATAATGAAA CTCCTTTTCT TTTTTACAAT ACTTTGCTAT 
TATACCAAAT TTCCCTCCAG CAAACAATAC AGTTCAGGAT TAAACAATCG TTCGGAATTT 
TGCTTTTCGG 

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8195 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D> TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 
GAGAAAGCGC CCACGTTTCC CCGAAGGGAG AAAGGCGGAC AGGTATCCGG TAAGCGGCCA 
GGGTCGGAAC AGGAGAGCGC AACGAGGGAG CTTCCCAGGG GGAAACGCCT GGTATCTTTA 
TAGTCCTGTC GGGTTTCGCC ACCTCTGACT TGAGCGTCGA TTTTTGTGAT GCTCGTCAGG 
GGGGCGGAGC CTATGGAAAA ACGCCAGCAA CGCGGCCTTT TTACGGTTCC TGGCCTTTTG 
CTGGCCTTTT GCTCACATGT TCTTTCCTGC GTTATCCCCT GATTCTGTGG ATAACCGTAT 
TACCGCCTTT GAGTGAGCTG ATACCGCTCG CCGCAGCCGA ACGACCGAGC GCAGCGAGTC 
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AGTGAGCGAG 


GAAGCGGAAG 


AGCGCCCAAT ACGCAAACCG 


CCTCTCCCCG 


CGCGTTGGCC 


420 


GATTCATTAA 


TGCAGCTGGC 


ACGACAGGTT 


TCCCGACTGG 


AAAGCGGGCA 


GTGAGCGCAA 


480 


CGCAATTAAT 


GTGAGTTAGC 


TCACTCATTA 


GGCACCCCAG 


GCTTTACACT 


TTATGCTTCC 


540 


GGCTCGTATG 


TTGTGTGGAA 


TTGTGAGCGG 


ATAACAATTT 


CACACAGGAA 


ACAGCTATGA 


600 


CaTGATTACG 


AATTCGAGCT 


CGGTACCCGG 


AAAATCCAGA 


AAATGCTTGA 


AAAAAATCCT 


660 


AGAAGATGGT 


ATAATACTAA 


ATTGTAAGGG 


TTATCACATA 


TAACTCAAAA 


AAAGAAAGAA 


720 


" CAAAAGGAGA 


GTCAAACTAT 


GGCTTCTAAA 


GATTTCCACG 


TAGTGGCAGA 


AACAGGTATT 


780 


CACGCACGTC 


CAGCAACATT 


GTTGGTACAA 


ACTGCTAGCA 


AATTTGCTTC 


AGATATCACT 


840 


CTTGAGTACA 


AAGGTAAATC 


AGTTAACCTT 


AAATCAATTA 


TGGGTGTTAT 


GAGTCTTGGT 


900 


GTTGGCCAAG 


GTGCTGACGT 


AACTATCTCA 


GCTGAAGGTG 


CAGATGCAGA 


TGACGCTATC 


960 


GCTGCAATCT 


CAGAAACAAT 


GGAAAAAGAA 


GGATTGGCAT 


AAGGGAAATG 


ACAGAAATGC 


1020 


TTAAAGGAAT 


CGCAGCATCT 


GACGGTGTTG 


CAGTTGCAAA 


AGCATATCTA 


CTCGTTCAGC 


1080 


CGGATTTGTC 


ATTTGAGACT 


ATTACAGTCG 


AAGATACAAA 


CGCAGAAGAA 


GCTCGCCTTG 


1140 


ATGCCGCTCT 


ACAGGCATCA 


CAAGACGAGC 


TTTCTGTTAT 


TCGCGAGAAA 


GCAGTAGGTA 


1200 


CGCTCGGTGA 


AGAAGCAGCT 


CAAGTTTTTG 


ATGCTCACTT 


AATGGTTCTT 


GCTGACCCAG 


1260 


AAATGATCAG 


CCAAATCAAG 


GAAACTATCC 


GTGCGAAGAA 


AGTGAATGCA 


GAAGCAGGTC 


1320 


TGAAAGAAGT 


TACAGATATG 


TTTATCACTA 


TCTTTGAAGG 


CATGGAAGAC 


AACCCATACA 


1380 


TGCAAGAACG 


CGCAGcGGAT 


wTCCGCGACG 


TGACAAAACG 


TGTATTGGCA 


AACCTTCTTG 


1440 


GTAAAAAATT 


GCCAAACCCA GCTTCTATCA ATGAAGAAGT 


GATTGTGATT 


GCGCATGACT 


1500 


TGACTCCTTC 


AGATACAGCT 


CAATTGGACA 


AAAACTTTGT 


AAAAGCTTTT 


GTAACCAACA 


1560 


TTGGTGGACG 


TACAAGCCAC 


TCAGCTATCA 


TGGCACGTAC 


ACTTGAAATT 


GCTGCTGTAT 


1620 


TAGGTACAAA 


TAACATCACT 


GAAATCGTTA 


AAGACGGTGA 


CATCCTTGCT 


GTTAACGGGA 


1680 


TCACTGGAGA 


AGTGATTATC 


AACCCAACAG 


ATGAACAAGC 


GGCAGAATTT 


AAAGCAGCTG 


1740 


GTGAAGCCTA 


TGCGAAACAA 


AAAGCTGAAT 


GGGCACTTTT 


GAAAGATGCT 


CAAACAGTGA 


1800 


CTGCTGACGG 


TAAACACTTC 


GAGTTGGCTG 


CTAATATCGG 


TACTCCAAAA 


GACGTTGAAG 


1860 


GTGTTAACAA 


CAACGGTGCA 


GAAGCTGTTG 


GACTTTACCG 


TACAGAGTTC 


TTGTACATGG 


1920 


ATTCTCAAGA 


CTTCCCAACT 


GAAGATGAGC 


AGTATGAAGC 


ATACAAGGCT 


GTTCTTGAAG 


1980 


GAATGAACGG 


TAAACCTGTT 


GTCGTTCGTA CAATGGATAT 


CGGTGGAGAT 


AAGGAACTTC 


2040 


CTTACTTCGA 


TATGCCTCAC 


GAAATGAACC 


CATTCCTTGG 


ATTCCGTGCT 


CTTCGTATCT 


2100 


CTATCTCTGA GACTGGAGAT 


GCTATGTTCC 


GCACACAAAT 


CCGTGCTCTT 


CTTCGTGCGT 


2160 



WO 98/18931 



PCT/US97/19588 



729 

C^CACGG rc^ CGI ATCATGTTCC CAA TGG ™ C GCTCTTGAAA GAA^CC*. 
CAGCGAAAGC AGTCTTTGAT GAAGAAAAAG CAAACCTCT TGCTGAAGGT GTTGCAGTTG 
CGGATAACAT CCAAGTTGGT ATCATGATCG AGATTCCTGC AGCGGCTATG CTI^CAGACC 
AATTTGCTAA AGAAGTTGAC TTCTTCTCAA TTGGTACAAA CGACTTGATC CAATATACAA 
TGGCAGCAGA CCGTATGAAC GAACAAGTTT CATACC^A CCAACCATAC AACCCATCAA 
. TCCTACGC^ GATTAACAAT GTGATCAAAG CAGCTCACGC TGAAGGTAAA ^GCC^a 
TGTGTGGTGA GATGGCTGGT GACCAACAAG C^CAC. ATGGGCTTGG 
ATGAGTTCTC TATGTCAGCA ACATCTGTAC TTCGTACACG COC^ro AAGAAACTCG 
ACACAGCTAA GATGGAAGAG TACGCAAACC O-CCCmc AGAATGCTCA ACAATGGAAG 
AAGTTCTTGA ACTTCAAAAA GAATACGTTA ATTTTGATTA ATCGAAAAGT CCC^CAACT 
CAGTTACAGG GATTTTTTTG ATATTTTAAA AAGAATTTTC AAGAAAATCT TTCTTATAGA 
AAGTCCAACC TTGAAAAAGT AGTGGTCAGA ACAAAAAATA CTTAAATGGT TCATAAAATT 
CTTGACAAGT TGGATATTTA GGAGTAAACT ATTAACCAGT TAAGTAATAG AGAGGAGTTT 
C^CAA^TA GAAATGAATT GCAACTAGAA ATATCAAATA GAAAGAGAGT TTCGATGAAA 
ATTAATAAGA AATACCTTGT TGg^gcc GCACTTTGAT ^AAG TCTT TX^ACG 
AGTTGGGACT GTATCAAGCT AGAACGGTTA AGGAAAATAA rc^^CC TATATAGATG 
GAAAACAAGC GACGCAAAAA ACGGAGAATT TGACTCCTGA TGAGGTTAGC AAGCGTGAAG 
GAATCAATGC TGAGCAAATC GTCATCAAGA TAACAGACCA AGGCTATGTC ACTTCACATG 
OCGACCACTA TCATTATTAC AATGGTAAGG TTCCTTATGA CGCTATCATC AGTGAAGAAT 
TACTCATGAA AGATCCAAAC TATAAGCTAA AAGATGAGGA TATTGTTAAT GAGGTCAAGG 
GTGGATATGT TATCAAGGTA GATGGAAAAT ACTATGTTTA CCTTAAGGAT GCTGCCCACG 
CGGATAACGT CCOTACAAAA GAGGAAATCA ATCGACAAAA ACAAGAGCAT AGTCAACATC 
GTGAAGGTGG AACTCCAAGA AACGATGGTG CK^CTT GGCACGTTCG CAAGGACGCT 
ATACTACAGA ^AT^T ATCTTTAATG CTTCO^AT CATAGAGGAT AC^GTGA^ 
CTTATATCGT GATCATTACC ATTACATTCC TAAGAATGAG TTATCAGCTA 

GCGAGTTGGC TGCTGCAGAA GCCTTCCTAT CTGGTCGAGG AAATCTGTCA AATTCAAGAA 
CCTA^GCCG ACAAAATAGC GATAACACTT CAAGAACAAA CTGGGTACCT TCTGTAAGCA 
ATCCAGGAAC TACAAATACT AACACAAGCA ACAACAGCAA CACTAACAGT CAAGCAAGTC 
AAAGTAATGA CATTGATAGT CTCTTGAAAC AGCTCTACAA AC^CCm* AGTCA ACGAC 
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ATGTAGAATC TGATGGCCTT 


GTCTTTGATC 


CAGCACAAAT 


CACAAGTCGA 


ACAGCTAGAG 


3960 


GTGTTGCAGT GCCACACGGA 


GATCATTACC 


ACTTCATCCC 


TTACTCTCAA 


ATGTCTGAAT 


4020 


TGGAAGAACG AATCGCTCGT 


ATTATTCCCC 


TTCGTTATCG 


TTCAAACCAT 


TGGGTACCAG 


4080 


ATTCAAGGCC AGAACAACCA 


AGTCCACAAC 


CGACTCCGGA ACCTAGTCCA GGCCCGCAAC 


4140 


CTGCACCAAA TCTTAAAATA GACTCAAATT CTTCTTTGGT 


TAGTCAGCTG 


GTACGAAAAG 


4200 


TTGGGGAAGG ATATGTATTC 


GAAGAAAAGG 


GCATCTCTCG 


TTATGTCTTT 


GCGAAAGATT 


4260 


TACCATCTGA AACTGTTAAA 


AATCTTGAAA 


GCAAGTTATC 


AAAACAAGAG 


AGTGTTTCAC 


4320 


ACACTTTAAC TGCTAAAAAA GAAAATGTTG CTCCTCGTGA CCAAGAATTT 


TATGATAAAG 


4380 


CATATAATCT GTTAACTGAG 


GCTCATAAAG 


CCTTGTTTGA 


AAATAAGGGT 


CGTAATTCTG 


4440 


ATTTCCAAGC CTTAGACAAA TTATTAGAAC 


GCTTGAATGA 


TGAATCGACT 


AATAAAGAAA 


4500 


AATTGGTAGA TGATTTATTG GCATTCCTAG CACCAATTAC CCATCCAGAG 


CGACTTGGCA 


4560 


AACCAAATTC TCAAATTGAG 


TATACTGAAG 


ACGAAGTTCG 


TATTGCTCAA 


TTAGCTGATA 


4620 


AGTATACAAC GTCAGATGGT 


TACATTTTTG 


ATGAACATGA 


TATAATCAGT 


GATGAAGGAG 


4680 


ATGCATATGT AACGCCTCAT 


ATGGGCCATA 


GTCACTGGAT 


TGGAAAAGAT 


AGCCTTTCTG 


4740 


ATAAGGAAAA AGTTGCAGCT 


CAAGCCTATA 


CTAAAGAAAA 


AGGTATCCTA 


CCTCCATCTC 


4800 


CAGACGCAGA TGTTAAAGCA 


AATCCAACTG 


GAGATAGTGC 


AGCAGCTATT 


TACAATCGTG 


4860 


TGAAAGGGGA AAAACGAATT 


CCACTCGTTC 


GACTTCCATA 


TATGGTTGAG 


CATACAGTTG 


4920 


AGGTTAAAAA CGGTAATTTG 


ATTATTCCTC 


ATAAGGATCA 


TTACCATAAT 


ATTAAATTTG 


4980 


CTTGGTTTGA TGATCACACA 


TACAAAGCTC 


CAAATGGCTA 


TACCTTGGAA 


GATTTGTTTG 


5040 


CGACGATTAA GTACTACGTA 


GAACACCCTG 


ACGAACGTCC 


ACATTCTAAT 


GATGGATGGG 


5100 


GCAATGCCAG TGAGCATGTG 


TTAGGCAAGA 


AAGACCACAG 


TGAAGATCCA 


AATAAGAACT 


5160 


TCAAAGCGGA TGAAGAGCCA GTAGAGGAAA CACCTGCTGA GCCAGAAGTC CCTCAAGTAG 


5220 


AGACTGAAAA AGTAGAAGCC 


CAACTCAAAG 


AAGCAGAAGT 


TTTGCTTGCG 


AAAGTAACGG 


dZou 


ATTCTAGTCT GAAAGCCAAT 


GCAACAGAAA 


CTCTAGCTGG 


TTTACGAAAT 


AATTTGACTC 


5340 


TTCAAATTAT GGATAACAAT 


AGTATCATGG 


CAGAAGCAGA 


AAAATTACTT 


GCGTTGTTAA 


5400 


AAGGAAGTAA TCCTTCATCT 


GTAAGTAAGG 


AAAAAATAAA 


CTAATGAAAA 


ATGAAAGTCT 


5460 


CGATAAAGAG GCTTTCATTT 


TTATTATGTA 


TATATGTAAA 


ATTCTTGACA 


AGCAATATTA 


5520 


AAAAGAGTAA ACTATTAACT 


AGTTAATTAA 


CCGGTTTATT 


ACTTTATAGT 


GAATCAAATA 


5580 


TACTTAAGAA AAGAGGAAAG 


AATGAAAATT 


AATAAAAAAT 


ATCTAGCAGG 


TTCAGTGGCA 


5640 


GTCCTTGCCC TAAGTGTTTG 


TTCCTATGAA 


CTTGGTCGTC 


ACCAAGCTGG 


TCAGGTTAAG 


5700 



WO 98/18931 



PCTAJS97/19588 



731 



AAAGAGTCTA 


ATCGAGTTkC 


TTATATAGAT 


GGTGATCAGG 


CTGGTCAAAA 


GGCAGAAAAC 


5760 


TTGACACCAG 


ATGAAGTCAG 


TAAGAGGGAG 


GGGATCAACG 


CCGAACAAAT 


CGTCATCAAG 


5820 


ATTACGGATC 


AAGGTTATGT 


GACCTCTCAT 


GGAGACCATT 


ATCATTACTA 


TAATGGCAAG 


5880 


GTCCCTTATG 


ATGCCATCAT 


CAGTGAAGAG 


CTCCTCATGA 


AAGATCCGAA 


TTATCAGTTG 


5940 


AAGGATTCAG 


ACATTGTCAA 


TGAAATCAAG 


GGTGGTTATG 


TTATCAAGGT 


AGATGGAAAA 


6000 


TACTATGTTT 


ACCTTAAGGA 


TGCAGCTCAT 


GCGGATAATA 


TTCGGACAAA 


AGAAGAGATT 


6060 


AAACGTCAGA AGCAGGAACA CAGTCATAAT CACGGGGGTG GTTCTAACGA TCAAGCAGTA 


6120 


GTTGCAGCCA 


GAGCCCAAGG 


ACGCTATACA 


ACGGATGATG 


GTTATATCTT 


CAATGCATCT 


6180 


GATATCATTG 


AGGACACGGG 


TGATGCTTAT 


ATCGTTCCTC 


ACGGCGACCA 


TTACCATTAC 


6240 


ATTCCTAAGA 


ATGAGTTATC 


AGCTAGCGAG 


TTAGCTGCTG 


CAGAAGCCTA 


TTGGAATGGG 


6300 


AAGCAGGGAT 


CTCGTCCTTC 


TTCAAGTTCT 


AGTTATAATG 


CAAATCCAGC 


TCAACCAAGA 


6360 


TTGTCAGAGA 


ACCACAATCT 


GACTGTCACT 


CCAACTTATC 


ATCAAAATCA 


AGGGGAAAAC 


6420 


ATTTCAAGCC 


TTTTACGTGA 


ATTGTATGCT 


AAACCCTTAT 


CAGAACGCCA 


TGTGGAATCT 


6460 


GATGGCCTTA 


TTTTCGACCC 


AGCGCAAATC 


ACAAGTCGAA 


CCGCCAGAGG 


TGTAGCTGTC 


6540 


CCTCATGGTA 


ACCATTACCA 


CTTTATCCCT 


TATGAACAAA 


TGTCTGAATT 


GGAAAAACGA 


6600 


ATTGCTCGTA 


TTATTCCCCT 


TCGTTATCGT 


TCAAACCATT 


GGGTACCAGA 


TTCAAGACCA 


6660 


GAACAACCAA 


GTCCACAATC 


GACTCCGGAA 


CCTAGTCCAA 


GTCCGCAACC 


TGCACCAAAT 


6720 


CCTCAACCAG 


CTCCAAGCAA 


TCCAATTGAT 


GAGAAATTGG 


TCAAAGAAGC 


TGTTCGAAAA 


6780 


GTAGGCGATG 


GTTATGTCTT 


TGAGGAGAAT 


GGAGTTTCTC 


GTTATATCCC 


AGCCAAGGAT 


6840 


CTTTCAGCAG 


AAACAGCAGC 


AGGCATTGAT 


AGCAAACTGG 


CCAAGCAGGA 


AAGTTTATCT 


6900 


CATAAGCTAG 


GAGCTAAGAA 


AACTGACCTC 


CCATCTAGTG 


ATCGAGAATT 


TTACAATAAG 


6960 


GCTTATGACT 


TACTAGCAAG 


AATTCACCAA 


GATTTACTTG 


ATAATAAAGG 


TCGACAAGTT 


7020 


GATTTTGAGG 


CTTTGGATAA CCTGTTGGAA CGACTCAAGG ATGTCyCAAG TGATAAAGTC 


7080 


AAGTTAGTGG 


ATGATATTCT 


TGCCTTCTTA 


GCTCCGATTC 


GTCATCCAGA 


ACGTTTAGGA 


7140 


AAACCAAATG 


CGCAAATTAC 


CTACACTGAT 


GATGAGATTC 


AAGTAGCCAA 


GTTGGCAGGC 


7200 


AAGTACACAA 


CAGAAGACGG 


TTATATCTTT 


GATCCTCGTG 


ATATAACCAG 


TGATGAGGGG 


7260 


GATGCCTATG 


TAACTCCACA 


TATGACCCAT 


AGCCACTGGA 


TTAAAAAAGA 


TAGTTTGTCT 


7320 


GAAGCTGAGA GAGCGGCAGC CCAGGCTTAT GCTAAAGAGA AAGGTTTGAC CCCTCCTTCG 


7380 


ACAGACCATC 


AGGATTCAGG 


AAATACTGAG 


GCAAAAGGAG 


CAGAAGCTAT 


CTACAACCGC 


7440 
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GTGAAAGCAG CTAAGAAGGT GCCACTTGAT CGTATGCCTT ACAATCTTCA ATATACTGTA 7500 

GAAGTCAAAA ACGGTAGTTT AATCATACCT CATTATGACC ATTACCATAA CATCAAATTT 7560 

GAGTGGTTTG ACGAAGGCCT TTATGAGGCA CCTAAGGGGT ATACTCTTGA GGATCTTTTG 7620 

GCGACTGTCA AGTACTATGT CGAACATCCA AACGAACGTC CGCATTCAGA TAATGGTTTT 7680 

GGTAACGCTA GCGACCATGT TCGTAAAAAT AAGGTAGACC AAGACAGTAA ACCTGATGAA 7740 

GATAAGGAAC ATGATGAAGT AAGTGAGCCA ACTCACCCTG AATCTGATGA AAAAGAGAAT 7800 

CACGCTGGTT TAAATCCTTC AGCAGATAAT CTTTATAAAC CAAGCACTGA TACGGAAGAG 7860 

ACAGAGGAAG AAGCTGAAGA TACCACAGAT GAGGCTGAAA TTCCTCAAGT AGAGAATTCT 7920 

GTTATTAACG CTAAGATAGC AGATGCGGAG GCCTTGCTAG AAAAAGTAAC AGATCCTAGT 7980 

ATTAGACAAA ATGCTATGGA GACATTGACT GGTCTAAAAA GTAGTCTTCT TCTCGGAACG 8040 

AAAGATAATA ACACTATTTC AGCAGAAGTA GATAGTCTCT TGGCTTTGTT AAAAGAAAGT 8100 

CAACCGGCTC CTATACAGTA GTAAAATGAA TGGAGCATAT TTTATGGAGA AGTAACCTTT 8160 

CGTGTTACTT CTCTTTTTTA GAAAAACGTA ACAGA 8195 



(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2004 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

TTTACTAAAA GGAAAAAAGA ACTGATTTCT CAGTCCTTCA TTAATCTTAT TCCACACTAA 60 

ATAGGTATGG GTAAACAGGT TGTTGACCTT GGTGAATCTC GACTTCAACG TCTTCGAATT 120 

CTTCTACGAT TTCTTGAGCG ATTTCATTGG CAAGTTCTTC GCTTCCGTCT TCACCTACAT 180 

AGAAGGTTAC GATTTCACTG TCTTCATCCA ACATATGTTT CAAGGTTTCA GTCAATGTrT 240 

GGTGCATATC AGGGTTTGAC ACAAGAATTT TTCCATCCAC CAT AC CT AAA TTATCGTTTT 300 

CATGGATTTC TAAGCCATCG ATCGTTGTAT CACGCACGGC TGTTGTGACG CTTCCGCTAA 360 

CGACATCGCT AAGAGCAGCT GTCATACGCT CTTGGTTTTC TTCAATGGAC TTGCTTGGAT 420 

CAAAGGCAAG AAGACTTGTC ATACCTTGAG GAAGAGTGCG AGCCTCTACC ACTACCGCTG 480 

GTTGCTCCAA AACTTCTGCC GCAGATTGAG CTGCCATGAA GATGTTCTTG TTGTTTGGCA 540 

AGAAGATGAT GTTACGGGCA TTAACCTGTT CAACAGCCTT GATAAAGTCT TCTGTTGAAG 600 

GGTTCATGGT TTGACCGCCT TCGATAACAT AATCCACGCC TTGAGAACAG AAGATATCTG 660 



WO 98/18931 



PCT/US97/19588 



733 

CTAGACCTTT ACCAGCCACC ACAGCAATCA AAGCATACTC TTTTTCTTCA GCCGACTTGA 720 

TAACTTGAGT AGCTTCTTTC TCAACCTGTG CTTCGTGTTG GTTACGCATA TTGTCAACTT 780 

TTACCTTGAC CAAGCTACCA TATTTGAGAC CTTCTTGCAT AACAAGTCCT GGATCTTCTG 840 

TATGAACATG GACTTTGACA ATTTCATCAT CGTTAACAAC AAGGAGAGAA TCTCCAAGCT 900 

CATCCAAGTA GTTACGGAAT TCATCGTAGT CAAAATCTTT AGCATAGGTT GGACCTTGCT 960 

TAAGAGCTAC CATGATTTCA GTACAGTAAC CAAACGTGAT GTCCTCAGTC GCTACGTGAC 1020 

CAGCTACAGA CTTATGATGC TCTACATTGA TCATCTCACT CATGTTGGCA GGAGTCGCTA 1080 

CAAAGTCCTC AGATGCAATA TATTCGCCAG TAAGGGCTGA AAGGAAACCT TCGTAGATGA 1140 

AGACCAATCC TTGACCACCT GAGTCCACAA CGCCAACTTC TTTCAATACT GGAAGCATGT 1200 

CTGGTGTTTT AGCTAGAGCT GTTTTAGCAC CTTCCAAGGC TGCGCGCATG ACTTCAACAG 1260 

CGTCATCTGT TTGCTCAGCT TTTTTCTTAG CACCGATAGC AGCTCCACGA GAAACTGTTA 1320 

AAATCGTTCC TTCAACAGGT TTCATCACTG CCTTATAGGC AACTTCCACA CCTGATTGGA 1380 

AGGCCAGAGC CAAGTCTTGA CCTGTTAACT CGTCTTTATC CTTGATAGCT TGGGAAAATC 1440 

CACGGAAAAG CTGAGACGTA ATCACTCCTG AGTTCCCACG CGCACCCATC AAAAGCCCTT 1500 

TGGCAAGAAT GCTCGCTACT TCTCCAACTG TAGAAGCTGG CTTGTCTGCA ACTTCTTTAG 1560 

CACCATTTTC AATGGTCATT CCCATATTTG TCCCAGTATC TCCATCTGGA ACTGGAAAGA 1620 

CGTTTAATGA ATTGACATAT TCAGCTTGCT TATTCAAGCG AGTTGATGCA GCCTGCACCA 1680 

TTTCTTGAAA TAAGCTAGTA GTAATTTTTG ACACGGTTAT TCTCCTACAA CTTTGATATT 1740 

TTGAATGTAG ACATTTACAG TCTGAGCAGT AATTCCAAGC TGGTTTTCCA AGCTAAAGGC 1800 

AACACGCTCT TGAATGTTTT TTGACACTTC ACTAATCTTT GTTCCGTAGC TTAACACGGT 1860 

ATATACATCA ACTGCAATAC TGCCATCTTC GGCTGCCTTT ACGACGACAC CTTTAGAATA 1920 

ATTTTCCTTA CCTAGCAGGG CTTGGAAATT ATCTTTGAGG GCATTTTTAC TAGCCATACC 1980 

GACCACACCA GAAATCTCAG TTGC 2004 
(2) INFORMATION FOR SEQ ID NO: 96: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11915 base pairs 

(B) TYPE: nucleic acid 
•(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
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CCGGGTTGGG 


CTGTTCGCCC 


ATTAAAGCGG 


CACCACAGCT 


GGGTTCAGAA 


CGTCGTGAGA 


60 


CAGTTCGGTC 


CCTATCCGTC 


GCGGGCGTAG 


GAAATTTGAG 


AGGATCTGCT 


CCTAGTACGA 


120 


GAGGACCAGA GTGGACTTAC 


CGCTGGTGTA 


CCAGTTGTCT 


TGCCAAAGGC 


ATCGCTGGGT 


180 


AGCTATGTAG 


GGAAGGGATA 


AACGCTGAAA 


GCATCTAAGT 


GTGAAACCCA 


CCTCAAGATG 


240 


AGATTTCCCA 


TGATTATATA 


TCAGTAAGAG 


CCCTGAGAGA 


TGATCAGGTA 


GATAGGTTAG 


300 


AAGTGGAAGT 


GTGGCGACAC 


ATGTAGCGGA 


CTAATACTAA 


TAGCTCGAGG 


ACTTATCCAA 


360 


AGTAACTG AG 


AATATGAAAG 


CGAACGGTTT 


TCTTAAATTG 


AATAGATATT 


CAATTTTGAG 


420 


TAGGTATTAC 


TCAGAGTTAA 


GTGACGATAG 


CCTAGGAGAT 


ACACCTGTAC 


CCATGCCGAA 


480 


CACAGAAGTT 


AAGCCCTAGA 


ACGCCGGAAG 


TAGTTGGGGG 


TTGCCCCCTG 


TGAGATAGGG 


540 


AAGTCGCTTA 


GCTCTAGGGA 


GTTTAGCTCA 


GCTGGGAGAG 


CATCTGCCTT 


ACAAGCAGAG 


600 


GGTCAGCGGT 


TCGATCCCGT 


TAACTCCCAT 


TTTAGCGGGT 


GTAGTTTAGT 


GGTAAAACTA 


660 


CAGCCTTCCA 


AGCTGTTGTC 


GCGAGTTCGA 


TTCTCGTCAC 


CCGCTTTGAA 


CTTTGTTCTT 


720 


TGTACCAAGT 


TTTTGACTTG 


GGCGCGTAGC 


TCAGGTGGTT 


AGAGCGCACG 


CCTGATAAGC 


780 


GTGAGGTCGG 


TGGTTCGAGT 


CCACTCGTGC 


CCATAGTGTT 


TAGTCCATTA 


CTAGGGGATT 


840 


GGAATATTAT 


CTGTTCACTA 


AGAGGACACG 


GGCTTGTTCC 


CGTATAAACT 


ATTTTGGAGG 


900 


ATTACCCAAG 


TCCGGCTGAA 


GGGAACGGTC 


TTGAAAACCG 


TCAGGCGTGT 


AAAAGCGTGC 


960 


GTGGGTTCGA 


ATCCCACATC 


CTCCTTTTAT 


ATTAACGCGG 


GATGGAGCAG 


CTCGGTAGCT 


1020 


CGTCGGGCTC 


ATAACCCGAA 


GGTCGTAGGT 


TCAAATCCTG 


CTCCCGCAAT 


AAGGCTCGGT 


1080 


AGCTCAGTTG 


GTAGAGCAAT 


GGATTGAAGC 


TCCATGTGTC 


GGCGGTTCGA 


TTCCGTCTCG 


1140 


CGCCATTTAT 


ATATTTTGGA 


AGGGTAGCGA 


AGAGGCTAAA 


CGCGGCGGAC 


TGTAAATCCG 


1200 


CTCCTTCGGG 


TTCGGGGGTT 


CGAATCCCTC 


CCCTTCCATT 


TTACGGGCAT 


AGTTTAAAGG 


1260 


TAGAACTAAG 


GTCTCCAAAA 


CCTTCAGTGT 


GGGTTCAATT 


CCTACTGCCC 


GTGTTAATAG 


1320 


AATTATGGCG 


GGTGTGGTGA 


AGTGGTTAAC 


ACACCAGATT 


GTGGCTCTGG 


CATGCGTGGG 


1380 


TTCGATCCCC 


ATCACTCGCC 


TATTTTATAT 


TGGGGTATAG 


CCAAGCGGTA 


AGGCAAGGGA 


1440 


CTTTGACTCC 


CTCATGCGTT 


GGTTCGAATC 


CAGCTACCCC 


AGTTACTATT 


TGCCGGCGTG 


1500 


GCGGAATTGG 


CAGACGCGCT 


GGACTCAAAA 


TCCAGTGTCC 


GCAAGGACGT 


GCCGGTTCGA 


1560 


CCCCGGCCGC 


CGGTATAGTA 


TAGTGTTAGG 


AACGTTGTTA 


TTCTTCGTTC 


CTTTTTTATA 


1620 


TTATTTTTGG 


TATAATTATA 


GTTATTCAAA 


TTTTATTTAG 


ATTAAGAAAG 


TGTAGGGGAG 


1680 


TATGTCTTGT TCTATCGATT 


TATTAAAACA 


TCGGTATTTG 


AAAAATATTA 


AAGAAAATCC 


1740 


TGAATTGTTT 


GTCGGAATTG 


AGTTGGAGTA 


TCCTGTTGCA 


AGTTTAGAAG 


GGGATGCTAC 


1800 
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AGATGTTGAA GTTATGAAGG ATCTATTTCA TTATTTAGTT TCTACTTTGG 


ATCTCACCGT 


1860 


AGCAAAGGTA 


GATGATTTTG 


GCAATCTGAT 


P P APTT Af! T A 


\m\ X X 


GTCAGGATGC 


1920 


TATTTTATTT 


GAAGTTTCCT 


ATACAACGAT 


TGACiTTTfJPA 


1 w I M IW2 r P A A/T/'i 
1 1 IwlAnlib 


CTGAAACGAT 


1980 


TCAAGAGGTC 


GAAAATCGTT 


TPAATAATTA 




Ai ILAuAuAA 


AGTTAGCTGA 


2040 


ATCAAATCAT 


GP/TATTGTTf; 




pp 11VPPBSP 


UPPPRT A A A A 

IajOUA I AAAA 


ATGAGAATTG 


2100 


TCCAGTGGCT 


TATPPAPfiPT 


ATPAflATV^TT 


O B 'VCC a rpT & T> 
uAloliAi 1A1 


11 L»AATTTGA 


GTAGAAATAT 


2160 


TATTAAATPA 


A mmrp a /"» A T>/-» 


ATTTCCCTGA 


ATATGGTACT 


TTTATCTGTG 


GGAGCCAGGT 


2220 


TCAGfIV"W5A m 
* X uun X 


Al 1 1 V-AAAAA 


CCAACTACTT 


ACGGGTGATT 


AATGCTTTTA 


CTCAAATTGA 


2280 




vatrl 1ATTTAT 


TTGCAAACTC 


TGAATTTTCG 


GGTGCGGATT 


GGGATACGAA 


2340 


A ATTTV* a a/v 


UATATTTTCT 


GGGAAGAATC 


TATGCATGGT 


ATCTATCCAG 


AGAATGTTGG 


2400 




AGACTCCTTA 


ATGATGAAAC 


TGATTTTTTT 


GACTATCTAA 


ATCATTCTGC 


2460 


wftl 111 1 


F^rrr* a a #vnv 
t»t-VA»AACGTG 


ATGGGCAGAC 


CTATTATTTT 


TATCCTATTC 


AGGCTGGGGA 


2520 




ALGTCv-GAAA 


TCCAAGCATT 


TGCTCTGAAT 


GGGGATGAGG 


TTATTATTTA 


2580 




A & /TV"* A l|lMUmH/"> 

nnuun 1 1 TlVj 


AAACTCATCG 


TAGTTACCAG 


TACCAAGATT 


TAACGACTCG 


2640 


AGGAAPAfiTT 


GAGTTTCGTA 


GTGTGTGTAC 


ACAGCCACTT 


GATAGGACTT 


TTGCTTCTGC 


2700 


AGCTTTTPAP 

VJ V- X X X X VnL 




TGGTTAATTT 


AGACAAGTTA 


GAAGCTTACT 


TAGAAACAGC 


2760 


ACCTTTCTTT 


AAAGTATTTG 


GTTATGATTA 


CAAGTCTTTA 


AGGAGACAAT 


TTTCTAAGAA 


2820 


AAATCTTACA 


GATGAGGAAG 


AAACTACGAT 


TATTGAATTT 


TCCAAAGACT 


TACTCCTACT 


2880 


AGCTGAGGAG 


GGACTAGTGG 


TGAGAAATAA 


GGAAGAAATG 


ACCTATTTAC 


AGCCTTTGAG 


2940 


AGAAGAATTG 


AGCCTATAAT 


TTCTCTTATA 


AAGGGAGAAT 


TTTCTGAAAA 


ATCATGATAT 


3000 


AATGGACGAG 


ACTATAGATA 


AAGGATAGAG 


AGTAATGACA 


TTAGTTTATC 


AATCAACGCG 


3060 


TGATGCCAAC 


AATACAGTAA 


CTGCCAGCCA AGCAATTTTG 


CAAGGTTTGG 


CGACGGACGG 


3120 


CGGTTTGTTT 


ACACCGGATA 


CTTATCCAAA 


GGTAGATTTG AACTTTGACA 


AATTGAAAGA 


3180 


TGCTTCTTAC 


CAGGAAGTTG 


CTAAGCTAGT 


TTTGTCAGCA 


TTTTTAGATG 


ACTTTACAGT 


3240 


TGAGGAGTTG GACTACTGTA TCAACAATGC CTACGATAGC AAATTTGATA CTCCAGCTAT 


3300 


TGCACCATTA 


GTGAAATTAG 


ATGGGCAATA 


CAATTTGGAA 


CTTTTCCATG 


GTTCAACGAT 


3360 


TGCCTTTAAG 


GATATGGCCT 


TGTCTATTTT 


GCCATACTTT 


ATGACGACTG 


CTGCTAAGAA 


3420 


ACATGGTTTG 


GAGAACAAGA 


TTGTTATCTT 


GACAGCGACA 


TCTGGTGACA 


CGGGGAAAGC 


3480 


TGCTATGGCG 


GGGTTTGCGA 


ATGTGCCTGG 


TACTGAGATT 


ATCGTCTTTT 


ATCCAAAGGA 


3540 
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TGGTGTCAGC 
TATTGCTATT 
CGTGGCTCTT 
GAACATTGGT 
TAAGACTGGT 
TGGAAATATC 
CTGTGCTTCA 
AAAACGTGAG 
GGAGCGCTTG 
TGCCTTGAAC 
CTTTGCAGCT 
GTTAGATTCT 
CCAATCGGCC 
GTTCCCAGTA 
CTTGGCTCAA 
AATAGCTCCA 
GGCTTATTTA 
TTTTCATCAG 
AGCTATGGGT 
TCATTTAGGA 
TCAGGCGATT 
GCAGAAAGAC 
ATTAAGTTTC 
GGGGACGGAG 
GATTGTTCTC 
ACGTCTGCCT 
AGCTATTTTT 
TTTGGTTGGT 
TGGTTTAGAT 
GAGGGCTGGA 



AAGATTCAAG 
GATGGTAACT 
CGTGAAAAAT 
CGTCTGGTGC 
GAAATTGTAG 
TTGGCTGCCT 
AATGACAACA 
TTTAAGGTAA 
ATTTTCCATC 
ACGCAAGGAC 
GAATATGCGA 
TATATCGAGG 
ACTGGAGATG 
GTTGCAGTAG 
TTACATGAAA 
ATTCGTCACA 
GGACTTTAAG 
GAGGAGAGAT 
GAAAACTTTT 
TTGATAGCTA 
TTCATCGCTC 
CAGTCGAAGT 
CTTTTAGGAT 
AGGGATGTAG 
TTAGGTTTAA 
CTCTATGTTA 
GTTCTGGATA 
CTTGTGATTT 
AAGGAACTGT 
GATGTAGTGA 



736 

AGTTACAAAT GACCACTCAG 



TTGACGATGC GCAAACAAAT 
TGACTACCAA CAAGTTGCAA 
CACAAATTGT TTATTATGTT 
CTGGTGAAAA GGTTAACTTC 
TTTATGCCAA ACAAATTGGT 
ATGTTTTGAC AGACTTCTTT 
CAACCAGCCC ATCTATGGAT 
TTTTGGGAAA TAATGCTGAA 
AATATAAGTT GACAGACTTT 
CTGAGGAAGA AACGGCAGCA 
ACCCTCATAC AGCTGTTGCT 
TAACTAAGAC AGTGATTGCT 
AAGCTGTAAC TGGAAAAGCA 
TCTCAGGCGT TGCAGTGCCA 
AGACAACAGT GGCAGCTGCT 
ACAGAGGGAG CAAACTCGGT 
TGTTTAAGAA AAATAAAGAC 
TGCAGATGCT AATGGGAATG 
TTTCAGGGGT TTCAGTAGCT 
TGGGAGCTGC TATTTCCAGT 
TGGCCTATCA TGTGACTGAG 
TTTTGTCCAT CTTCGCTGGG 
CTGAGAGTGG TGGACTGTAT 
TGACTAGTCT AGGAGCCTTG 
GTTTTTTATC CAATGCCTTG 
TGGGGATAGC TGGTGTTGCT 
TGTGGTCACA ATTAAAACTG 
TGACCTTGGC TTTACCAGCA 
TCATTGCCTT GGTCGTTTCT 



ACTGGCGACA 
GTGAAGCACA 
TTTTCATCAG 
TATGCTTACG 
ACAGTACCAA 
TTGCCAGTTG 
AAAACACGTG 
ATCTTGGTAT 
AAGACAACTG 
GATGCAGAGA 
GAGATCAAGC 
TCAGCAGTTT 
TCAACAGCTA 
GGTTTAACAG 
CCAGCAGTTG 
GACATGCAAG 
TGGGAAACCA 
ATTCTTAATA 
GTGGACAGTT 
GGTAATATTA 
GTTATTTCAA 
GCGTTGAAGA 
AAAGAGATGA 
CTATCTTTGG 
ATTCGTGCAA 
AATATTCTTT 
TGGGGGACAA 
CCTTATGGGA 
GCTGGAGAGC 
TTTGGGACGG 



ATACTCATGT 
TGTTTAACGA 
CTAACTCTAT 
CTCAATTGGT 
CAGGAAACTT 
GTAAATTAAT 
TCTATGACAA 
CTTCAAACTT 
AACTTATGAA 
TTTTGGACCT 
GTGTTTGTGA 
ATAAAAAATA 
GTCCATACAA 
ACTTTGAAGC 
ATGGGCTTGA 
CAGCGGTTGA 
ACTGAGTTTC 
TTGCATTGCC 
ATTTGGTTGC 
TCACCATTTA 
AAAGCATAGG 
TTACCTTACT 
TAGGACTT'iT 
TAGGCGGATC 
CGCATAATCC 
TTTCAAGTCT 
TTGTGTCTCG 
AGCCAACTTT 
GACTTATGAT 
AGGCAGTTGC 



3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 
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TGGGAATGCA 


ATCGGAGAAG 


TCTTGACCCA 


GTTTAACTAT 


ATGCCTGCCT 


TTGGCGTCGC 


5400 


TACGGCAACG 


GTCATGCTGT 


TGGCCCGAGC 


AGTTGGAGAG 


GATGATTGGA 


AAAGAGTTGC 


5460 


TAGTTTGAGT 


AAACAAACCT 


TTTGGCTTTC 


TCTGTTCCTC 


ATGTTGCCCC 


TGTCCTTTAG 


5520 


TATATATGTC 


TTGGGTGTAC 


CATTAACTCA 


TCTCTATACG 


ACTGATTCTC 


TAGCGGTGGA 


5580 


GGCTAGTGTT 


CTAGTGACAC 


TGTTTTCACT 


ACTTGGGACC 


CCTATGACGA 


CAGGAACAGT 


5640 


CATCTATACG 


GCAGTCTGGC 


AGGGATTAGG 


AAATGCACGC 


CTCCCTTTTT 


ATGCGACAAG 


5700 


TATAGGAATG 


TGGTGTATCC 


GCATTGGGAC 


AGGATATCTG 


ATGGGGATTG 


TGCTTGGTTG 


5760 


GGGCTTGCCT 


GGTATTTGGG 


CAGGGTCTCT 


CTTGGATAAT 


GGTTTTCGCT 


GGTTATTTCT 


5820 


ACGCTATCGT 


TACCAGCGCT 


ATATGAGCTT 


GAAAGGATAG 


GAAATGCAAA 


AAACAGCTTT 


5880 


TATTTGGGAT 


TTAGACGGGA 


CTTTATTGGA 


CTCTTACGAA 


GCGATTTTAT 


CAGGGATTGA 


5940 


GGAGACTTTT 


GCTCAGTTTT 


CTATTCCTTA 


TGATAAGGAG 


AAGGTGAGAG 


AGTTTATCTT 


6000 


CAAGTATTCG 


GTGCAAGATT 


TGCTTGTGCG 


GGTGGCAGAA 


GATAGAAATC 


TGGATGTTGA 


6060 


GGTGCTAAAT 


CAGGTGCGTG 


CCCAGAGTCT 


GGCTGAGAAG 


AATGCTCAGG 


TAGTTTTGAT 


6120 


, GCCAGGTGCG CGTGAGGTGC TAGCTTGGGC AGACGAATCA 


GGAATTCAGC 


AGTTTATATA 


6180 


TACTCATAAG 


GGGAACAACG 


CTTTTACCAT 


TCTCAAGGAC 


TTGGGGGTGG 


AATCCTATTT 


6240 


TACAGAGATT 


TTAACCAGTC 


AGAGTGGCTT 


TGTGCGGAAG 


CCAAGTCCAG 


AAGCGGCTAC 


6300 


CTATCTGCTA 


GATAAGTATC 


AGTTGAATTC 


TGATAATACT 


TATTATATAG 


GGGATCGGAC 


6360 


TCTGGATGTG 


GAATTTGCCC 


AGAATAGTGG 


GATTCAAAGC 


ATCAACTTTT 


TAGAGTCTAC 


6420 


TTATGAAGGG 


AATCACAGGA 


TTCAAGCGTT 


AGCAGATATT 


TCCCGTATTT 


TTGAGACTAA 


6480 


GTGATAAAAA 


GATTGTGTCA GTTTTGTGAC 


AGAGACCTAA 


CAAACTATTT 


CAAGTAACCT 


6540 


AGTTTGTTAC 


AAGGAATAGA 


CAGTTCTGTT 


AAATAGGCCC 


GAGAGGGCTT 


TTTTTCTACA 


6600 


TTTTTTGTGT 


TATGATAGAC 


AGGTACTCAT 


TTGAAAGGAA 


TTTGAAAGAA 


TGAAGAAAAG 


6660 


AATGTTATTA 


GCGTCAACAG 


TAGCCTTGTC 


ATTTGCCCCA 


GTATTGGCAA 


CTCAAGCAGA 


6720 


AGAAGTTCTT 


TGGACTGCAC 


GTAGTGTTGA 


GCAAATCCAA 


AACGATTTGA 


CTAAAACGGA 


6780 


CAACAAAACA 


AGTTATACCG 


TACAGTATGG 


TGATACTTTG 


AGCACCATTG 


CAGAAGCCTT 


6840 


GGGTGTAGAT 


GTCACAGTGC 


TTGCGAATCT 


GAACAAAATC 


ACTAATATGG 


ACTTGATTTT 


6900 


CCCAGAAACT 


GTTTTGACAA 


CGACTGTCAA 


TGAAGCAGAA 


GAAGTAACAG 


AAGTTGAAAT 


6960 


CCAAACACCT 


CAAGCAGACT 


CTAGTGAAGA 


AGTGACAACT 


GCGACAGCAG 


ATTTGACCAC 


7020 


TAATCAAGTG 


ACCGTTGATG 


ATCAAACTGT 


TCAGGTTGCA 


GACCTTTCTC 


AACCAATTGC 


7080 
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AGAAGTTACA AAGACAGTGA TTGCTTCTGA AGAAGTGGCA CCATCTACGG GCACTTCTGT 7140 

CCCAGAGGAG CAAACGACCG AAACAACTCG CCCAGTTGAA GAAGCAACTC CTCAGGAAAC 7200 

GACTCCAGCT GAGAAGCAGG AAACACAAGC AAGCCCTCAA GCTGCATCAG CAGTGGAAGT 7260 

AACTACAACA AGTTCAGAAG CAAAAGAAGT AGCATCATCA AATGGAGCTA CAGCAGCAGT 7320 

TTCTACTTAT CAACCAGAAG AGACGAAAAT AATTTCAACA ACTTACGAGG CTCCAGCTGC 7380 

GCCCGATTAT GCTGGACTTG CAGTAGCAAA ATCTGAAAAT GCAGGTCTTC AACCACAAAC 7440 

AGCTGCCTTT AAAGAAGAAA TTGCTAACTT GTTTGGCATT ACATCCTTTA GTGGTTATCG 7500 

TCCAGGAGAC AGTGGAGATC ACGGAAAAGG TTTGGCTATC GACTTTATGG TACCAGAACG 7560 

TTCAGAATTA GGGGATAAGA TTGCGGAATA TGCTATTCAA AATATGGCCA GCCGTGGCAT 7620 

TAGTTACATC ATCTGGAAAC AACGTTTCTA TGCTCCATTC GATAGCAAAT ATGGGCCAGC 7680 

TAACACTTGG AACCCAATGC CAGACCGTGG TAGTGTGACA GAAAATCACT ATGATCACGT 7740 

TCACGTTTCA ATGAATGGAT AAACCCGACT TGATAACATC ATTTTGACGA ATGAGATCTA 7800 

GCTTTCGTGA TGGAAAGCGA TTCTCGTTCG TTTTTTCTTT GTCATACTCT TCGAAAATCT 7860 

CTTCAAACCA CGTCAGTTTT ATCTGAAACT TCAAAGCTGT GCTTTGAGCA ACCTGCGACT 7920 

AGCTTCCTAG TTTGCTTTTT GATTTTCATT GAGTATCAAT TTGAATGGAA AATGGAAAGT 7980 

TATCATCTTG TAATGAGTTA AGCAACATTC TTGCAATCTA TTTTACTTTA TATCACAATT 8040 

AATTAGTCAA ATATTGATAA ATCAATAAAA AGAGAGGGGA AGAAATGCTA GAGATTCAAG 8100 

ATTTACTGTA TCAACTCCGC TTGTCTGAGC AAGCGAGTAC GCAATTGTTT GAAAAAAGGC 8160 

TTGGGATTAG TTTGACACGG TATCAGATTT TACTGTTTTT GCTGGAGCAT TCTCCTTGTA 8220 

ACCAAATGGC GGTTCAGGAG CGTTTGAAAA TTGATCAGGC TGCTTTGACA CGGCATTTCA 8280 

AAATTTTGGA AACGGAAGGT TTGGTGGAGC GTCATCGTAA TCCTGAAAAT CAGCGGGAAG 8340 

TGTTGGTAGA GGCTGCGAAG TATGCCAAGG AGCAGTTAGT GGTGAATCCC CCTCTGCAAC 8400 

ATATCAGGGT TAAGGAAGAG ATAGAAAGTA TCTTAACAGA GTTTGAGAGA ACAGAACTCA 8460 

GCCGTTTATT AAATAAATTG GTTTTGGGTA TTGAAAATAT AGAAATTTAA GGAGAAATAG 8520 

ATGTCAATTA TTTTAACAAC GATCGTTGCT TTGGAGCATT TTTACATTTT TTATTTGGAA 8580 

AGTATTGCCA CGCAATCAGA TGCGACTAGT CGTGTATTTA ATATGGAAAA GGAAGAATTG 8640 

GCTCATCCGT CAGTAAGTTC ATTGTTCAAA AATCAAGGAA TTTATAAGGC TCTGCTAGGA 8700 

GTCTTTCTCT TGTATGTCAT TTATTTCTCA CAGAATTTAG AAATTGTGAC TATTTTTGTC 8760 

TTATTTGTGA TTGGTGCTGC GACTTACGGC TCTTTAACAG CGGATAAAAA AATTATTTTG 8820 

AAACAAGGTG GATCAGCTAT TTTGGCCTTG ATTAGTATTT TACTCTTTAA ATACACTTGA 8880 
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AGGTCGATTC TAATCTCGCT AATCCTTTTT AATCCAGAAT AAGGGAAATA TGTTATACTT 8940 

GTTTTTAAGA AAAAAGTCTC ATTGAATTGG TTTTGAGGAG TTAGAAATGA AAGTATTAGT 9000 

GACAGGTTTT GAGCCCTTTG GAGGGGAAAA GGGCAATCCA GCTTTGGAGG CCATTAAAGG 9060 

TTTACCAGCT GAAATCCATG GTGCTGAGGT CCGTTGGCTA GAGGTGCCGA CAGTTTTTCA 9120 

CAAATCTGCT CAAGTATTGG AAGAAGAGAT GAATCGTTAT CAACCTGACT TTGTCCTTTG 9180 

.TATTGGGCAA GCTGGTGGAA GAACTAGTTT GACACCTGAA CGAGTGACCA TTAATCAAGA 9240 

CGATGCATGC ATTTCTGATA ACGAAGATAA TCAACCGATT GACCGTCCCA TTCGCCCAGA 9300 

TGGTGCTTCG GCCTACTTTA GTAGTTTGCC GATTAAAGCG ATGGTTCAAG CTATAAAAAA 9360 

AGAGGGCTTA CCGGCCTCTG TTTCCAATAC GGCAGGGACT TTTGTCTGCA GCCATTTGAT 9420 

GTATCAGGCT CTCTATTTGG TAGAAAAGAA ATCTCCATAT GTTAAGGCAG GTTTTATGCA 9480 

TATTCCTTAT ATGATGGAAC AGGTGGTGAA CAGACCGACT ACTCCAGCTA TGAGTTTAGT 9540 

GGATATTCGG CGAGGGATAG AAGCAGCAAT CGGCGCTATA ATAGAACATG GAGATCAGGA 9600 

ACTCAAGTTG GTAGGCGGAG AAACTCATTG ATAGAAAAAA GCTTGAGGGG AAAAACCTTC 9660 

AAGCTTTTGG ACGTTTTCGG GCCAATACTG CTCGGTAAAA CATAATTTTA GTGCATTGGA 9720 

TATAAGGTAG GAGTGAAAAA CTAGCAATGC CAAAGGTAAT CCAATTGAGG AAGTACCAAG 9780 

GAAGAAGCTG TAAATCTAGG ACAAAGTGCT GGAACTTGTA GCCCTTCATA AAGGAACGGC 9840 

TAGTTTTTAG GATTCGTCTT GGTGGGACCT GTCCTAGGTC TAGACTATAA CAGAGAAGAA 9900 

ATTCCACCTG TGAATAGGCA TAATACTGTG GAATATAGAG GATATTTCCT ACAATGATCA 9960 

AGATGAGACT TGCAAGAAAG TAGAGTCCAA AGACCATGAG GAAACGCTCG GTTTCAACTG 10020 

ATGAGAGATC TAGATTTGGA AACTCAGGAT GTAGGGTGAC GAATTTTTTG GCTAAAAAGC 10080 

TACTATAAAA GAGGAGGTAA ATCCCAAGTA AATTAGGGAT ACTCCATAAA AAGAGATAGA 10140 

AACGTTTGAG AAGTAGGGTC AAAAAGGTTT GAGAAAAGCG CTCCTCATCA AAGAGAGCTA 10200 

GGCTGTTTTT TACAGATGGC TCCGTTTTAG AATCTTTCAT GAGTGTCAGT GTTGCATAGA 10260 

CGGAACTGGT CAAAAGAATA GTCCCGATAA AGGAGACTAG TAGAGGAAAG AGGTAGGTTT 10320 

GAAGTATTTG GCCAAGTATG CTGAAAAATG GCTGTTCTAA AACAGTCCCG TGGATCCGAG 10380 

ATAAGGGATT AAGAAAACCA GATAAGATGA CCAGCATACT GGGAAGGATA TAGAGGAGAA 10440 

AGAGACGGGG GGTGTCAGCC TGAAAATGTT TTGACTCCTG ACGAATTGTT TTTAAATCAA 10500 

TTTTTGGATA GTTCATTCTC TTATTATACC ATAGTTCTTA TACATAGTTC GTGACAGTTC 10560 

CTACTTTTTT TGATAAAATC ATACAGTGTG TCCTTGGGCA CACTGTATGA ACTGGGACTG 10620 
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TCTTTCCCAG CTTCGGAGGT AAAAAATGTC AGATTCACCA ATCAAATATC GTTTGATTAA 10680 

GAAAGAAAAA CACACAGGAG CTCGTCTGGG AGAAATCATC ACTCCCCACG GTACCTTTCC 10740 

GACACCTATG TTTATGCCAG TTGGGACACA AGCCACTGTC AAAACTCAGT CACCTGAAGA 10800 

ATTGAAGGAG ATGGGTTCGG GAATTATCCT ATCAAACACC TATCATCTCT GGCTTCGCCC 10860 

TGGAGATGAA CTCATTGCAC GCGCTGGTGG TCTCCACAAG TTCATGAATT GGGACCAGCC 10920 

TATCTTGACA GATAGTGGTG GTTTTCAGGT TTATTCTTTA GCAGATAGCC GTAATATCAC 10980 

AGAAGAAGGA GTAACCTTTA AAAATCATCT AAATGGTTCT AAGATGTTCC TATCCCCAGA 11040 

AAAAGCCATC TCTATTCAGA ATAATCTGGG TTCAGACATC ATGATGTCCT TTGATGAATG 11100 

TCCTCAGTTT TATCAACCTT ATGACTACGT TAAGAAATCG ATCGAGCGTA CCAGCCGTTG 11160 

GGCTGAGCGT GGTTTGAAGG CTCACCGTCG TCCACATGAC CAAGGTTTGT TTGGAATTGT 11220 

GCAAGGTGCA GGATTTGAAG ACCTTCGCCG CCAATCAGCT CATGATCTTG TCAGCATGGA 11280 

TTTCTCAGGC TACTCTATCG GTGGTTTGGC AGTGGGAGAA ACCCATGAAG AGATGAATGC 11340 

GGTCTTGGAC TTTACAACTC AACTGCTGCC TGAAAATAAA CCTCGTTATC TGATGGGTGT 11400 

GGGAGCGCCA GATAGCTTGA TCGATGGGGT CATTCGTGGG GTGGATATGT TTGACTGTGT 11460 

CTTACCGACT CGAATTGCTC GTAACGGGAC TTGTATGACC AGTCAAGGAC GTTTGGTTGT 11520 

GAAAAATGCC CAGTTTGCTG AGGACTTTAC GCCACTGGAT CCTGAGTGTG ATTGCTACAC 11580 

ATGTAATAAC TATACACGCG CTTACCTTCG TCACCTGCTC AAGGCTGATG AAACCTTTGG 11640 

TATCCGCTTG ACTAGCTACC ACAATCTTTA CTTCTTGCTT AACCTGATGA AGCAAGTGCG 11700 

ACAAGCCATC ATGGATGACA ATCTCTTGGA ATTCCGTGAG TATTTTGTGG AAAAATATGG 11760 

CTATAATAAG TCAGGACGTA ATTTCTAAAA TGGAATTGAT ATAAAAAAAT CCTAAGTTTT 11820 

CTCTTAGGAT TTTTCTTCTT TTTTTGATAG AATAAAGTGT ACAATGAAAG GAAGAATAAA 11880 

CTCGTATGCG CATTAAATGG TTTTCCTCGA TTAGG 11915 
(2) INFORMATION FOR SEQ ID NO: 97: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9069 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

GAGAGGGCAA CAGTTCTATC GCTTCAAATT TTTTCTTGGT TTGCAGATAT TCAAGAATCG 60 

GGAGTTTTTC TATAGTATTC GGCAGATTTA TTACAGCCAA GCATCTCAAA AATACGGACA 120 



WO 98/18931 



PCT/US97/19588 



741 



GCATCCTCCA TCTTTTTCTG CCCTTCCTTG ACTCTACCTT GCTTGCTATC AAGGAGACCT 
TCTGCCCACA GATAAACAAT TCGGAAATAG GTCTCATTTT CCTTGTAGAA ATGCTCTTCG 
ATAACACGTT TAAAATAATA GGCATTGGTA AATTCTTCAC ACTCAATACT AGCTAAAAAG 
CCATTCAATA GTATAGTATG AAAAAGGTTT CGATTGCCAG ACATTTCCAT TAGAAAATCA 
GATTTACGTA CCATTTCTCG TACATATCTA GTAAAAAGAG AAACAGATAA AAATGGAGAA 
CTGACTGAAA ATAAATTGAG TTCATAGATT CCCCAGATCT CGGTAGAAAA CAAATAATCA 
TGAAGGACTT TTCCTTCCTC TGCTGTTAAG TCTACCCTTT CATCTATGCT CTTCATATAA 
GACTTGATAA TAATGGCATT TAGAATATGT TTCTGTTTGT TGTGAGAATG GGCATGCTTT 
TATACTCCCT GCGATATAAG TCCTCAAGAG GTGCTATATT CTTTGGTTCC AAGACATCTG 
TAATTTCTTT TCTCAACTCA GAATCTGTAT CATACTGGAA ACCTCTTGCC AGAAAGAGGA 
TCTCCTCCAC ACTGGCAGAT ATATTTTCCA GAGCAAATAG AAACTTTTCC ACCGAAAGCT 
CACTCTGACC TGTTTCAAAA CGGGACAACA TAGACGGCGA AAATTGTCCT CCGGTTGCTT 
GTCTCAGTGA GATATTTCTT GACTCTCGTA ATTGTCTAAA GACTTTTCCA ATCTGCTCCA 
TAGACTTCCC C^TTCCC TA^TC^C ATTTTATCAT ATTTTTCAGA AAATTCATCA 
AAAACTTGCC AAATTGTCAG AATTATGAGA AAATAGAGGA TATTTATCAC GTGGAGGGAC 
TGCTATGAGA GACGATATCA AAATCAATGA CCGTGCTTTG GCCTTGCAAG ACCAAATTAT 
CGAAAAACTA GAGAAAGTTT TTGATACAGA TGTGGAATTG GATGTTTACA ATCTAGGTCT 
GATTTATGAA ATCAATCTGG ATGAAACGGG GCTCTGCAAG ATTGTCATGA CCTTCACCGA 
TACTGCCTGT GATTGCGCCG AAAGCCTGCC TATTGAAATC GTGGCAGGTC TGAAACAAAT 
CGAGGGTATC AAAGATATCA AGGTTGAAGT TACCTGGTCG CCTGCTTGGA AAATCACACG 
AATCAGTCGC TATGGCCGTA TTGCCCTTGG ACTACCACCT CGTTAAGCAG ACCAATCACT 
TTTAAAGATG AAAATCAAAG GGCAAACTAG AAAACTAGCC GCAGGTTGCT CAAAACACTG 
TTTTGAAGTT ATGGATAGAA CTGACGAAGT CAGCTCAAAA CACTG™ AGGTTGTGGA 
TAGAACTGAC GAAGTCAgCT CAAAACACTG TTTTGAGGTT GTGGATAGAA CTGACGAAGT 
CAGCCCAAAA CACTGTTTTG AGGTTGTGGA TAGAACTGAC GAAGTCAGTA ACCATACCTA 
CGGCAAGGCG ACGTTGACGT GATTTGAAGA GATTTTCGAG TATGAGTTTA TTTTTTACCT 
GACTTGTCCA TATTCCAGAA GTCTGTCACG GCTCCGCGTG AAGCAGATGA TACGATGTGG 
GCATATTTAC CGAGGACACC ACGGCTGTAA AGTGGTGGCA AGGTTGTTTC TGCCTTGCGT 
TTTTCAAGTT CTTCTTCGGA TACGGCCATA GAAATTTCTT TGGTATCTTG GTCAACCGTA 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 



WO 98/18931 



PCT/US97/19588 



742 

ACGATATCGC CGGTACGGAG ATAGGCAATT GGTCCACCAT CCTGAGCTTC AGGAGCGATA 1920 

TGTCCAACAA CCAGACCATA AGTACCACCA GAGAAACGTC CGTCCGTCAA GAGGGCCACC 1980 

TTATCTCCCT GACCTTTACC AACAATCATT GAAGAAAGTG ATAGCATCTC AGGCATACCA 2040 

GGACCACCTT TAGGTCCAAC AAAACGAACA ACGACTACAT CGCCATCAAC GATTTCATCT 2100 

GTCAGAACGG CCTGAATCGC ATCTTCTTCT GAGTCAAAGA CCTTAGCTGG CCCAACGTGA 2160 

CGACGCACTT TAACACCTGA TACCTTGGCA ACTGCACCGT CAGGAGCAAG GTTCCCGTTC 2220 

AAGATGATAA GCGGACCATC CGCACGTTTT GGATTTTCAA GTGGCATGAT AACTTTTTGG 2280 

CCTGGAGTCA AGTCTGCAAA GTCAGCCAAG TTTTCAGCTA CAGTCTTACC AGTACATGTG 2340 

ATGCGATCTC CGTGAAGGAA ACCATTTGCC AACAAATACT TCATAACCGC AGGGACACCA 2400 

CCGACTTCGT AGAGGTCTTG GAAGACATAC TGACCAGATG GTTTCAAGTC GGCCAAGTGA 2460 

GGCACACGTT CTTGAATCGT ATTGAAGTCC TCAAGTGACA AGTCAACATT TGCGGCATGG 2520 

GCAATGGCGA GCAAGTGAAG AGTGGCGTTT GTAGAACCAC CGAGAGCCAT CGTTACAGTG 2580 

ATAGCATCTT CAAAGGCTTC ACGAGTCAAG ATATCTGATG GTTTGAGACC AAGTTCCAAC 2640 

ATCTTAACAA CAGCACGTCC TGCTGCTTCG ATATCTTCTT TCTTATCAGC TGATTCAGCT 2700 

GGGTGAGAGG ATGACCCTGG CAAACTCATC CCTAGAACTT CGATAGCAGT TGCCATGGTA 2760 

TTAGCAGTAT ACATACCACC ACAACCACCA GGGCCAGGGC AGGCATTACA TTCAAGACGT 2820 

TTCACGTCCT CAGCTGTCAT GTCACCGTGG TTCCATTTTC CGATACCTTC AAAGACAGAA . 2880 

ACCAAGTCGA TATCTTTACC ATCAAGATTT CCCGGTGCAA TAGTTCCACC ATAGGCGAAA 2940 

ATAGCTGGGA TATCCATATT AGCAATAGCA ATCATAGATC CAGGCATGTT CTTGTCACAG 3000 

CCACCGATAG CGACGAAGGC ATCCACGTTG TGACCACTCA TAGCCGCCTC GATGGAGTCC 3060 

GCGATGATGT CACGAGATGT TAGAGAGAAA CGCATACCAG GCGTTCCCAT AGCGATCCCG 3120 

TCCGCTACGG TAATGGTTCC AAACTGTACA GGCCAAGCGC CTGCAGATTT GACACCTTCT 3180 

TTAGCCAGTT TCCCGAAATC ATGCAAGTGA ATGTTACATG GTGTATTTTC CGCCCAAGTC 3240 

GAAATCACTC CCACAATCGA TGTTTCAAAG TCCTTATCTG TCATACCAGT CGCACGAAGC 3300 

ATAGCACGGT TAGGTGATTT AACCATGCTG TCATAAATGC TACTGCGGTG ACGTTTATCT 3360 

AATTCAGTCA TCTTATCCCT CCCATTTCAG TTTTTACTAT TATAGCACAA TTTTCGCATG 3420 

AAGAACAGAA TAAAATTCTT GAATTTTCAG AAAATTCTAT ACACATGTGA AATATTTAAA 3480 

ATTAAAAACA ACAAAGCGGA TTAGTGCACT TTCTGATGAC CAGAATATGC TTTTTAATCC 3540 

GCTTTCTTTA AATAACGTAC TGTAATTTTT ACAGAAATTC TTTCAAATAA GTGTATTTAA 3600 

CATCTATCTT GCATTATAAA TTTCTAGAAC CTTCTCTTTT ATATTCGATT CACTCAAACC 3660 
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ATACTCATTA AGAAGATAAT CCATTTTCCC TACTTGACCG AATCTTTCTT GAACACCCAT 3720 

CCGATGAATT TTTGTTATTC CATCATCAGA GAATAATTCA CATAAAGCAC TGCCAATTCC 3780 

ACCTATCTGA TTGTGGTTTT CTACAGTAAA TATAGTTTTT CCACTTAACA TTGTTTTTAT 3840 

CTGTTCTGGT ATCGGTTTGA TTCTAAATAA ATCTATCACA CCTACTGAAT AACCTAATTT 3900 

AGACAGTTCA TCTGCAACTC GAATACTTGG AGCAACCATT ATGCCAGAAG CAACGATTAC 3960 

.AAGATCTTCA CCATGCCTTA ACTCAATGTA GCCTTTAGAA AAATCTTCTC CACCTTGATA 4020 

CACAGGAACT GGAGCTTTTC TAATTGTTCG AATATATTTT AGTCCTTTTA AGTCTAATGT 4080 

CTGGTTCAAT ATTTCACGAA ATTGGATATC ATCAGTTGCT TCGAAAATGA TTGATTTAGG 4140 

AATTAAACGT AACAATCCAA TTTCTTCAAA TGGCATATGT GTTCCACCAT TCATCTCTGC 4200 

CGTTACTCCT GCATCTGATC CAATCACAGT GGCATCCAAT TGTGCGTATC CAAGAGAAAT 4260 

AAATAATTGA TCAAATACTC TTCGTGAAGC AAAAGGACCA AATGTATGAA GATAAGGTCT 4320 

AAACCCCTGA ATAGACAAGC CTGCTGCAAG GCCGACCATT TCTGCTTCCA TAATCCCAAC 4380 

ATTCACATAA CGGTCTCCAA AGTCCTTTTC AAGATTATTA GTAGCCATCG AACTTGACAA 4440 

ATCGGCTTCT AAGACTACTA TATCAGAATC ACTTTGATTA GCCTCTAAAA GGAAGTCTCT 4500 

ATATACATGC CGTAATTCTT TCGTACTTCT CATCATTCTG TTTCCTCCAA TTCCTGACTT 4560 

AATCTTTCTA CAACTGAAGT TAACATTTGT TTCTCCTCTA CAGTAGGGCG AAGATGATGA 4620 

TTGGATTTCA TTTCTTCCAG CTCTTGAACC CCTTGACCTT TAATAGTATC TAATACAATG 4680 

CACTTAGGTG ATGAATTATT TGACTGTTTT AATTGGACAA TCCCTTCATA AATTTCTCTA 4740 

ATATCTGAAC CCTTGACCCT AATGGATTCA AATCCAAATG CTGAAAATTT TTCTACGAAA 4800 

TCACCTGGAT TACAAATATC CTTTGTAAAA CCATCTAATT GTTTTTTGTT ATCATCAACA 4860 

AATACAATTA AGTTGGATAA CTGTTGATGA GAAGCAAACT GTATAGCCTC CCAACATTGT 4920 

CCCTCATTTA ACTCACCATC TCCAACAATA GCGTAAGTAT AAAAGGGACT CTTTCTTATT 4980 

CTCTGACCAT ATGCAAGTCC AGTTGCAACA CTAATTCCTT GTCCTAAAGA GCCCGTTGTC 5040 

ATATCTATGC CTGGCGTTAG ATTTCTATCA GGATGAGACG GTAATTTGGT TCCATTTGTA 5100 

TTTAAAGAAT ATAAGAATTC TTTGTCAAAG AAACCATTCA AATAGAGTGT ACTGTATAGA 5160 

GCTGGTCCTC CGTGACCTTT TGATAATATG AAATAATCTC TATCTCGTGC TGCAAATATT 5220 

TCTGGAGTCA TTGGCATTAT TTCACCATAA AGCACCGCTA AAACTTCTAC GATAGACAGA 5280 

CTTCCTCCGT AATGTCCGAA TCCAAGATGA TTCAATGTTC TAAGAGTATT TAATCGGATG 5340 

TTAGTCGCAA ATTTTCTTAA CCCATCTTCT CTATTTTTAC TTAAAATCAT CCCTTATTCC 5400 
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TCCGTTGCAG ATGGCTTTTT AATAAAGGAT ACTCCAAACA TAACTGCTAG AATAAGAACA 54 60 

AGACCAATCA CAATGCCTGC TTGTGAGCCA AATTGATTTA ACATTCCTAA AATAATTCCT 5520 

GATAGACCAA AATCTGCATC TGAGAAAGTT GATCCTTGGA AACCAAGTCC TCCCAAAACT 5580 

GGCATTAAAA AGACTGGAAG AAAACTGATT AAAATACCTT GTAAAAATGC TCCAATAGTG 5640 

GCTCCACGAA CACCACCAGA TGCATTCCCA ATGACACCTG CAGTCGCTCC ACAGAAGAAA 5700 

TGAGGCACAA CACCTGGTAA GATAACAACC GTTCCTGAAG CAATCATAAT TACCATACTT 5760 

ACTAAACCAC CAACAAAACT AGAGATAAAT CCAATTAGAA CTGCATTGGG TGCATAAGTA 5820 

TAAACAATCG GACAATCCAA AGCAGGTTTT GAATTAGGTA CAAGACGCTC TGAAATACCT 5880 

TTAAAGGCTG GAACAATTTC GCCCAAAATA AGGCGAACAC CTGCTAAAAT AACAAATACC 5940 

CCTGCTGCAA ATTGACCTGC TAATTGTAAA GCATAAACTA GACCACTTGT ACCACTACTG 6000 

ATTTCTTTTT CTATATATTC TGACCCTGCA AAGATAGCTA CAATAATGTA AATAACTGCC 6060 

ATGGATAAAG TAATACTAAC AGTACTATCA CGTAAAAAAG CTAAACTCTT TGGAAATTTA 6120 

ATGTCCTCTG TTGATTTTGA TTTGTCACCG ATAAGGCTAC CAGTAAAACC ACTCAACCAA 6180 

TATCCCAAAG AACTGAAATG ACCTAAAGCT ACCTTGTCAT TTCCAGTTAA TTGAACCATA 6240 

TATTTTTGCA CAAATGCTGG GGAAATACTC ATAATAATAC CGAGTGCTAA TCCTCCTAGT 6300 

AAGATGAGAG GCAAGCTAGT AAAGCCAGCA ACTGATAAAA TGACCGCAAT CATACATGCC 6360 

ATATATAGAG TGTGGTGCCC TGTTAAAAAA ATATATTTAA ATCGAGTAAA ACGAGCGATT 6420 

AAGATATTGA ACACCATGCC TGCAAACATA ATCATTGCAG TAGCTGAGCC ATATGTTGTT 6480 

AAAGCTACAG CTACAATTGC TTCATTATTC GGCACAACGC CAGATAAATG AAAAGCATGC 6540 

TCAAACATGG TACCAAATGG ATTTAAAGAA TTTTGTACAA TTCCTGCACC ACCAGATACA 6600 

ACTAAGAAAC CAACAAAGGT CTTAATTCCA CCTTTAATAA TATCAGGTAA TTTCTTCTTC 6660 

TGAAGAACTA ATCCTAAGAT TGCAATTAAA GCTACTAAAA TAGCTGGTGT ACTAACAATA 6720 

TCCAATATGA ACTTCATCAT GACGCTAGCC TCCTATATAA GTCCTTTTTC TTCACAAAGT 6780 

TTAGTAATTA ATTCTCGTAG TTCATCCATA TCAATAATAC TATTTAAGAT ACGAACATCT 6840 

CCAAGATGAC TAGCTGAATC AGCTAGATCA CGACCAACAA TCCAAATATC AGCTGCATTT 6900 

GGATCTGCTC CACCTAAATC ATAATGTTCA ACTTCTACAT CCGAAACATT CAAATCACTC 6960 

AATACAGATT CAATATTCAT CTGTACCATA AAACTTGAAC CTAATCCTGA ACCACAAGCT 7020 

GTACCAATTT TTAACATTAT CTAATCCTCC TGTTTAATTA TCATTTTAAT GTCATCATAG 7080 

TTTTTTGATG ATATTAAAGT TTGAACATGA TTTTTATCTC TTAAAATTGT TGTTAAATGT 7140 

GACAAAGCCT TTAAATGACT CTCATTATCA ATGGCTGCAA TACAAATCAA CAATCTTACC 7200 
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TCTTGTTCTG 


GATTATCCAA 


TAAATAAATC 


GGTTCTTCCA 


AAACTAACAT 


TGACATTCCT 


7260 


ATTTCATTCA 


CACCTTCATC 


TGGCCGAGCG 


TGAGGAATTG 


CTACTCCCTT 


C CCT AAATT A 


7320 


ATAAAAGGTC 


CAAACTCTTC 


TACTTTTTGA 


ATCATTGCCT 


CAGGGTAGTT 


CTCAGTTATC 


7380 


TTATCTTGAT 


CCAAAAGCGG 


TTTAGCTGCT 


AAACGAATCG 


CCTCCTTCCA 


TCCTAATTTT 


7440 


TGCGAACTAA 


CCTGATAGGT 


TTCTTTGGTA 


ATAAGTTGTT 


CTAGCACTGG 


TACAATTTCC 


7500 


TTTCT ATC AT 


TTTTTTGGTA 


AAGATAATTC 


TTTAACGCCA 


ATCTTAATTC 


CAATTCTTGT 


7560 


GTAATAATTC 


CATATCTTTT 


GACAATATTC 


AGGATTTGTT 


CAATCTCAAA 


ATCTCCATAC 


7620 


TCTAAATTCG 


GAAAATCTTT 


TAACACTAGT 


TCTACTAGTT 


GTATTGCTTG 


CTCTTCAGTC 


7680 


ATCATAACCG 


AAACTAGATA 


ATTTGGCTTT 


TCTGTCTCCA 


CCTTTATGGT 


AGAAAAAACC 


7740 


ATATCATAGT 


CACTACTAGC 


TTTCACCTGT 


AAATCATCAA 


TCTTTGAGGT 


TCCTATAAAC 


7800 


TCAATTTGAG 


GAAATAATGC 


TAATAGATTC 


TCTTTTAACA 


TCAATGAAGA 


ACTAACACCA 


7860 


TTAGGACAAA 


TGATTGCTGC 


TTTATACCAT 


TTTTGAGGCA 


AAGTATCTGC 


TTTCTTTAAA 


7920 


TAACCTCCGA 


AATGGATAAC 


AAAATATGCT 


GTTTCACTAT 


CAGGTATGGG 


ATTGTCAATA 


7980 


GCGTCCATCA 


AGGGCATCAA 


AGAATCTTTG 


ACTAATTCAA 


ATAAATCAGG 


ATAATGTTCT 


8040 


TTAACATGCA 


ATACATATTC 


ATTTGAACTA 


GGTAGGCCGA 


ACTTTAATCT 


ATAGTAAGCC 


8100 


GGTATAAGGT 


GGCGGCGAAG 


ATTTTCTCTC 


AATCCTTCCC 


TTTGTTTAAA 


ATGTAACAAA 


8160 


GAAATATCTT 


CCATTCTACT 


TATAATAGCC 


TCTGTTAATT 


GATTAAAGTA 


AACCGGAGCA 


8220 


ACATCTACTT 


CACCTTCAAA 


GCAACTTGAT 


AATAAAACGG 


TGATATAGCG 


ATAATCATCC 


8280 


TCAGAAAACA 


CCGTATCTAT 


AATTCCCAAA 


TCAACCACTG 


TATCCAATAA 


AATAGTGGTT 


8340 


ATATCTTGAA 


TAACAGGAGA 


TACTAATGTC 


TCTGAAAGAC 


ATACTCTTTC 


AACATCCCTT 


8400 


TGATACCTAC 


ACAGAATGAA 


TACTAAACCG 


AAAAGGTAAA 


CTTTTAATTG 


ATTAACAATA 


8460 


GGTACTAGCT 


GTAGCTTCTC 


ATAATAATCT 


TTAACTACCT 


GATCAATCAA 


ATCATAAGTT 


8520 


AATGAATACC 


CCCAACTGGA TAAAACATAA 


TCCAAACCCC 


AAATCCCTAT 


GGAGGATTCC 


8580 


AGCAACTCAC 


TAACCATTTG 


AAAAGCTAAG 


CGGTGCTTAT 


TCCACTCTGA 


ACCGTGTAAA 


8640 


GTATAACCTT 


TTGCTCTACT 


GTACCCTAGC 


TCCAAATCAT 


TATCTAACAT 


AATCTTTCTT 


8700 


AATGATTGAA 


TATCAGATAA 


GGTTGTATTC 


TTACTTACTT 


TCAAAAAGTC 


TTGGTAATGA 


8760 


CTATTCGATA 


TAAAATCTAA 


TCGGCAAAAA 


GTGTAAAGAT 


AGATTAAAGC 


TAAGCGAGTC 


8820 


GACTTTGGTA 


AAACCAATTC 


ATCCGACTTA 


ATAATATCTG 


TCAAAGACTG 


CTTCGTACGA 


8880 


TTTGATAAAC 


TATAGCGACC 


TTGCTTTTTA 


TCCAGCACTA 


TCCCTTTATT 


AGCTAGATAA 


8940 
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GGCACTAAAT AATCTATTCC TTCTTTGACT TCCTTTATAG GTAAGCTCAC CTTAACAGAT 9000 
AATTCATATA ACGATAGCTC ACAATGATCC ATCAAAGTCA TCAAAATAAC TAGTGCTCTA 9060 



TAATCAAAC 

(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8654 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



9069 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

CGAGACAACA AGATGAAGAA AAATTTGCCC TATCGTTTGT GGCGCTTGCA AGTGTAGCAC 60 

TTCTTGCAGC CTGTGGAGAA GTGAAGTCTG GAGCAGTCAA CACTGCTGGT AACTCAGTAG 120 

AGGAAAAGAC AATTAAAATC GGGTTTAACT TTGAAGAATC AGGTTCTTTA GCTGCATACG 180 

GAACAGCTGA ACAAAAAGGT GCCCAATTGG CTGTTGATGA AATCAATGCC GCAGTGGTAT 240 

CGATGGAAAA CAAATCGAAG TAGTCGATAA AGATAATAAG TCTGAAACAG CTGAGGCTGC 300 

TTCAGTTACA ACTAACCTTG TAACCCAATC TAAAGTATCA GCAGTCGTAG GACCTGCGAC 360 

ATCTGGTGCG ACTGCAGCTG CGGTAGCGAA CGCTACAAAA GCAGGTGTTC CATTGATCTC 420 

ACCAAGTGCG ACTCAAGATG GATTGACTAA AGGTCAAGAT TACCTCTTTA TTGGAACTTT 480 

CCAAGATAGC TTCCAAGGAA AAATTATCTC AAACTATGTT TCTGAAAAAT TAAATGCTAA 540 

GAAAGTTGTT CTTTACACTG ACAATGCCAG TGACTATGCT AAAGGGATTG CAAAATCTTT 600 

CCGCGAGTCA TACAAGGGTG AAATCGTTGC AGATGAAACT TTCGTAGCAG GTGACACAGA 660 

CTTCCAAGCA GCCCTTACAA AAATGAAAGG GAAAGACTTT GATGCTATCG TTGTTCCTGG 720 

TTACTATAAT GAGGCTGGTA AAATTGTAAA CCAAGCGCGT GGCATGGGAA TTGACAAACC 780 

AATCGTTGGT GGTGATGGAT TCAACGGTGA GGAGTTTGTA CAACAAGCAA CTGCTGAAAA 840 

AGCATCAAAC ATCTACTTTA TCTCAGGCTT CTCAACTACT GTAGAAGTTT CAGCTAAAGC 900 

TAAAGCCTTC CTTGACGCTT ACCGTGCTAA GTACAATGAA GAGCCTTCAA CATTTGCAGC 960 

CTTGGCTTAT GATTCAGTTC ACCTTGTAGC AAACGCAGCA AAAGGTGCTA AAAATTCAGG 1020 

TGAAATCAAG AATAACCTTG CTAAAACAAA AGATTTTGAA GGTGTAACTG GTCAAACAAG 1080 

CTTCGATGCA GACCACAACA CAGTCAAAAC TGCTTACATG ATGACCATGA ACAATGGTAA 1140 

AGTTGAAGCA GCAGAAGTTG TAAAACCATA ATAGAAAAAT GTTGAAATAG GGAATGAGCC 1200 

TTTGACTCAC TCCCTGTTTC GATATTTAAT ACTCTTCGAA AATCTCTTCA AACTGCGTCA 1260 
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ACGTCGCCTT 


GGATTATATA 


, TGTGACTGAC 


TTCGTCAGTC 


TTATCTACAA 


CCTCAAAGCA 


1320 


GTGCTTTGAG 


CAACCTGCGG 


CTAGTTTCCT 


AGTTTGCTCT 


TTGATTTTCA 


TTGAGTATAA 


1380 


GAACCTATCA 


AAAAGTGAGG 


GAAAACCCTC 


GGAATTATAA 


ATAGAAAGAG 


TGAATCTTAT 


1440 


GCTCCAACAA 


CTCGTAAATG 


GTTTGATTCT 


AGGTAGTGTT 


TACGCGCTGT 


TAGCCCTAGG 


1500 


ATATACCATG 


GTTTACGGAA 


TTATCAAGCT 


CATCAACTTC 


GCCCATGGTG 


ATATTTATAT 


1560 


GATGGGAGCC 


TTTATCGGTT 


ATTTCTTGAT 


CAATTCTTTC 


CAAATGAATT 


TCTTTGTAGC 


1620 


GCTTATTGTA 


GCTATGCTAG 


CGACAGCTAT 


TCTTGGTGTC 


GTGATTGAGT 


TTCTTGCTTA 


1680 


CCGACCTTTG 


CGCCACTCTA 


CTCGTATTGC 


TGTTTTGATT 


ACGGCTATTG 


GGGTTTCTTT 


1740 


CCTATTGGAG 


TATGGAATGG 


TCTATCTGGT 


TGGTGCCAAT 


ACCCGTGCCT 


TCCCTCAAGC 


1800 


GATTCAAACA 


GTTCGATATG 


ATTTGGGACC 


AATTAGCTTA 


ACAAATGTGC 


AGTTAATGAT 


1860 


TTTGGCCATT 


TCCTTGATTT 


TGATGATTTT 


GTTACAAGTC 


ATTGTCCAAA 


AGACTAAGAT 


1920 


GGGGAAAGCC 


ATGCGTGCAG 


TATCAGTAGA 


TAGCGACGCG 


GCGCAATTGA 


TGGGGATCAA 


1980 


TGTAAACCGT 


ACGATTAGCT 


TTACCTTCGC 


TTTGGGTTCT 


GCTCTTGCGG 


GTGCGGCTGG 


2040 


TGTTCTGATT 


GCTCTTTATT 


ATAACTCTCT 


TGAGCCTTTG 


ATGGGGGTTA 


CTCCAGGTCT 


2100 


TAAATCTTTC 


GTTGCCGCAG 


TACTTGGTGG 


TATCGGAATT 


ATTCCTGGTG 


CGGCTCTTGG 


2160 


TGGCTTTGTG 


ATTGGTCTAT 


TGGAAACCTT 


TGCGACTGCC 


TTTGGGATGT 


CAGATTTCCG 


2220 


TGATGCCATT GTTTATGGAA TCTTGTTGTT GATCTTGATT GTCCGCCCAG 


CTGGTATCCT 


2280 


TGGTAAGAAT 


GTGAAAGAGA 


AGGTGTAAAC 


GATGAAGGAA 


AATTTAAAAG 


TTAATATTCT 


2340 


ATGGTTACTC 


CTTTTGTTAG 


CTGGCTATAG 


CTTGATTAGT 


GTACTGGTTT 


CAGTCGGAGT 


2400 


ACTTAATCTA 


TTCTATGTAC 


AGATTTTACA 


ACAAATTGGA 


ATTAATATTA 


TTTTGGCTGT 


2460 


TGGTCTCAAC 


TTAATCGTTG 


GTTTTTCAGG 


ACAATTTTCA 


CTTGGTCATG 


CTGGTTTCAT 


2520 


GGCGATTGGT 


GCCTATGCAG 


CAGCTATTAT 


TGGTTCTAAA 


TCACCAACCT 


ACGGTGCCTT 


2580 


CTTTGGAGCT 


ATGCTTGTAG 


GGGCTTTGCT 


TTCAGGAGCA 


GTTGCCTTAC 


TTGTCGGCAT 


2640 


TCCAACCTTG 


CGCTTGAAGG 


GGGACTATCT 


TGCGGTAGCA 


ACTCTGGGTG 


TTTCTGAAAT 


2700 


TATCCGTATC 


TTTATCATCA 


ATGGTGGAAG 


CCTTACAAAT 


GGTGCGGCAG 


GTATCTTAGG 


2760 


GATTCCTAAC 


TTTACAACTT 


GGCAAATGGT 


TTACTTCTTT 


GTCGTGATTA 


CAACCATTGC 


2820 


AACCTTGAAC 


TTCTTGCGTA 


GCCCAATTGG TCGTTCAACC 


CTCTCTGTTC 


GTGAAGATGA 


2880 


AATCGCTGCT 


GAGTCAGTTG 


GGGTTAATAC 


GACTAAAATT 


AAAATCATCG 


CTTTTGTCTT 


2940 


TGGTGCCATT 


ACTGCAAGTA 


TTGCTGGGTC 


ACTTCAGGCA 


GGATTTATCG 


GGTCTGTTGT 


3000 
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ACCGAAAGAT TACACCTTCA TCAACTCAAT CAACGTTTTG ATTATTGTTG TATTTGGTGG 3060 

ACTCGGTTCC ATTACAGGTG CGATTGTTTC GGCTATTGTT CTGGGAATTT TGAATATGCT 3120 

TCTCCAAGAT GTTGCTAGTG TGCGTATGAT TATTTACGCT TTGGCCTTGG TATTGGTAAT 3180 

GATTTTCAGA CCAGGTGGAC TCCTTGGAAC ATGGGAACTG AGCCTATCAC GTTTCTTTAA 3240 

AAAATCTAAG AAGGAGGAAC AAAACTAATG GCATTACTTG AAGTAAAACA GTTAACCAAA 3300 

CATTTTGGTG GTCTAACAGC TGTTGGAGAT GTGACTCTTG AATTGAACGA AGGGGAACTG 3360 

GTTGGATTAA TCGGTCCAAA CGGAGCTGGG AAAACCACCC TTTTCAACCT TTTGACCGGT 3420 

GTTTATGAAC CAAGCGAGGG AACAGTAACC CTAGATGGTC ACCTTTTGAA TGGGAAATCA 3480 

CCTTATAAGA TTGCCTCTTT GGGACTTGGA CGTACTTTCC AAAATATCCG TCTCTTTAAA 3540 

GATTTAACAG TTTTAGATAA TGTTTTGATT GCTTTTGGAA ACCATCACAA ACAGCATGTT 3600 

TTTACTAGTT TCTTACGCTT ACCAGCTTTT TACAAGAGTG AAAAAGAATT AAAGGCTAAA 3660 

GCTTTGGAAT TGTTGAAAAT CTTTGATTTA GATGGTGATG CAGAGACTCT TGCTAAAAAT 3720 

CTTTCCTACG GACAACAACG TCGTTTGGAA ATTGTTCGTG CCCTTGCTAC GGAACCTAAA 3780 

ATTCTCTTCT TAGATGAACC AGCAGCAGGT ATGAACCCAC AGGAAACAGC CGAATTGACT 3840 

GAGTTAATTC GTCGTATCAA AGATGAGTTT AAGATTACAA TCATGTTGAT TGAACACGAT 3900 

ATGAATCTGG TCATGGAAGT AACAGAACGT ATCTACGTAC TTGAATATGG CCGTTTAATC 3960 

GCTCAAGGAA CTCCAGACGA AATTAAGACC AATAAACGCG TTATCGAAGC TTATCTAGGA 4020 

GGTGAAGCCT AATGTCTATG TTAAAAGTTG AAAATCTTTC TGTGCATTAC GGTATGATCC 4080 

AAGCAGTTCG TGATGTAAGC TTTGAAGTTA ATGAAGGAGA AGTTGTTTCC CTTATCGGTG 4140 

CCAACGGTGC AGGTAAGACA ACTATTCTTC GCACCTTGTC AGGTTTGGTT CGACCAAGTT 4200 

CAGGAAAGAT TGAATTTTTA GGTCAAGAAA TCCAAAAAAT GCCAGCTCAG AAAATCGTGG 4260 

CAAGTGGTCT TTCACAAGTT CCAGAAGGAC GCCACGTCTT TCCTGGCTTG ACTGTTATGG 4320 

AAAATCTTGA AATGGGAGCT TTCTTAAAGA AAAATCGTGA AGAAAATCAA GCTAACT'i'GA 4380 

AGAAGGTTTT CTCACGCTTT CCTCGTCTTG AAGAACGGAA GAACCAAGAT GCAGCCACTC 4440 

TTTCAGGGGG GGAACAACAA ATGCTTGCCA TGGGACGCGC CCTCATGTCA ACACCAAAAC 4500 

TTCTTCTTTT AGATGAACCA TCAATGGGAC TTGCCCCAAT CTTTATCCAA GAAATTTTTG 4560 

ATATCATTCA AGATATTCAG AAGCAAGGAA CAACGGTCCT CTTGATTGAA CAAAATGCCA 4620 

ATAAAGCACT TGCAATCTCT GACCGAGGAT ATGTACTGGA AACAGGGAGA ATCGTCCTAT 4680 

CAGGAACAGG AAAAGAACTC GCTTCATCAG AAGAAGTCAG AAAAGCATAT CTAGGTGGCT 4740 

AAAACAATCC AGTGGATTGT TTTAGTCGGC AGATGGAGAT TACGAAGTAA TCATCAATAT 4800 
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AGTCCGGGGG ACCTTTTTAG TCGGTAGATT GAGATTGCAA ACAAATCTGC ATCTACATTG 4860 

AAAGCTTAAT TTCTAATAAT TGAAAAAATC GAATGAAAAA TTTCTTACCT TCATTCACAG 4920 

AGCTCGATTT CAGAGCTCTT TTTGCTAGCT TATTCATACT TTTCTGAATT TCGAAAAAGA 4980 

AATGTAAGCG TTTGATAGAT TTACAAAAAG ATTGTATAAT AGGGATAAGA ATAGAAAAGG 5040 

AGAAGTCTCA TGGCAGTTAA AGATTTTATG ACCCGCAAGG TAGTTTATAT TAGTCCAGAT 5100 

ATAACAGTAT CTCATGCAGC AGATTTGATG AGAGAGCAAG GTTTGCACCG TCTGCCTGTT 5160 

ATCGAAAATG ATCAATTAGT TGGTTTGGTG ACTGAGGGAA CCATTGCACA AGCAAGTCCA 5220 

TCTAAAGCAA CAAGTCTTTC TATCTATGAG ATGAATTATC TTCTGAATAA GACAAAAGTA 5280 

AAAGATGTCA TGATTCGCGA TGTTGTCACT GTCTCAGGCT ATGCTAGTCT AGAAGATGCA 5340 

ACTTATCTGA TGTTGAAAAA TAAGATTAGT ATTCTCCCTG TCGTAGATAA CCATCAAGTA 5400 

TACGGAGTTA TTACTGACCG TGACGTTTTC CAAGCCTTTC TTGAAATTGC AGGTTATGGC 5460 

GAAGAAGGGA TTCGTGTACG CTTTGTTACA GAAGATGAAG TTGGTGTTCT TGGAAAAATT 5520 

GTTTCTTTGA TTGTAGAAGA AAATTTGAAT ATCTCCCATA CAGTCAATAT TCCGCGTAAG 5580 

GATGGTAAGG TGATTATCGA AGTGCAAATC GATGGATCAA TTGATTTACC AGCCTTGAAA 5640 

GAAAAATTTG AAGCAAATGG TATTCAAGTG GAAGAAATCG CTCGCACTTC AGCAAAAGTC 5700 

TTGTAAGAAG GGAAGCCCAA AGGCTTCTTT TTTCATGAAA AGGGGATTAG AGCAAAAGAT 5760 

GGAAAGAAAT GATAAAATAT GCTATAATGA AATAATGTAA AAAAGGAGTA TTTATGGACA 5820 

TTTCAGTAAT TCGTCAGAAA ATTGACGCAA ATCGTGAAAA ATTAGCTTCT TTCAGGGGGT 5880 

CTCTTTGACC TCGAAGGGCT AGAGGAAGAG ATTGCCATCT TGGAAAACAA GATGACAGAA 5940 

CCTGATTTTT GGAACGATAA TATTGCGGCC CAAAAAACGT CGCAAGAATT AAATGAATTA 6000 

AAAAACACTT ACAATACCTT CCATAAGATG GAAGAGTTGC AGGATGAAGT CGAAATTTTA 6060 

TTGGATTTTT TGGCTGAAGA CGAGTCAGTG CATGATGAAC TGGTAGCGCA GTTAGCCGAA 6120 

CTTGATAAGA TAATGACCAG CTACGAGATG ACTCTACTCT TGTCAGAACC TTATGACCAC 6180 

AACAATGCCA TCTTGGAAAT CCATCCAGGT TCTGGTGGTA CTGAGGCGCA GGACTGGGGT 6240 

GATATGTTGC TTCGTATGTA TACTCGTTAT GGTAATGCTA AAGGCTTTAA AGTGGAAGTG 6300 

TTGGATTACC AAGCAGGTGA TGAGGCTGGT ATTAAGTCGG TAACTTTATC ATTTGAAGGG 6360 

CCTAATGCCT ATGGTCTCCT CAAGTCAGAA ATGGGTGTTC ACCGCTTAGT GCGAATCTCA 6420 

CCATTTGACT CTGCCAAACG TCGCCATACC TCTTTCACAT CTGTAGAAGT GATGCCAGAA 6480 

TTGGATGATA CTATTGAAGT GGAAATCCGT GAAGATGATA TCAAGATGGA TACCTTCCGT 6540 
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TCAGGTGGTG CCGGTGGACA AAACGTCAAT AAGGTTTCAA CAGGTGTACG TTTAACCCAC 6600 

ATTCCAACTG GAATTGTTGT CCAATCAACA GTAGATCGTA CCCAGTATGG AAATAGAGAT 6660 

CGTGCCATGA AGATGTTGCA GGCTAAGCTC TATCAAATGG AGCAAGATAA GAAGGCTGCG 6720 

GAGGTAGATT CTCTCAAAGG TGAGAAAAAG GAGATCACTT GGGGAAGCCA AATCCGTTCT 6780 

TATGTCTTCA CGCCTTATAC TATGGTAAAA GATCACCGAA CTAGCTTTGA GGTTGCTCAG 6840 

GTAGATAAGG TTATGGATGG GGACCTAGAT GGTTTTATCG ATGCTTATCT CAAGTGGCGA 6900 

ATTAGCTAAG ATAGAAAGGA ACTCACATGT CAATTATTGA AATGAGAGAT GTCGTTAAAA 6960 

AATACGACAA CGGAACAACT GCTCTACGCG GTGTTTCGGT TAGCGTTCAA CCGGGGGAAT 7020 

TTGCTTACAT CGTAGGACCT TCAGGAGCAG GGAAGTCAAC TTTTATTCGT TCTCTGTATC 7080 

GTGAAGTAAA AATCGATAAA GGAAGCCTAT CAGTTGCTGG TTTTAATCTG GTTAAGATCA 7140 

AAAAGAAAGA TGTCCCGCTT CTACGTCGTA GTGTTGGGGT TGTCTTCCAG GATTATAAAT 7200 

TGTTACCAAA GAAAACTGTC TATGAAAATA TTGCTTACGC TATGGAAGTA ATCGGGGAAA 7260 

ATCGCCGTAA TATCAAAAGA CGAGTGATGG AAGTTTTGGA CTTGGTTGGA TTGAAGCATA 7320 

AGGTTCGTTC TTTCCCAAAT GAACTCTCAG GTGGGGAGCA ACAGCGGATT GCGATTGCGC 7380 

GTGCAATTGT AAATAATCCC AAAGTATTGA TAGCTGATGA GCCAACAGGA AATCTGGATC 7440 

CGGATAATTC ATGGGAAATT ATGAATCTCT TGGAACGGAT TAACyTACAA GGAACAACTA 7500 

TTTTGATGGC GACTCATAAT AGCCAGATTG TAAATACCTT GCGCCACCGT GTCATTGCCA 7560 

TTGAAAATGG CCGTGTCGTT CGTGACGAAT CAAAAGGAGA GTATGGATAC GATGATTAGT 7620 

AGATTTTTTC GCCATTTATT TGAAGCCTTA AAAAGTTTGA AACGAAATGG TTGGATGACA 7680 

GTAGCTGCTG TCAGTTCAGT CATGATTACT TTGACCTTGG TGGCAATATT TGCATCTGTT 7740 

ATTTTCAATA CAGCGAAACT AGCTACAGAT ATTGAAAATA ATGTCCGTGT AGTAGTTTAT 7800 

ATCCGAAAGG ATGTGGAAGA TAATAGTCAG ACAATTGAAA AAGAAGGTCA AACTGTTACA 7860 

AATAATGACT ACCACAAGGT ATATGATTCT TTGAAGAACA TGTCTACGGT TAAAAGTGTT 7920 

ACCTTTTCAA GTAAAGAAGA ACAATATGAA AAATTAACCG AGATAATGGG AGATAACTGG 7980 

AAAATCTTTG AAGGAGATGC CAATCCTCTC TATGATGCCT ATATTGTAGA GGCAAACACT 8040 

CCAAATGATG TAAAAACTAT AGCCGAAGAT GCTAAAAAAA TTGAAGGTGT CTCTGAGGTT 8100 

CAAGATGGCG GTGCCAATAC AGAAAGACTC TTCAAGTTAG CTTCATTTAT CCGTGTTTGG 8160 

GGACTAGGGA TTGCTGCTTT GTTAATTTTT ATCGCAGTTT TCTTGATTTC AAATACCATT 8220 

CGTATTACCA TTATTTCCCG CAGTCGCGAA ATTCAAATCA TGCGCTTGGT CGGAGCTAAA 8280 

AACAGTTATA TCCGTGGACC GTTCTTGTTA GAAGGAGCCT TTATCGGTTT ATTGGGAGCT 8340 
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ATCGCACCAT CTGTTTTGGT CTTTATTGTT TATCAAATTG TTT AC CAATC TGTCAACAAA 8400 

TCGTTGGTAG GGCAAAATCT ATCCATGATT AGTCCAGATT TATTTAGTCC GTTGATGATT 8460 

GCCCTACTAT TTGTGATTGG GGTTTTCATT GGTTCATTGG GATCAGGAAT ATCCATGCGC 8520 

CGATTCTTGA AGATTTAGGT AAAATAGCTG CTTTTATGAG GAGATTGTAA AATCTCCTTT 8580 

TTTGCTACAA GAGTTTTTGA AAAGAGATGC GCAGAAGAAA AGAGCTTCCA AAGAAGTCCC 8640 

- CCAGAGAAGA CTTC 8654 



(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 19718 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 

TGTCGCGTCA AAATCATTAC TATGGCTATG TATAGCCCTT ACTATGACTT GGCTAAACAC 60 

GTTCGCTTTC AAATTTCTAG GCTCAGGCTG AAACAGTCTC CCAGGCTGTT CACTCCCGAA 120 

TGCTAAAATC GTTCTTGATC GCTTTCACAT TGTACAACAT CTTAGCCGTG CTATGAGTCG 180 

TGTGCATGTC CAAATCATGA ATCAGTTTCA TCGAAAATCC CATGAATACA AGGCTATCAA 240 

GCGCTACTGG AAACTCATTC AACAGGATAG CCGTAAACTG AGTGATAAGC GATTTTATCG 300 

CCCTACTTTT CGCATGCACT TAACAAATAA AGAAATTCTT GACAAGATTT TAAGCTATTC 360 

AGAAGACTTG AAACACCACT ATCAGATCTA TCAACTCTTA CTTTTTCACT TTCAGAACAA 420 

AGACCCTGAG AAATTTTTCG GACTCATTGA GGACAATCTG AAGCAGGTTC ATCCTCTTTT 480 

TCAGACTGTC TTTAAAACCT TTCTCAAAGA TAAAGAAAAG ATTATCAACG CCCTTCAACT 540 

ACACTATTCT AATGCCAAAC TGGAAGCGAC CAATAATCTC ATCAAACTTA TCAAGCGCAA 600 

TGCCTTTGGT TTTCGAAACT TTGAAAACTT CAAAAAACGG ATTTTTATCG CTTTGAACAT 660 

CAAAAAAGAA AGGACGAAAT TTGTCCTTTC TCGAGCTTAG CTGACTTCAA CCCACTACAG 720 

TTGACAAAGA GCCTAATTTC CATAAAAATT GACATGGAAA TTATAAAACC ATTACTAGTT 780 

TAGTCCTTTT TGATAACGTG CCAATTCGGC TTGGTTCGCC CAAACATAGT GACCTGGACG 840 

GATTTCTACC ATAGATGGCT TATCAGTCTC ATAGTCGTGT TGACTTGGAT CGTAAACCTT 900 

CAAGACCTTC TTACGTTCCA AGATTGGATC TGGGATTGGT ACCGCTGAAA GCAAGGCTTG 960 

AGTATATGGG TGAATTGGAT TGTTAAACAA TTCTTCTGTT TCTGCAACCT CTACAATAAC 1020 



WO 98/18931 



PCT/US97/19588 



752 

ACCCTTGTAA ATAACTGCGA TACGATCTGA AATAAAGCGA ACAACCGACA AGTCATGGGC 1080 

GATGAAGAGA TAGGTCAGGC CGAGCTCTTT . TTGGAATTTT TTGAGCAAGT TCAAGACTTG 1140 

GGCACGTACA GAAACGTCCA AGGCTGAAAT TGGCTCATCT GCAATAACAA AGTCTGGTTG 1200 

CATGACCAAG GCACGGGCAA TACCGATACG TTGACGTTGA CCGCCTGAGA ATTCATGAGG 1260 

GTAACGAGTC AAGTGCTCAG CAAGAAGACC TACTTCACGG ATAATATTTT GAACTTTCTC 1320 

TTTACGTTCT TCTTCATCCT TAAATAAACG GTGATTGTAA AGACCTTCAG AAATAATATA 1380 

ATCAACAGTC GCACGTTCAT TCAAACTTGC GGCAGGGTCT TGGAAAATCA TCTGGATTCG 1440 

ACGAATCAAT TCCGCAGCTT GTTCACGCGA TTTCTTACCA TTAATCTTTT GAC CATC AAA 1500 

AATGATATCT CCATTACTTG TATCATTTAG ACCGATGATA GCACGACCAA TAGTTGTTTT 1560 

CCCACTACCG GACTCACCTA CAAGCGAGAA AGTTTCTCCC TTGTTGATAA AGAAGTTAGC 1620 

ATTTTTAACC GCGACAAACT TCTTACTTCC TTCACCGAAG GAAATTTCTA AATCTTTGAT 1680 

TTCTACTAAT TTTTCAGACA TTTCCTTCCT CCTAGTCAGC CAGATGGGCA AATCCCATTT 1740 

TTTCACGGAT CTTATCATGG AGATTTGCAA TCACAGCTGG TTTTTCTACT TTCGGAGCAT 1800 

CCTCATGAAG AAGCCAAGTT TTAGCCCAAT GTGTCTCTGA TACTGAGAAT TGAGGAGCTT 1860 

TTTGTTCGAA GTCAATCTGC ATTGCGTAGT CAGAACGCAA GGCAAAAGCA TCCCCTTTCA 1920 

GGTCAGTATA AAGTGACGGA GGTGTTCCTG GGATTGAGTA AAGATCCCCT TTATCATCAG 1980 

CAAGCTGAGG CAAGCTAGAC AAGAGACTCC ATGTATATGG ATGGCGAGGG TCATAGAAGA 2040 

CTTCCTCAAC CGTTCCATAC TCAACGATTT CTCCTGCATA CATAACCGCT ACCTTATCCG 2100 

CAATACTTGC CACCACACCA AGGTCGTGGG TAATAAAGAT TGTTGTGAAA TGATACTCGT 2160 

TTTGTAAAGA TTTTAGCAAA TCAATAATCT GAGCTTGAAT AGTTACATCC AAGGCAGTTG 2220 

TTGGCTCATC ACAGATCAAG ACATCAGGTC GGCAGGCAAG GGCAATAGCA ATAACGATAC 2280 

GTTGACGCAT TCCTCCAGAA TATTGGAATG GGTATTCATT AAAACGTCTA TCTGCGTCTG 2340 

GAATGCCAAC CTTATTCATG TAGTCAATGG CCAATTCTTT CGCTTCTTTA GCTGTTTl'TC 2400 

CTTGGTGTTT TACAATAACT TCTGTAATCT GACTACCAAT TGTTTTAATG GGGTCCAAAC 2460 

TAGTCATTGG GTCCTGGAAG ATAGTCGCAA TCTTAGCACC ACGAATTTGT TCCCAATCCT 2520 

TGTGAGAAGA TAAAGCTGTC AAGTCCTGAC CACGGTAGTC AATACTACCT TGGGCAATAC 2580 

GACCATTTTC TTCGAGCATA CCTGTGAAGG TCTTTGTCAA AACAGATTTA CCTGATCCTG 2640 

ACTCACCTAC CAAGGCTAAT ACTTCTCCTT CGACTAGTTC AAGGGAAACG CCGCGAATGG 2700 

CTGTCAATAC TTTGTCACGA ACGTCAAATT CCACGACAAT ATCGCGAGCA GTCAAAATTA 2760 

CATTTTTTTC TTTTGTCATT TCTACTCCTA TCTATGTGTA CGTGGATCAC TAGCATCCGC 2820 
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TAAGTTTTGA 


CCAACTACGA AAAGGGACAA 


GGATACCAAG 


ACAAGGGTTG 


TCAATGGAAT 


2880 


CCAGAACAAG 


TAAGCATTGG 


TTGTTACGTT 


TTGTGAATAA 


TCCGAAATCA 


AACGACCCAA 


2940 


ACTTGGCACT 


GTAATCGGTA ATCCAAGACC 


GAAGAAAGAC 


AAGAAGGCTT 


CGTATGAGAT 


3000 


AAAGCTTGGA 


AGCATTTGAG 


TCATGGTTGT 


CACAATAACA 


GATACCAATT 


GAGGCATGAT 


3060 


ATTTTTGGCA 


ACAATCTTCA 


AGGTTGGTGT 


TCCCAAAGTA 


CGTGACGCCA 


AGTTGTATTC 


3120 


CAAGTCACGA 


TAGCGCAAGA 


TTTGCACACG 


GATCATGAAG 


GCAATACCAA 


TCCATGTTGT 


3180 


TACGCTCATG 


GCAAAAATCA 


GATTCCAGAA 


TCCAGCTCCG 


ATTGAGTAAG 


TCAAGACAAT 


3240 


AACAATCAAA 


AGAGGTGGGA 


TGTTTGAGAT 


GACGTTGTAA 


ACTTCCATCA 


TGACACGGTC 


3300 


AACTGATTTT 


GAAATACCCC 


AAATACCACC 


GACAAAAACA 


CCGATAACCA 


AGTTAATCAC 


3360 


TGTCGCAATC 


ACAGAAATGA 


GGATGGAGTT 


ACGAGCTCCG 


AACCAGACAC 


CGTCAAAGAG 


3420 


CGATTTACCG 


TTACTGTCAG 


TACCGAACCA ATGCTCCGCA 


TTTGGCTTGA 


TATAACGAAC 


3480 


ACTAAAGTCG 


TTTACCTTGC 


TGACATCATT 


GAAATCAAAC 


TTAGAAAACA 


TTGGGTAGAT 


3540 


GAAACTTATC 


AAAATGATGG 


CTACCAAGAT TCCCAACATG 


ACTACAGTTG 


ATTTTTTCTT 


3600 


CATAAATTGT 


TTAAACACTG 


ATTTCCAGTA AGAATATGCT 


GGCGCATCAA 


TAGTTTCAGA 


3660 


GGCAAAATCG 


TCACGTTTTA 


CAAACTGAAA 


TTTTTCTTTA 


TCGATTGTAG 


ACATTATTTG 


3720 


CCTCCTTTCT 


CAGTCAATTT AATACGTGGG 


TCAATAATAG 


TCATCCAAAT 


ATCTCCCAAA 


3780 


AGACGTGAGA 


AGATAGAAAT 


ACATGTAAAG 


ATGAAGACAA 


GACCAACGAC 


CATAGAGTTA 


3840 


TTAGATGCTT 


TTACAGAGTC 


AATCAACATT 


TTACCCATAC 


CTGGGAAGGC 


GAAGACTGTT 


3900 


TCAGTAAGGG 


TTGCACCACC 


GATAACCCCA 


ATAATGGCAG 


CAGGAATTCC 


TGAAACCAGC 


3960 


GGAACCATGG 


CATTTTTAAA 


GATGTGTTTG 


TTTGAAATTT 


CTTTTTCAGA 


CAAACCTTTT 


4020 


GCACGAGCGA AACGAACAAA GTCTTGAGAT TGCAAGTCAA TCATGTAACG ACGAATCCAA 


4080 


ATGGCTGTAC 


CAGGAGCACC 


CAACAAACCA 


AGGATGACTG 


CTGGTAAAAC 


GTAAGAACGC 


4140 


CAATCTCCAG 


CTCCCAAGAT 


AGGGAATGAA 


TCTGGAAGGG 


CAATAGATGA 


TCCAATCAAT 


4200 


CGAACGATGT 


AAACCAAGGC 


AATCGTTGGA 


AGAGCAAGCA 


AGAAGGTCAA 


AGCCCCTGTT 


4260 


GAGAGGCTAT 


CAATCCAAGT 


GTTCTTGAAA 


CGAGCCATGG 


CTGAACCAAG 


TGGCACGGCA 


4320 


AGAGCATAGG 


CAAGAACCAA 


ACCAATCAAA 


CCAGTAATAG 


CAGAGCTGAC 


AATCATAGAT 


4380 


GGATATTGGT 


AATTACTTTC 


AGTCGCTGTA 


TAAGGATCAT 


CTTTCCCATA 


GCTAGCTACT 


4440 


TCACGAGAGT 


CAGCCTGACT 


AGGTGACTTG 


TAGGTTCTTG 


AGTAAATATT 


TACAGAAGAC 


4500 


GTTTTCTTAC 


CTGTTGGGAA 


CTGAACTTGG 


GCAGTTTTGG 


TTTGTCCTTG 


ACCTTGAGTA 


4560 
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ATAACCTGAA 


GAACTGGTGT 


ATTAGCATAG 


GTTGGGTAAG 


AGTCACCTAA 


ATTCAAGTTC 


4 620 


ACAAAGTTTT 


GATGAACAAA 


TGGGAACTGA 


CTGTTAAAGT 


ACAAGAGATA 


TTTATGTTTA 




GTTCCTGAAC CGACCAATGA CCATCCGATA GCTGGATCAT TTTCAAAACG AAGGTAGCGT 


a ha n 


TTCAAGTCTG 


GATTTTCAGG 


GTCTTGGATT 


TTATTTGTAT 


GGTCAATGTC 


AATCAAGTTA 


A Q AA 

4HUU 


GCATAGAAGT 


GAAAAACACG 


TTCAAAAATT 


GGAATTTCAC 


GAGTAGCATA 


GAATTGACCA 


4860 


CTTTCAGTAA 


ATTCTCCCAA 


AGTCCAACCA 


TGACCTAATT 


GATTGATGTA 


CTTTTCATAA 


4920 


ATAGCTTTAT 


TGGTCGCATT 


TGCTTCTACT 


GTTACAGAAG 


AATCCATGCT 


ACTTGCCTTT 


4980 


TCTTGCAACT 


CTTTAGTATC 


GTAATACTCA ATGTAGCCCA 


TACGCTCAAA 


CACAGTATTT 


5040 


TCATAGTTAT 


CACGTTTATC 


AGCCGTTGTC 


GCAATTTTAT 


TATAGTTAGG 


ATCCTGCTTG 


5100 


AAAATCAATT 


TTCGAGGAAC 


CAAGGTATAG 


ATAATCGTGT 


AGGTCAAAGT 


CGTTACTAAG 


C 1 £ A 


AAAATCGAAA 


CCAATGACCG 


CAAAACACGC 


ATAAAAATAT 


ATTTTTTCAT 


ATTATTTCCT 


5220 


TTAAAAATCC 


CAAAAGAACC 


TTCTCCTCAT 


GGAGAGAAAG 


TTCTATTAGA AATTATTTAC 


5280 


TTCACATGAC 


TTGCCAATTC 


TTTTTGAGCT 


TTCTCATTTG 


ATTCAGCTTT 


TTCTTTCAAC 


5340 


CATTTTTCAC 


GAGCTTTTTC 


ATACTCTTCC 


TTAGTCACCA 


CTTTATCTTG 


TGATTTCAAA 


54 00 


TATTTGAAGT 


AAACATCTGA 


CCCCTTAGAG 


CCTGTTTGCG 


CAGAAGCTCC 


AGTAAATGGA 


5460 


ACAATTCGTG 


AAAGCACTGG 


TGCTGCACCA 


GAAGAAGCCA 


TAGCAGGAAT 


AAAGAGTGAA 


5520 


CTATCTGTCA 


ACCATGCTTG 


AGCCGCTGCA 


TATTTTTCAT 


AACGGACATT 


CAAGTCGCTT 


5580 


GTCTCTCTGG 


CAGCTTCATC 


AACTAATTTA 


TCGTATTCTT 


TCAAACCAAC 


TTGAACTACT 


5640 


GAAGGGCTAT 


TTGGATTATC 


AAATCCTAAA 


TATGTTTTTG 


TAGTTTCACT 


GCTAGTTGTT 


D /UU 


TTTAAAATAT 


CCAGGTAAGT 


AGATGGGTCT 


TGATAGTCTG 


GCCCCCATGA 


AACTCCTCCT 


D 1 OU 


GATACATCCC 


AATCCTCAGA 


TGAAGCATTG 


GCAGCATAGT 


AAGTAATATT 


AAGGAATTCA 


COOA 


TCACTTGTCA 


TTTGTTGAAT 


ATCAACAACG 


ACATTTTCAA 


CACCAAGAAC 


TGTTTCTACA 


jOOU 


GATTGTTTAA AGGACTGAAT 


ACGAGATATG 


TAGTTTTTTG 


ATGCTTGGTC 


TACTGGAACG 


5940 


TCCAGATGAA 


TAGGAAACTG 


AACGCCGTCT 


GCTTCTAAAG 


CTTTCTTAGC 


TTTCGCAAAC 


6000 


TCTGCCTTGG 


CCTTGTCAGC 


ATTGAATAAA 


CCATCCTGCC 


CATCAGCTAA 


ATTCACACCT 


6060 


TTCCACTCAT 


CACCATAAGC 


AGGAAGTTGA 


GCAGCGACTA 


AATCACCAAA 


GGTCTTCTCA 


6120 


CCAGCTGAAA 


CAAAGTCTGG 


TTTTACAAAT 


AAATTACGAA 


CTGCTAAAGC 


TGCTCCATCT 


6180 


TTACCATTGA 


TTTGAGCTGA 


GTAAGCTGAG 


CGATCAAGAG 


CAAAATTCAA 


GGCTTGACGG 


6240 


AAATCTTTGT 


TAAGCAATGC 


CTTCTTAGTA GCTACTTTCT 


CTGAATCTGT 


AGTTTTAGAA 


6300 


GTATAGTTGT 


AACTTTGGCG 


ATCAATATTC 


ACACCCAGAC 


CAGCAATCCC 


AGAGCCTGAT 


6360 
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CGTTTCCCCG 


ACTCCTGACT 


CATATCCATC ATCAAGCfiAA 




Afl A ARAP A A A 


Q i CPl 
tJIDU 


ACTAATAAAA 


TAGTCCCCAC 


AATTCCGTAA CTCAGAATCG 


T ATP A AT AT A 


A Afi A r*Tf5Tf3f! 


8220 


GCATGTTCAT 


GATAAGGAGC 


ATGTATCCGA GGATAAGAGT 


TCATATAGGT 


I. AA 1 


ft ? fin 


TCACCCCAAA 


AAGGATTTTG 


CTTAAACAAG GCCATCCCAG 


CATCCCAGAT 


nunnn 1 vjV-Vj x 




TCTTCCATAG 


AAGAGTCTAA 


AGTACCCATT CGAACTCCCA 


AATCACTAGA 


AAAuAItuAAA 




CTCAAACCAA 


TCGCGAAGAC 


CCCAATACTA AGCCAAAAGG 


CCTTCCAGTT 


111 AA 1 nu 1 J- 


Ail 


GTAAAGAGAT 


AGATAATTGC 


TCCAGCGATA ATAGCAGGAA 


AGGCAGTTCG 


r"V 1111 \a X\\3 i. n 


8520 


AAGTTCAAAC 


CAAAGAGATT 


AACAAAGCCT GCAATCACAC 


AGAATACTTT 


PAAPPAATTP 


8580 


AACTTGGTCG 


TTGTAAACAG 


ATAGAAAGCA ATCATAATAC 


AGAAACAACA 


ftninnl 1V.I.A 




TAATAATTAG GATTAAAGAA GGTCACTTCT GCCCGGTTCT GATGCCACAC 


V~ 1 uLA 1 A 1 lu 


o /UU 


GGTGAAAGAA AAGCATAGTT 


AAATTTCTTC ACAATTTGGA 


AATGTTCTAA 




8760 


GCAGCTGACA AGACACTACC 


AAACAAGACA AACTGCAAAA TCAATCGAAA 


IxAAl 1 XAlAiu 


goon 


GATAAAATCG 


ACTGATAGTG 


CAAAAAGAAA ATAGTAAATA 


GAAACATTCC 


1 At. iunnuLt 


ooon 
oooU 


ACAAGACCCA TCCAATTTTG 


TGCAAGAATG GATATAACAG 


TACTATAGCT 


MuAAAAAGA 




AGCAGCATCG 


GATGCTCCCC 


CATTTTCTGA AGAATACTTT 


TCATGTCTCC 


TGTaaAaATC 


9000 


AAACTGATAA TATATAAACA GAGTACAACT ACAAAAAGAT AAAAGGGTAA 




yuou 


AGGATAATTC 


CCAATAAAAT 


CAGCTCTTTA CTAGACAACC 


CCTTCAGCTT 


1 lUuVlAAAu 




CCTATTGATT 


TCAAAATGAA TCCTTTCTCT CCAAATCAGC 


TGATTCAGAT 


AA X AVa 1 MuL 




TATCCTATAT 


TGTACCACTT 


TTTTAGCAAT TTGAAAACAA 


AGGAAACGTT 


1 1LLAAAA1A 


G Oil rt 


AAAACCCTAT 


TTTATCCACC 


ATATCAAGGC TTCAAAATGA 


TACTTCAACT 




J J UU 


TTACCCGATA 


AGTCTGATTT 


TGCAAATCAA TTTCTACTAC 


TGCTGTTACG 


Vjn\> 1 1 A 1\. 1 1 


9360 


TATTTTGACG 


TTTGATTACA 


ATGCTGTGAG CTGTTGGTGT 


CTCTATCTCA 




9420 


CTAGATCAAA GGCTTCTGAA CGGTTACGGA AAGAAAATAG ATTGAGAAGG 


GCCTTCACAA 


9480 


CAGGTCGTTG 


CACTTCTTTT 


GCTATTTCCT CGTTGCTATA 


GTAATGACGA 


TTAATATTTC 


9540 


GACCTTCTTT 


AGTTTCTTCT 


AATAATTTCA AGTCATTCTT 


GCCTGCTAAT 


AGACCCACAT 


9600 


AGTAAATCTG 


AGGAATACCT 


GGGGCAAAAG CTTGAATTAG 


ACGAGCGAGA 


AAATACTTGA 


9660 


CATCATCATC 


TCCAAGCGCT 


GAATAGTAGG TTGAATTGAT 


TTGGTAGATA 


TCTAAGTTGT 


9720 


TATACTCGGC 


ACTAGAGTAC 


TTACGTTTGA CATTGGCTCC 


AACCTTATAG 


AGTTCATTTG 


9780 


AAGCATAGTC 


AATCTCCTCA 


TCGGTCAGGA TATCCTTGAC 


ATCTACTACT 


CCAATCCCAT 


9840 


CATGGGTATC 


TAGCGTCGTA 


AATTGCTTCA TCGGGCTCAT 


CTTTAACCAC 


TTAGCCAAAC 


9900 



WO 98/18931 



PCTYUS97/19588 



755 



TGTGTGTAAT 


AGATATTGTC 


CTTGTATTCT 


TCTGCAACCT 


TAGAATAGTT 


GGAGCTGGTA 


6420 


GGGTAAAGAC 


GGGCATAACT 


1 

ATAAGCTCCA 

i 


CTAGTGAAGT 


TACGCTCTAG 


CGACTCCTGA 


6480 


TCTGATCCAT 


CATAGTAAGC 


TAGATTGATA 

i 


GTATCTAGGT 


GGACATTTTC 


TTTATCCCAA 


6540 


TATTGCTCAT 


TTTTTACAAA 


CTCTACAGAA 


GATTTTGCAG 


TCAACCCTTT 


CAACAAGAAT 


6600 


GGACCATTAT 


AAAGCAAGGA 


TGTCGGATCT 


GTTGGTTTAG 


CAAAATCGCT 


TCCTTTTGAT 


6660 


GTTTCGAATT 


CTTCATTCAG 


AGGCCAGAAA 


ATAGAATAGG 


TCAACTTAGA 


GTTCCAGAAC 


6720 


GGTTCAGGCT 


GGTTCAAAGT 


GTATTGTAAC 


GTATAATCAT 


CAACCGCCTT 


GACACCAACT 


6780 


GTTGAAAAAT 


CTGTTGAAGT 


TCCTGATAGA 


TAATCTGCCA 


AGCCTTTAAC 


CGAATTTTCA 


6840 


GCTAAATACA 


TAGCTTCTGA 


TTTTTTATCT 


GCTGCGTGTT 


TTAAACCGTT 


CACGAAATCT 


6900 


TTAGCCGTCA 


CCTCTGCATA TTCTTCTCCA 


TCAGAGGTAA 


ACCATTTAAC 


CCCTTTACGA 


6960 


ATCTTATAAG 


TGTAGGTCAA 


ACCATCCTTA 


GAGACTTCCC 


AATCCTCTGC 


AACTGCAGGA 


7020 


GCAAGATTAC 


CGTAATTATC 


GTTAGTGAAT 


AAACCATCAA 


TCCCATTTGA 


AGTCACTACT 


7080 


GTTGTACTAT 


TTTTACTTGA 


AATCAGGTAG 


TCCAAGGTTT 


CTGGGTCTGC 


TGTATAAACA 


7140 


TAGCCATAAG 


CTTTAGGGGC 


TGATGAATCA GATGATTTTG 


AAGAACTGCA 


TGCTGCAAGT 


7200 


ACACCTGCTG 


CTAATAAAAC 


AAGACCTGCT 


GTAGCAAATA 


CACGATTTTT 


TTTCATTTTC 


7260 


TACTCCTCTG 


TTTATGTGAA 


TTATAGATTG 


ACAACCATTA 


TATCACATTA 


TCCATTAAAA 


7320 


ATCAAACAAA 


TTTTCAGAAT 


ATTTAGGCTT 


GTTGGCACAA ATTTTTCATT 


TTTTTTGAAT 


7380 


ATATGATTCA 


AATTGTCGTT 


CGAAGTGTCA 


AAGACTACAG 


TGAAAATAGG 


AAATTTGACG 


7440 


CAGAAACTTT 


GGAGTTTAGG 


AAGACATACA GTAAAATGAA ATACGGACGG 


AACAATGTGA 


7500 


TTTTGGAATT 


CAAATTAAAT 


TATAACAATA TTGTAGAAGT 


ATCATTCTAG 


TATTCAAGAT 


7560 


TCAGTTTACT 


ATGTCTTTTC 


ACACCAACCT 


TATCCCGAAT 


TCAATTACTT 


TTGTGATTTA 


7620 


CATATATAGA 


TTAAGACTAT 


CTTTTATACT 


TTAAAATTTC 


TCGCTACCTT 


ATCCACTATA 


7680 


TGCTCCTCGC 


TATCACGTTT 


CTATTCATAG 


CCTACGATTT 


CACTATTGCT 


TTCTCTGACA 


7740 


ATTCTTATTT 


CCTGCGTCAG 


ACTTAAAACG 


ATCTATCCCC 


AGACCATTTT 


AATCCGCTAC 


7800 


CTCACGATAG 


TCAGGCTTGG 


GGAGCGCTAT 


TGTATTCACC 


GGTAGTGGAG 


CCCTACAGAG 


7860 


GACTTACACC 


TCAGATGCAC 


GACATGCCCA 


TCGTATAAAA 


AATCTCCTAC 


CCAAGGTAGA 


7920 


AGATTTCAAA 


CTTATAAAAC 


TTAATCCGTC 


ATGTCCGATA 


CCAACATTCG 


ATGCTCCAAT 


7980 


GGAATACTGC 


ACATAACTAG 


CAAGAAAATA 


AAGCCTGACT 


GAATCCAGAA 


GAGAGCCAAG 


8040 


TCAAAAATTC 


CGTGCACAGC 


AACCACTGTA 


AGGAAAGATA 


GATAAAGGCC 


GATAATCGGA 


8100 
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GCTCTGTTCT 


GGAACTGTAA AGAGTATAAA 


GTGTCACCAT 


TGGAAGAGCA 


AAATCATAAA 


9960 


CATAGTAATC 


ATGGTCTGCT 


ATTTTAAACT 


GAATCGAATA 


GTGTTCATGA 


ATCTCAGGTA 


10020 


AAAGCTCTGT 


CCCATACTCA GCAGCGATAT 


CTCGAACTTT 


GTCCAATAAA 


TCCCAAATAT 


10080 


CTGGTTCCAC 


AAAGAAATCA 


TTAGTATCCA 


ATTTCTTCAC 


TGCATAAGCA 


AAGGCATCTA 


10140 


GACGAATCAA 


ATCACACCCA 


TTACTTGCCA 


AGTGCTGAAT 


GGTCTTACGG 


ATAAATTCCA 


10200 


TAGTTACTTC 


TTTGGTCACA 


TCAAGATCAA 


TCTGCTCCTC 


ACCAAAGGTA 


TTCCACAAAT 


10260 


GTTCCACTGA 


ACCATCTTCA 


AACACAATCT 


CTTGCTTTGG 


TGCACGATCC 


TTACGCTTGT 


10320 


AAATTAAATC 


TACATCAGAC 


TGTGTCGGAC 


GGTTTTCTGG 


CCAAAACTTA 


TCCCAGTTTA 


10380 


AAAAGAGAGC 


TTTAAATTCA 


CTGGCTTCAT 


GTTTTTCTTG 


ATAGTCCTTA 


TAATACTTGG 


10440 


ATTGACGAGA 


AATATGATTA 


ATCATAAAAT 


CAAACATAAG 


ATAATATTTC 


TCACCTAAAC 


10500 


GCTTCACATC 


CTCCCAATCA 


CCAAAAGCTG 


AGTCCACTTC 


GTCGTAGTCA 


ACTGGCGCAA 


10560 


ATCCACGATC 


AACTGTTGAT 


GGGAAAAATG 


GTAAAAGGTG 


AACTCCTCCA 


ATAGCATCTC 


10620 


CAAAATGCTC 


TTCCAAATTA 


TCATATAAGT 


CTTTAAGATT 


ATTTCCAAGG 


CTATCAGAAT 


10680 


AGGTAATCAA 


CATGGTTTTA 


TTTTGAATTG 


GCATCATTAC 


TCTCCTTTTT 


CTAATTGAAG 


10740 


CCAAGTCTCA 


TATGATCTGG 


CTTCATAAAT 


AAAATTCATT 


TTAAATCTCT 


ATTTATCATC 


10800 


AAACTCGTAC 


TAATATAGAC 


TGTGATAAAC 


AAAGTACTAC 


TTTCTTGTTT 


TCTGCATAGA 


10860 


ATTATCAACA 


AGCTAAACTC 


TTCCTCTGTG 


TCAAAGACTA 


TAGATTCCAT 


GAGCTCTTCT 


10920 


TATACTCTTC 


GAAAATCTCT 


TCAAACCACG 


TCAGCTTCAC 


CTTGCCGTAG 


GTATGGTTAC 


10980 


TGACTTCGTC 


AGTTTCATCC 


ACAACCTCAA 


AACAGTGTTT 


TGAGCAACCT 


GCGGCTAGCT 


11040 


TCCTAGTTTG 


CTCTTTGATT 


TTCATTGAGT 


ATTACTTCAC 


TGCCCCGTTG 


CTCATTCCTG 


11100 


AAATGATATG 


GCGTTGGAAG 


AAGAGATAGA 


CAATGGTGAT 


ACTGATAATG 


CCGACCACGT 


11160 


AAGAGGCAAA 


GCTTGGTCCG 


TAGTCGTTGA 


AATATTGGCC 


TGCGTAGTTG 


TATTGGAACA 


11220 


AAGGCAGAGT 


CCACATTTTG 


GAATCCCGGT 


TCAAGACAAG 


GAGTGGCAAC 


ATGAAGTCAT 


11280 


TCCAGAACCA 


AAGGGCATTG 


ATGATCATGG 


TTGTCGCATG 


CATCGGTTTC 


ATCATTGGGA 


11340 


AGATGATGCG 


GAAATAGGTT 


GTAAATTGAT 


TAGCCCCATC 


GATCTCTGCT 


GCTTCATCCA 


11400 


GACTTTCTGG 


AATCGAGATT 


TTGATATAGC 


CAACATAGAG 


AAAGAGGGTC 


TGTGGAATCG 


11460 


CATAGGTCAA 


GTAGAGCAAG 


ATCAAACCAA AGGTATTAGC 


CAAACCGAGT 


TTACTCATCA 


11520 


TAACCGTAAT 


CGGAATCATG 


ATGACTTGGA AAGGTACGAA GATTCCGAGG 


ATTAAGAGGG 


11580 


TATACATGAT 


GGTAAAGGCT 


TTTCTTTTAC 


TCATATTGCG 


AGCGATGGAG 


TAGGCTGCCA 


11640 
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TAGGGATAAA GATCATTACT GCAAGTAAAG ACAAGACAGT GATGACGACA GAGTTCCAAT 11700 

AATAGCCTCC AATCCCATCA GCTAAGAGAC GGCTAAAGTT GTCCCATGTG AAGTTGGTTG 11760 

GAAAGCCAAA GAAATTATCT ACAATATCCT TAGTGGGTTT GAAGGAACTA AAGAGGGTAG 11820 

CAAGGAGCGG CACTAAAATC AGAACCGATC CTAGAATCAA TAGAATGTAT TTGCCAATCA 11880 

GGGCTTTTCT TTCATCTTGT TTCATCATGC TTCTCCTCTT AAATTTCAAA TTTCTTAGAT 11940 

ACTCTCAATT GGATGATCGA AATCACTACA ATTAAGAAGA ACAAGATTAC GGCAATGGCA 12000 

TTGGCATAAC CGAATTGGTT GTTTTTAAAG GCATAGTTAT AAACCAAGAG CCCAAGTGAG 12060 

GTTGTGGCAT TGTTTGGACC ACCACCGGTC ATGGCAAAGA CTTGGTCAAA GGCAGTCAGC 12120 

CCACCTTTTA GGGCTAGGAT AAAGACCATA GAGACACTTG GTAGCAAGTA AGGCAATTCA 12180 

ATGTTCCAGA AAACTTGCTT GCTAGTCGCA CCATCAATCC TTGCTGCCTC TGTAATCTCA 12240 

GTTGGAATAG ATTGCAAACC AGCTAGGAAG ATGATGATGG GCATAGCCAC CCCTTGCCAA 12300 

AGAAGGACAA AGACAGCCGC AAAGATTGCT CCCCACTTAG TCCCTAAAAG ACTGGTTTGG 12360 

AAAAATTCAA TATGAAGGGC ATTTCCAATC GCTGGAAGAC CGTAGTTGAA GACTTGCTTG 12420 

AAGATCAAAG CCACTGTCAA ACCAGATAAA ACAGCTGGGA AGAAGAACCA AGCACGGAAG 12480 

AAGGTTTGGC CTTTGATTTT AGAATTCAAG ACACGCGCAA TGAAGATCCC GAGTGCAATC 12540 

TCACCAACCA CCATGGCAAT CGCAATGATT GCGGTAAAGC CAATCGCATT CATGAATTTT 12600 

GGATCCATGA AGAGGAGCTT AAAGTTGTTT AAGCCAACAA ATTTGTAGTT ATAAGTCAAT 12660 

CCTGTCCAGT TGGTAAAACT GTAAAAGGCT CCTTGAAACA TCGGCACATA GAAGAAAATT 12720 

GCTTGTAACA AGAGGGGGAT GACCACAAAA GCCCATGCCC AATATTTTTG TAATACTTTT 12780 

TTCATAGTCT CTCTACTCCT AATCCACATC CGCTTTCATC GGGTTAAAGA AGGCATTCAA 12840 

ATCATTGACC ATGCCTTGTT TATCACCGGT CAAGACATAG TTCATGGTCA AGGTATGGAA 12900 

GTCTGCTTCA CTGGTCCAGT ATTGTTGCAA CCAGACCAAG TGACGATCCG TAAAGGCATA 12960 

TTCGGTCATA CCAGCAAGCG GTGAATCTTC TCCTGCTTGT TTGACCCCTT CGATCGCTGT 13020 

TGGAGATCCG TCCACATCGT AGTATTTTTG CATGACTTCT GGACGGGTCA TATATTCCAC 13080 

AAAGGCATTG GCTTCTTTTG GATGTTTGGT GGTGGCTGAG ATAGACCATG CCAAGTCTCC 13140 

CGCACCAACG GTTAAGCTTT GTCCTTTTTC TTTTCCTGGA ATCATGAAGG TCCCAATCTT 13200 

AAAGTTCGGT TTTTGTTCAT TAATCGCTGT GATCGCCCAA GACCCATTTG GTGTCATGAG 13260 

GACATCCCCA CGTGCGAAGG CTCCGATAAC ATCGGTATAG CCAGCACCTT CCCAGTTCTT 13320 

TTGCTTAGAT CCATTGATGC GAAGGATGTC CATGACCTTG ATATCATCTT TCATAATCGG 13380 

ATCCGACAAT TTAATGGCAT TTGGTTGAGA ATAACGAAGG TATTGATTTG CTTCTTTTCC 13440 
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TCCACCTGTT GCTGTCGCAA AGGCTAATTG ATTGTAACCA TTGAGTGTCC AAGCATCTGC 13500 

ACCTGCAATT CCAAATGGTG TTTGTCCTTT AGCAACGATA TCTTTGACTA ACTGTTCAAA 13560 

TTCATCCCAG GTTTCAGGAA CCTTCAAGCC CAGTTCTTCG AATTTATCTT TGTTGTAGTA 13620 

AATTCCATAA GCATTAGCTG TAAAAGGAAC GTTGTAAACT TTTTCGTTTA CAGCATATTT 13680 

TTCAGCGTAG CCATTTTTCA CGCGTTTCAG GTAGTCTTTG TTGCTCAAAT CTTCAAAAAC 13740 

ACCTGCTTTT GCCCATTCTT GCAGTTCGAT GGACTGTGGG TAAATATTGA CCACATCAGG 13800 

CACATCTCCT GCGAGAACGC GTGTCTTCAA TACTTCACCA GCATTTGGTA CATTGACGAC 13860 

TTTGACCTTG ATCTTAGGGT TTTCCTTCTC AAAATCACGA GTGATTTCTT CCAAGGTTTT 13920 

GGTCATTTCT TTTTTCTGGT TGAAATACTC GATGGTCACT GTGCCATCCG CAGATTTACC 13980 

ATAGTTGGAG CAAGCGCCGA GCCCAAACAA AGCTAAACCT GTAGTTGCAA GAAGTCCGAT 14040 

TTTTTTATAC CATTCCATTA GAAAGCCTCC TTTATAAATT TATACACCCT TATTGAACTG 14100 

CACCCCAAAA GTTAGACAGA ATAAATCTAA CTTTTGGGGT CAGTACATAT CATAGTTTTC 14160 

TAAAAATATA CTGTCTACTC AAAAAATCTC CTTGGGATAA GATAACAGTT AAGCCCGCAT 14220 

ACATTAGTTC TGCACCTGAG TAAACTTCGC CATTTTCCTG TAATTTATAT AGTCCCTCTT 14280 

CATCCAAATC TTTTAATTTT AAAGTTGTTT CCATGGTCTC TACAACAGAT AAAACGCGAA 14340 

CGTAGGTTAC AATCGTTTGA TTTCCGTAAT TAAATTGTAC AGCTGCTTCA TTGGATACAG 14400 

TATCAGGATT AATTAGTCTA TACTGCTGTC CTAACTGAAC TACTGGTCGT AATTCTTTAT 14460 

ACAAGTTCAC CTGATTAGCA ATCGTAGCTT TCTCTTCATC TGATAAATTT GTCAAATCAA 14520 

GTTCATAGCC CAAATTTCCC ATCATTGCTA CAAGGCCACG TGTTTCTAAT GGTGTCATTC 14580 

GTCCCATCTG ATGATTCGGT ACTGCTGACA CA7GAGCCCC CATAGAAATG GTTGGATAGA 14640 

GATAGGATGA ACCGTATTGA ATTGGTAAAC GTGCAATGGC ATCAGTATTA TCACTAGCCC 14700 

AGACTTGTGG GAAATAGCGC ATCATACCAA GATCATTTCG TCCACCACCA CCAGAGCAGG 14760 

ACTCAAAGAG AATATGGCTG TGCTTCTCTG TCAGATAAGA AACGAGTTCA TAAAGCCCCA 14820 

GCATGTACTG ATGAGATTGC ATCTGTGTCT CTAGATAAGT TAATCCATTC CCTAGCTTAG 14880 

TGATATTGCG GTTCATATCC CATTTAATGT AATCAATATC ATGATAAAAT AGGAGTTGAT 14940 

CTAAGACACT TTTCAAGTAT TCTACTACCT GAGGATTGGC AAGATTAAGT ACTAATTGAT 15000 

TCCGAGAATA AGTATGCTCA TAGCCAGGAA CCTGAATAGC CCAGTCAGGA TGTTGACGAT 15060 

ACAAATCACT ATCTACAGAA ATCATTTCGG GTTCTAACCA AAGTCCAAAC TGCAAACCTC 15120 

TTTCATGGAT AGCTGAAATC AGACTTTCTA GACTTCCACC CAGTTTTTCC TCATTAACAA 15180 
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CCCAATCACC TAAAGCACGA TTATCATCAA AACGATTGCC AAACCAACCA TCATCTAATA 15240 

CAAAAAGTTC AATGCCAACT TTCTTAGCTT CATCTGCTAA CTCTAACAGT TTTTCTCTCT 15300 

GAAAGTCAAA GTAAGTAGCT TCCCAGTTAT TGATTAGAAT TGGACGTTCT TTTTTAGAAA 15360 

ATTCACTTAG CATAATGTGC TTCAGTACAA AATTCTGACT TTCATGACTA ATACCAGTTA 15420 

ATCCCTGATC TGAATGAGTC ACTAAAGCTA CCGGTGTTTC AAAGTATTCC TCAGGAGCTA 15480 

ACTTCCAAGA AAAGTTTTCT GGATTAATGC CAATAGCCAC CCGAACTTCA TTCAATTGAT 15540 

TTTTTTGAAC AAAAGCTTCA AAGTTGCCAC TATACATTAG TTGAATAGCA AACACATTCC 15600 

CAGCATCCTC TGTGACTCCT TGTTCGCATA GTAGAAGAGC TGGTGTTTGA GCATGACCAG 15660 

AAGCACCTCG GTTTGAACTA ATCGAAAAGA TTCCTTGTTC TACCTGTTGA CGTCTAACAG 15720 

TCTTTTCACG AGCATAAGCA CCCTGCAGAG TTACTATTTC GTAATCTGCA GCTGGAAAAT 15780 

CAGCCATAAA AGAAAAATCT TTATGGATGA CAACTTCCTG ATTACTATTA TTATCTAATT 15840 

TACTGTAGCT AGCAATAGTC GCATCATTAT TAAAAGTAGT ATAATACAAA GTCAGACTAA 15900 

GTTGAGCCTT AGAATCTTCT AACATTAAGA CAAGAGTCTC TGTATCGTCC ATGCTATGTG 15960 

GAGAAGGTAA GCCCTGTGGA CCATTCTGAC CTTTTAAAAT CTTTGCTTCT ACAAATCGAA 16020 

AGTCTGTTAC TTCAGTTACA CTATGCTGAA CCTGTATGGT TGGTTTCCTA AAATCTCCTA 16080 

AGCCATGTTG TCCAAAAATC TGTCGCTGAG TATCTAAACT AAAGGTTCGA TTAGTAGCCG 16140 

TTGGATTTCC TGAAAAGGCA TGGTCTCGTT CATAAACACT ATTGGAACCT TTATAGTTCT 16200 

TAATAGTCTT TCCTAAATGT TTCAAAAGTA AGTAGCCATT TCGATTTTCA ATAATCAAAC 16260 

TTAGATTTTT ACTCTCAACA TAAAATAGAT TATTCTCTAT CCTAACTCCC ATTTACTTCA 16320 

CCTCATCACT TTATTGATTA TATTTTATCA CCTGAAATCG CTTTCCAAAA TAGAAAAATG 16380 

TCTCAAGAAT ATGGTAAAAT GTTAGGTAGG AGGTAGCACA TGTTAGTTTT TTCAGAATAC 16440 

CAGACTGGAA CAATCGACCT TGCCCTAAGC TTTTATGGAT ATGAGGAATG CACACCTAAT 16500 

TACTCTTTTG GTCCAGCCAT TCGTGATACA TACGTTCTAC ATTACATTAC TAAAGGACAA 16560 

GGAAAATTTC ATTACAAGGG TAAAATTGTT GATTTAAAAG AAGGAGATTT CTTTCTATTA i6620 

AAACCAGAGG AACTAACCTT TTATCAAGCA GATAGTAAAG AACCTTGGGC CTACTACTGG 16680 

TTAGGAATCA CTGGAGGGAA AGCCCCTGAT TATTTTGCTC TTTCCCAAAT TTCTGATCAA 16740 

TCCTATCTCA TCCAATCTGA AACTTGTCAT ACCCAGACTA CTGCAAAACT CATCTCAGAC 16800 

ATTGTCCGCT TCGCTCAGAT TACAAAATCA AGTGAATTAG CTCAACTCCA TATCATGGGA 16860 

CAACTTCATG AACTGATGTT TCATCTGGGA ACTATTGCTC CCAATCAGAA AAAAAAGAAT 16920 

ATTTCATCAA CCCACCAACT CTATCTTGAA TGCAAACGAT TAATTGATAG CCACTATCCT 16980 
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CAATCACTTA CAATTCAAGA TTTAGCAAAA GAACTATCCG TTCACAGAAG CTACTTATCA 17040 

AGCGTATTCA AAGAATTTAA TACCTTATCA CCCAAAGAAT ACCTACTCTA CGTTCGAATG 17100 

CACCGAGCTA GACAACTTCT CGAAAATACC CAAGAGTCCA TCAAGGTAAT TGCATACTCG 17160 

GTAGGTTTTT CAGATCCACT CCATTTTTCG AAAGCTTATA AACAATACTT TAATCAGACT 17220 

CCAAGTCATA CAAGAAAAGA ATACTCTCAA TACCAACTAG TAAGAAAGGC AACATTATGA 17280 

.AATCCTACCA AGCTGTCTAC CAAATCCTAT CTAAAGAAAC CGACTATATC AGCGGAGAAA 17340 

AAATCGCAGA AAAACTATCC CTAAGCCGAA CAGCAATTTG GAAAGCCATC AAGCGACTAG 17400 

AACAAGAAGG CATTGAAATT GATAGTATCA AAAATAGAGG ATATAAACTG ATGAATGGTG 17460 

ACCTTATTCT TCCAGAGATT CTAGAAGAAA ATCTTCCAAT TAAAGTCAGC TTTAAACCCG 17520 

AAACAAAATC AACACAACTA GATGCAAAAG AAGCAATTGA TTTAGGCCAT GAAGCAAATA 17580 

CCCTCTATCT AGCTTCCTAT CAAACAGCAG GCCGAGGCCG TTTTCAACGT TCCTTCTACT 17 640 

CACCACAAGG TGGTATTTAT ATGACACTCC ATCTTAAACC AAATCTCCCC TATGACAAAT 17700 

TACCATCCTA CACACTACTT GTAGCTGGAG CTGTCTACAA AGCCATTAAG AACCTAACTT 17760 

TAATAGATGT CGACATAAAA TGGGTCAATG ATATCTATCT AAACAATCAT AAAATTGGAG 17820 

GAATCCTTAC TGAAGCAATG ACCTCTGTAG AAACTGGCTT AGTCACAGAT ATCATTATTG 17880 

GAGTAGGTAT CAATTTCACT ATTAAAGACT TCCCTCAGGA ATTAAAAGAA AAAGCTGCCA 17940 

GCTTATTTAA AGCTACAGCT CCTATAACAA GGAATGAATT GATCATAGAA ATCTGGCGTG 18000 

CTTTCTTCGA AACACCAGCA GAAGAGCTAT TATACCTATA CAAAAAACAG TCATTCATTC 18060 

TAGGAAAAGA AGTCACTTTC ACACTAGAGC AAAAAGACTA CAAGGGACTT GCTAAAGACA 18120 

TCTCAGAAAA TGGAAAACTT TTAGTTCAAT GTGATAACGG AAAAGAAATC TGGCTAAATA 18180 

GTGGCGAAAT TTCTCTCAAT AGTTGGAAGT AAAATAACAC AATTATAATA TAAACGATAT 18240 

AAAAATAACT TCAGATTAGT AATTCAATTA AGTTTTACGG ATCTGAAGTT TTATTGGCTC 18300 

TAAAAATAAA AAAGAGAGTT ACAGACTCTC ATTAAAACGG AGAATAAGGG ATTCGAACCC 18360 

TTGCGCCAGT TACCCGACCT AACGATTTAG CAAACCGTCC TCTTCAGCCT CTTGAGTAAT 18420 

TCTCCAATTA ATGGGCACGA GTGGACTCGA ACCACCGACC TCACGCTTAT CAGGCGTGCG 18480 

CTCTAACCAC CTGAGCTACG CGCCCAAGTT AAAAAACTTG GTAATTTGAA CAAAGTTCAA 18540 

AGCGGGTGAC GAGAATCGAA CTCGCGACAA CAGCTTGGAA GGCTGTAGTT TTACCACTAA 18600 

ACTACACCCG CATAAATACT ATCAATAAAA TGGCGCGAGA CGGAATCGAA CCGCCGACAC 18660 

ATGGAGCTTC AATCCATTGC TCTACCAACT GAGCTACCGA GCCTTATTGC GGGAGCAGGA 18720 
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TTTGAACCTA CGACCTTCGG GTTATGAGCC CGACGAGCTA CCGAGCTGCT CCATCCCGCG 18780 

TTAATAATAT AAAAGGAGGA TGTGGGATTC GAACCCACGC ACGCTTTTAC ACGCCTGACG 18840 

GTTTTCAAGA CCGTTCCCTT CAGCCGGACT TGGGTAATCC TCCAATATTC AAATGGACCT 18900 

TGTAGGACTT GAACCTACGA CCACTCGGTT ATGAGCCGAG AGCTCTAACC AGCTGAGCTA 18960 

AAGGTCCGAC AAGATCATTA TAGCGGCGAA GGGGATCGAA CCCCCGACCT CCCGGGTATG 19020 

AACCGGACGC TCTAGCCAGC TGAGCTACAC CGCCATGAAT CGGGAAGACA GGATTCGAAC 19080 

CTGCGACACC TTGGTCCCAA ACCAAGTACT CTACCAAGCT GAGCTACTTC CCGAGTTAAA 19140 

TAGAAAAATG CACCCTAGAG GAGTCGAACC TCTAACCGCC TGATTCGTAG TCAGGTACTC 19200 

TATCCAGTTG AGCTAAGGGT GCTCCATATT ATGCCGAGGA CCGGAATCGA ACCGGTACGA 19260 

TCGTTACCAA TCGCAGGATT TTAAGTCCTG TGCGTCTGCC AGTTCCGCCA CCCCGGCCTC 19320 

TCTAAGCGAA CGACGGGATT CGAACCCGCG ACCCCCACCT TGGCAAGGTG GTGTTCTACC 19380 

ACTGAACTAC GTTCGCACTG TTTTCTTCTA TCTAAAAATG CCGGCTACAT GACTTGAACA 19440 

CGCGACCCTC TGATTACAAA TCAGATGCTC TACCAACTGA GCTAAGCCGG CTCATTTGTT 19500 

ATATCTTAAT GCGGGTTAAG GGACTTGAAC CCCCACGCCG TTAAGCGCCA GATCCTAAAT 19560 

CTGGTGCGTC TGCCAATTCC GCCAAACCCG CATATATGAC CCGTACTGGG CTCGAACCAG 19620 

TGACCCATTG ATTAAAAGTC AATTGCTCTA CCAACTGAGC TAACGAGTCT AAAATAACTT 19680 

GCGTTACCTT AAACGGTCCG ACGGAATCGA CCCGGTAC 19718 
(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 4117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 

CCGTGGAAAA GTCTGGATAG TGAATGGTCT TCACACAATG ACCTGAAAGA AGCCTGAGAA 60 

TAATTATGGA GAGTAGCATT CTGAGAGGTG TTAGCAGAAC CATATGACAG AGCTGTTTGA 120 

AGAGGGAATA TTGAGGAGAA AAATCCTGAG CCTACCAGTT GGAGTTGGAA AGAGCTGACT 180 

GTTAGATCAT GGTTTATTAT CCACAACCTG TGGATAACTT TGTGAATAAG AGAAGTTGCT 240 

AAAGAAGGAG ATATATAACG ATGAAGAAAA TCAAACCGCA TGGACCGTTA CCAAGTCAGA 300 

CTCAGCTAGC TTATCTGGGA GATGAACTAG CAGCTTTTAT CCACTTCGGT CCTAATACCT 360 

TTTATGACCA AGAATGGGGG ACTGGACAGG AGGATCCTGA GCGCTTTAAC CCGAGTCAGT 420 
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TGGATGCGCG 
TGGTCAAGCA 
AGGTCAGTCC 
CAGAGTTTGA 
ATCATGTGGA 
- TATCAAATCC 
GAGGAGAGGG 
ACCTGCAGGG 
ATGAACGAGG 
CAGAAGCAGA 
GAGAGGCAGA 
CTCTCGAGGA 
TTAATATTCC 
AATTTGCGAC 
CTGGTCCAGC 
GCTCTTGGGC 
AAACTTTTGA 
TTCATGTGCA 
GTTACAAACG 
CAGAATCACA 
CAAAAAAAGA 
GAGAAAATGC 
TTTCGATTCA 
TTGCGTTTCA 
GAGATAAAAC 
TTCAAGTCCA 
TAGTGACTTG 
TGTTGAATAG 
TTGTGGTTTT 



TGAGTGGGTT CGTGTGCTCA 
CCACGATGGC TTTGTCCTTT 
TTGGAGGAGA GGAAAGGGCG 
TATGGATATG GGGGTCTACC 
CCGAGAAGCG GACTACAATG 
TAACTATGGG AATGCTGGTA 
CGCGCAAAAG GTTAATTATG 
CGATTGCTTG ATTTTTTCAA 
GTATGCAGGT GATCCACTGT 
GCTGAACTAT CTTCAGCACG 
TGTTTCCATC CGTCCAGGCT 
GTTGGTCGAA ATCTACTTTC 
GCCGAATCAA GCTGGGCTCT 
CTATCGCAAT GAGCTCTATA 
TCTTTCCGCA GACTTTGCTT 
AAGCGATGCA GACTTGCCCA 
TGTAATTGAG TTAAGAGAAG 
AGTAGAGGTG GATGGTGTCT 
TCTCTTACGA GGAGCAGTTG 
GGCTTTGCCT TTGTTGACCA 
AGTTGTTCAG GAACTAGCAT 
CTATTTTACA GTTAAGCGCA 
ACCGGGGACA GGTGTCCATG 
AACTGGTGAG ACTGAAAAAA 
CTTGGATTTC TATCTGAACC 
AGTTTCATAA AAGAAGAACC 
GTAACCAGCT GAGGGTGAAA 
TTGATACGAG TGTTTTGTCC 
GTTTAGTATG GATATCCAGC 



763 
AGGAAACGGG 
ATCCGACAGC 
ACTTGCTCCT 
TGTCACCGTG 
CCTATTATCT 
AGTTCGCTGA 
AATTTGAAAA 
CAGAAGGCAC 
GGCAAAAGGT 
GGGATCCCTC 
GGTTCTACCA 
ACTCAGTAGG 
TTGATGCAAA 
AAGAAGATTT 
GTCGCCATTT 
TCCAGTTAGA 
ATTTGAAGCT 
GGCAGGAGTT 
TTGAGGCACA 
AGATTTCCCT 
TTGCAGAAAA 
GAGAATGTAG 
GTGTCGCCTA 
GTCTGACGCT 
TAACGGTGGA 
TTTGCGCGAT 
GTTAGTTGTT 
AGTCGGCATT 
CATTTATCTT 



CTTCAAAAAG 
TCACACAGAT 
TGAAGTATCC 
GGATGCCCAT 
GGCTCAGTTG 
GGTTTGGATG 
ATGGTTTGAA 
CAGTATCCGC 
GAATCCTGAT 
GGGCACGATT 
TGAGGATCAG 
GCGAGGAACT 
GGATATTGAA 
GGCTCTGGGA 
GACAGACGGC 
ACTCGACTTA 
AGGGCAACGA 
TGGTTCGGGT 
GAAGATACGT 
TTATAAAACT 
AAGCCTAGCT 
TGGTCCTTTA 
TCAGGATGAG 
ACCAACCTTG 
TGGTCAGCTT 
GCAAAGGTTC 
CAGCTTTTAA 
CTTTGACAAA 
CTTTAGCGAG 



TTGATTTTGG 
TATTCGGTTA 
CAAGCTGCCA 
AGTCCCCTCT 
AAGGAAATCT 
GATGGTGCCA 
ACCATTCGTG 
TGGATTGGCA 
AAACTAGGAA 
TTTTCAATCG 
GATCCTAAGT 
CCACTCTTGC 
CGACTTTATG 
GCTGAGGTAT 
CTTGAGACCA 
GGTTCTCCTA 
ATCGCTGCTT 
CATACTGTTG 
GTAGTCATTA 
CCTGGATTAT 
GTGGCAAAGG 
GAAGCTAAGA 
ATTCAAGTCC 
TATTTCGCAG 
GTGGATCAAC 
TTTTGGTTAT 
GAGGTCTTGG 
GTTAAAATGG 
GTAGACTCGT 



480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
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AGATGGTCAA 
AATCCGAGAG 
TAGGCTTCCC 
TAGTCAGGGT 
ATGGCTATTT 
TAGCCTGTTG 
GTTTCTTTTC 
GCTTGAATGG 
TGAAAGTTTC 
CGGTAATTTT 
GCACAGCTGG 
TGGTGGTCGG 
ATGCCGAGAG 
AGGTCGGGGG 
GGCATCATAC 
CTATTGAGGA 
GGGAAGGTGG 
ACAGTACCAG 
GTCTGTGGGA 
CCTGTCTCTA 
TTACAGCTAT 
GAAGTGGCTA 
TCTTGGCTGA 
TTTTTGTTAG 
TCTACTGTCA 
TTCAAACGAA 
AGAGAAAAGA 
TGGCGGTGAA 
GAGTAGTCCC 
AGATGTGGAT 



AGAGAGGGAT 
CTGACCAGAT 
AACTTGGGTG 
AATCGCTGTA 
GTCCAAAAGG 
TTTCAAAGAG 
CACCCATCAG 
CAGAGCATTC 
CTTGATTGTC 
GTGAAGCAGC 
CGATACAAAA 
TAGAGAGGTA 
GGTCGGCTTT 
TCATGTCCTT 
CCCGTTCATC 
AACCTTCTAA 
TGCGGAAGGT 
TAGCCAGATC 
AGAGGAAGAG 
TAATGTCAAA 
CAAAATCTTC 
GTTGCATCTC 
TAGCAAGAAT 
TTTCAGTTTT 
GTTCATCTGC 
TAGAAGCACC 
GCTTCAAATA 
AGAGGCTGTC 
CAATCCAAGG 
CAAAAAACCA 



TCCGAGGTCA 
GTACCAAGCA 
AAAAGCTTTC 
ACGGAAGAGA 
AGCAGTAGCC 
GGGAGCATCT 
TTGGATTAAG 
AGCGTAGTCT 
TCGTGCTCGC 
CTTGTAGGTT 
GTCACTATAG 
ACCTAGTTCT 
GCTGGCTGTT 
GCAGGCGCTA 
TGGAGCCAGC 
AAAGCGTTGA 
ATCCCAGAAA 
CATGTGGATG 
TCTGTAGAGG 
ACGATGGAGG 
TTGAGGTAGA 
GGTTTGACTA 
ATCCGTGTTC 
ACCTTCTTGT 
TGCGTGAAGA 
ATAGCAAGTC 
GTGAGGCTGG 
TCCCCCCAGT 
ACTGGGCTGG 
AGATCCATCC 



764 
TAGCTTGGTT 



GAGAGACTAC 
TGACGGAGCG 
TAAGGAATGT 
ATCTCGCTCA 
TGACAGGCTT 
CCAGGGATGT 
CGCCCCCAAC 
ATGTAACCTG 
TCAGCGATTT 
GCATAGTCTA 
TGGTATTGGG 
TCGAGCATGG 
TCTGCGATAA 
CATTTTGGAA 
TAGTGCTCCG 
CCATTGTTGC 
GCTTGCCCTG 
CAGTGGTCAA 
AGATTTTCCC 
TTGATTAGAG 
CTTGCTAAGT 
ATTTGCAGGG 
CGCAGGGCAA 
TAGAGGGAGA 
GGTGTGAGCT 
AAGCAAGCTT 
TGACTGGTGA 
TGAGTTAATC 
TGGTCACTGG 



TTCCTGGACA 
CATTGTCTTC 
TCTTGATAAG 
GGAAACTAGG 
TTTCGTGAAT 
TCAAAAGATA 
CGTGGAGAAC 
TATAAGGAGA 
TCTCAGCGTC 
CTATGTTCTC 
GAGTATGGCT 
CTAGTCCGTG 
CTTGGAAGAG 
TACCGTCTAA 
GGAAACCAGT 
GTATGATAAG 
TAAAGAGGAC 
ATTCATTAAT 
AGAAGGTTCG 
AATCCACTTG 
CTTGAGAAGG 
CAATTCGCCA 
CAGTGAACAT 
GAGTCCGCTT 
GGGCTTTGCC 
GGGTTTCAAT 
TATCTATATC 
CAGGTGTCAG 
GAATCCCCTG 
TCTGGGGCAC 



GGTTGGATAA 
ATCTCCAGGA 
AAGGGCAGTG 
CTGGTTGGAA 
TTCGTAACCA 
GTTGCTAAAG 
GCCTAAAGTA 
GAAGTCAGGG 
AAATAGCTGG 
TAGTTTTTTG 
AACACTTTCG 
GCGGCCATTG 
TTCTCCTTCT 
AAGTGTACCT 
ATCGCGGTAG 
GGCAAAGAGG 
ACCAGGCTTG 
CTCATAAAAA 
GTCAGCCTCT 
GGCACTTGAT 
AGAGATGAAA 
GTCTCCAGCT 
CGTTAGCGAA 
ATCTACTTGC 
TTGCTTTTGA 
CTGATAACGC 
ATAAGAAGAC 
AAGGAGCCAA 
AAAGATAGGC 
AAAGTAATTC 



2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 
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ATCCCAAAAG GCACGCCTGT GTATGGCAGG GTATTTCCCC GAGAAAAGGC ATGCTTGTTG 4020 

GTAGTTCCAA AACGGGTATC GATGGTATCA AGTAGTGGTT TCATAGTCTT TCCTTTAGCT 4080 

GTTTTTCTAC ATTATATCAG TAATAGAGGG CCTTTAG 4117 
(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2727 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

CTGGTTCAAT TATTATTCAC TCTAAGTAGT CATATGTTCT TTATTTATGT GAGTTTTTAC 60 

CTTTTAAAGG ATCTTGTTAG ATGGGAGAAG GTTTTAAAAG TGACAGATGA TAATACAAGA 120 

AAAGTTCGTT TATTAGTAGC CTTTTTTAGC ATTGTCATAG GCTACATCCT GAGTTCTTTC 180 

TTTATTAGCC TGTATCATTT GTGGC AAGAA GCGCTTAGAG GATTATTATG AAATCAAGAG 240 

TAAAGGAAAC GAGTATGGAT AAAATTGTGG TTCAAGGTGG CGATAATCGT CTGGTAGGAA 300 

GCGTGACGAT CGAGGGAGCA AAAAATGCAG TCTTACCCTT GTTGGCAGCG ACTATTCTAG 360 

CAAGTGAAGG AAAGACCGTC TTGCAGAATG TTCCGATTTT GTCGGATGTC TTTATTATGA 420 

ATCAGGTAGT TGGTGGTTTG AATGCCAAGG TTGACTTTGA TGAGGAAGCT CATCTTGTCA 4 80 

AGGTGGATGC TACTGGCGAC ATCACTGAGG AAGCCCCTTA CAAGTATGTC AGCAAGATGC 540 

GCGCCTCCAT CGTTGTATTA GGGCCAATCC TTGCCCGTGT GGGTCATGCC AAGGTATCCA 600 

TGCCAGGTGG TTGTACGATT GGTAGCCGTC CTATTGATCT TCATTTGAAA GGTCTGGAAG 660 

CTATGGGGGT TAAGATTAGT CAGACAGCTG GTTACATCGA AGCCAAGGCA GAACGCTTGC 720 

ATGGTGCTCA TATCTATATG GACTTTCCAA GTGTTGGTGC AACGCAGAAC TTGATGATGG 780 

CAGCGACTCT GGCTGATGGG GTGACAGTGA TTGAGAATGC TGCGCGTGAG CCTGAGATTG 840 

TTGACTTAGC CATTCTCCTT AATGAAATGG GAGCCAAGGT CAAAGGTGCT GGTACAGAGA 900 

CTATAACCAT TACTGGTGTT GAGAAACTTC ATGGTACGAC TCACAATGTA GTCCAAGACC 960 

GTATCGAAGC AGGAACCTTT ATGGTAGCTG CTGCCATGAC TGGTGGTGAT GTCTTGATTC 1020 

GAGACGCTGT CTGGGAGCAC AACCGTCCCT TGATTGCCAA GTTACTTGAA ATGGGTGTTG 1080 

AAGTAATTGA AGAAGACGAA GGAATTCGTG TTCGTTCTCA ACTAGAAAAT CTAAAAGCTG 1140 

TTCATGTGAA AACCTTGCCC CACCCAGGAT TTCCAACAGA TATGCAGGCT CAATTTACAG 1200 
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CCTTGATGAC 


AGTTGCAAAA 


GGCGAATCAA 


CCATGGTGGA 


GACAGTTTTC 


GAAAATCGTT 


1260 


TCCAACACCT 


AGAAGAGATG 


CGCCGCATGG 


GCTTGCATTC 


TGAGATTATC 


CGTGATACAG 


1320 


CTCGTATTGT 


TGGTGGACAG 


CCTTTGCAGG 


GAGCAGAAGT 


TCTTTCAACT 


GACCTTCGTG 


1380 


CCAGTGCGGC 


CTTGATTTTG 


ACAGGTTTGG 


TAGCACAGGG 


AGAAACTGTG 


GTCGGTAAAT 


1440 


TGGTTCACTT 


GGATAGAGGT 


TACTACGGTT 


TCCATGAGAA 


GTTGGCGCAG 


CTAGGTGCTA 


1500 


AGATTCAGCG 


GATTGAGGCA 


AGTGATGAAG 


ATGAATAAGA 


AATCAAGCTA 


CGTAGTCAAG 


1560 


CGTTTACTTT 


TAGTCATCAT 


AGTACTGATT 


TTAGGTACTC 


TGGCTCTAGG 


AATCGGTTTA 


1620 


ATGGTAGGTT 


ATGGAATCTT 


GGGCAAGGGT 


CAAGATCCAT 


GGGCTATCCT 


GTCTCCAGCA 


1680 


AAATGGCAGG 


AATTGATTCA 


TAAATTTACA 


GGAAATTAGG 


CTGGAGAACC 


AGCCTTTTTC 


1740 


TAAAGATAAG 


GAGAAATATG 


AACAAAAAAA 


CAAGACAGAC 


ACTAATCGGA 


CTGCTAGTGT 


1800 


TATTGCTTTT 


GTCTACAGGG 


AGCTATTATA 


TCAAGCAGAT 


GCCGTCGGCA 


CCTAATAGTC 


1860 


CCAAAACCAA 


TCTTAGTCAG 


AAAAAACAAG 


CGTCTGAAGC 


TCCTAGTCAA 


GCATTGGCAG 


1920 


AGAGTGTCTT 


AACAGACGCA 


GTCAAGAGTC 


AAATAAAGGG 


GAGTCTGGAG 


TGGAATGGCT 


1980 


CAGGTGCTTT 


TATCGTCAAT 


GGTAATAAAA 


CAAATCTAGA 


TGCCAAGGTT 


TCAAGTAAGC 


2040 


CCTACGCTGA 


CAATAAAACA 


AAGACAGTGG 


GCAAGGAAAC 


TGTTCCAACC 


GTAGCTAATG 


2100 


CCCTCTTGTC 


TAAGGCCACT 


CGTCAGTACA 


AGAATCGTAA 


AGAAACTGGG 


AATGGTTCAA 


2160 


CTTCTTGGAC 


TCCTCCAGGT 


TGGCATCAGG 


TCAAGAATCT 


AAAGGGCTCT 


TATACCCATG 


2220 


CAGTCGATAG 


AGGTCATTTG 


TTAGGCTATG 


CCTTAATCGG 


TGGTTTGGAT 


GGTTTTGATG 


2280 


CCTCAACAAG 


CAATCCTAAA 








pAfir;r , APAAr; 


Z J 4 U 


CCGAGTATTC 


GACTGGTCAA 


AACTACTATG 


AAAGCAAGGT 


GCGTAAAGCC 


TTGGACCAAA 


2400 


ACAAGCGTGT 


CCGTTACCGT 


GTAACCCTTT 


ACTACGCTTC 


AAACGAGGAT 


TTAGTTCCCT 


2460 


CAGCTTCACA 


GATTGAAGCC 


AAGTCTTCGG 


ATGGAGAATT 


GGAATTCAAT 


GTTCTAGTTC 


2520 


CCAATGTTCA 


AAAGGGACTT 


CAACTGGATT 


ACCGAACTGG 


AGAAGTAACT 


GTAACTCAGT 


2580 


AAAAGATACG 


CCTACACTCC 


TATGTCACTT 


ATGGATGTAG 


GAGTTCTTTT 


TACTAGTTTA 


2640 


AG CAGG ACT A 


AGACAGGTAC 


TAAGACAAAA 


TAGCAACTTC 


TAAAACTAAC 


TTCCAGTTTT 


2700 


GGGAGAGAGA 


TGGAAGTTAC 


TTTGAGA 








2727 



(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 

TTTTTTGTAG ATTTAAGTGG GGTGCAATTC CTAAAAAATA AAAAACAATT TTTGAAAATT 60 

ATGTTAGCAG GAATTGCTTC AAATTCGATT TTATCACTTA CAGGTTTACT TGTTTTATTG 120 

TTCACATCGT ATAAATTGCT TGGACTCTTA TTTTTTATCA TTAACTTAGG TATGATTTTT 180 

ATTAATTCAA TTCCTTTTTT TCAGTATGAT AGTGGTATTA TTTTAAGATA CTTGAATTCT 240 

AACAATAATA ACTTGAATTT TCAATATATA GTTCAACTTT TAATAGCATT TGTTATTATT 300 

TATTTTCCTT TGAGTCAACT ATTACAGTTT TTGACACCCA ATATTATTGT TCGTAGTATA 360 

GGAGGGGTGG TTGTTTCTAT ACTGCTTTCT AT ATT AT AT A TGATAGGAAG GACGAAATAT 420 

GTTCTACGTA AATAGTTATG TTTTTGCTTA TAAAAAAGAA GGTATAATGT ATTTACGTGG 480 

TCGGAGTATG CGGGAAATAG CTATAGAACC TCAAATTTCG CAAGAATTTA TCAACGATCT 540 

ATTTAATAGT TGTAAGGAAC TATTAGAGAT AGAAGAAGTA TTAGGCAGTA AACTAACATT 600 

TGAACTATAA ATGAACAAAT TTTAATTTCG GATGAGATAG ATATTGATAG TAGATATTCT 660 

AGAACTAAAG GTTACTATTC GTTATTTTAT AATGAAGAGT ATAATAAAAT ACAGAATAAA 720 

ACAGTATTAG TATTAGGAGC AGGAGTCTTA GGATGTTATA TATCTCTAAG TCTAAGTATG 780 

TATGGAGTGA GG AAACTTAT TGTCGCTGAT TACGATATAA TAGAACCATC AAATTTAAAT 840 

AGGCAAATTC TTTATACAGA GTCGGATGTT GGTAAGGAGA AGATTAATGT TCTTTCTGAA 900 

AAAATACACA AGTATAATTC AGATGTTCAG GTAGTACCTA TTTCTATTAA AGTTTCTTCA 960 

GTAGAAGAAT TAGAAAAAAT TGTTGCGGAA TATGGGAGTA TAGATTTTAT CGTTAAAGCA 1020 

ATTGATACGC CCATTGATAT TATAAAAATT GTCAATCAAT TTGCTGTATC GCATAAGATA 1080 

TCCTACATAT CAGGAGGGTT TAATGGATGC TATCTTATTA TTGATAATAT ATATATCCCT 1140 

ACCATCGGTT CTTGCTTTGG TTGTCGGAAT ATAAACAAAG ATATAAATAA GTACACTTTA 1200 

TCTGATAAGA CAAAGTGGCC GACTACACCA GAGATGCCTG CTATTTTGGG AGGGATAATG 1260 

ACTAATTTAA TAATTAAAAT ATTTCTGGGA TGTTATAATG AAATCCTAAT AGATAACGCT 1320 

TACGTTTATA ATATGAGAAA TCATGCTCTA AGTCAAGAAA AATATGTTCT GGAAAACGGA 1380 

GAATGTCCAA TTTGTAAAAA AATAATAAAG TGAAAGATAA CAATATTAGA GCGAAAACAT 1440 

TTATTCGTTC AGTTTGTTTT TGCTTATTAT CAGGAGGAGT AGCTTTTTTA TCTGCTATTG 1500 

GGCAGTTCAC TGTTATAGAA ACACAATTAA TAGTATTGTT CTTGGGTATT ATTTTTGCTA 1560 

TATATTATGC TTACTACAAT AAAAATATTC AAACATCATT GGAAAATATA GTATGGCTTT 1620 
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TTTCATCGTT 


TGAGATTTTA 


TTTTTGCTTG 


TTAATTTTAG 


AACATTTATT 


CAGTTACCAG 


1680 


TGGATATTTT 


TATTGGTATG 


ATAATATTTT 


TAATGCTGTG 


GATATTTATT 


ATGTTAGGTA 


1740 


T, ACjTGTGTCT 


TAGTTATTAT 


ATAACTTTAT 


TATTTAGCAA 


GGAGGCTTAG 


TATGTTTAAA 


1800 


AAAATAGGTA 


TAATGAGCAT 


TTGCATATAT 


ATAATTATTT 


TATACTGCTT 


GAGAATGTAT 


1860 


CGTATTATCA 


ATAATATTGA 


AACAATCTTG 


CTAACGGTTA 


TATGCTTAAT 


GTTATTGTTT 


1920 


TTTTTAAGAC 


GTTTATTTGA 


TAAAGATAAG 


TAAATAGATG 


TTAAGTAAAA 


ATGTAGAATA 


1980 


TAAAGGAGGT 


GCAATGAGTA 


TGATTGAAGT 


TAGCCATTTA 


TCAAAAAGTT 


TTGGTGATAA 


2040 


AATAGCTTTA 


AATAATATAA 


GCTTCACTGT 


TAAAGAAGGT 


TAGATTTTTG 


GATTTTTAGA 


2100 


ACCATCTGGT 


TCTGGAAAGA 


CCACAACGAT 


TAATATTCTG 


ACTGGGCAGT 


TCCTTGCCGA 


2160 


TAAAGGACAA 


TCTATTATTT 


TGGGACAAAA 


ATCTCAAAAT 


TTAACAAGCG 


GTGAATTAAA 


2220 


GAGAATTGGA 


TTGGTTAGCG 


ATACAAGTGG 


ATTTTATGAG 


AAAATGTCTC 


TGTATAACAA 


2280 


TCTTCTTTTT 


TATAGTAAAT 


TTTATAATAT 


TAGTAAATCA 


CGTGTTGATA 


ATTTGTTAAA 


2340 


GCGAGTAGGA 


TTATATGATA 


GTCGCAAGAT 


GGTAGCAGGA 


AAATTATCCA 


CTGGAATGAG 


2400 


GCAACGAATG 


CTTTTAGCAC 


GAGCTCTTAT 


CAACAACCCC 


GCTGTACTCT 


TTCTGGATGA 


2460 


ACCGACCTCA 


GGTCTAGATC 


CCACAACTTC 


TCGAACAATT 


CATGAGTTAA 


TTTTAGAATT 


2520 


GAAAACAGCA GGGACAACGA 


TTTTTCTAAC 


GACTCATGAT 


ATGAATGAAG 


CAACTCTTTT 


2580 


ATGTGATTAT GTTGCCTTAT 


TAAATAAAGG 


GAAATTAGTT 


GAGCAAGGAG 


CTCCTTCTGA 


2640 


ACTCATTCAA 


AGATATAATA 


AAGATAAAAA 


GATTAAGGTT 


ACAGATTATA 


ATGGGAATCA 


2700 


GATAACTTTT 


GATTTTACAT 


CACTAGAACA 


GGTATCTCAG 


ACTGATCTGG 


AAAATATTTT 


2760 


TTCAATTCAT 


TCATGTGAGC 


CTACTTTAGA 


AGATATTTTT 


ATCACATTAA 


CAGGAGGAAA 


2820 


GCTAAATGCT 


TAAACGGTTT 


CTGGCTTTGG 


TATGGTTGCG 


TTGTCAAATC 


ATCCTTTCCA 


2880 


ATAAGAGTAT 


TTTATTGCAA 


GTTTTAGTGC 


CTTTTGCTTT 


CACATATTTT 


TATAAATATC 


2940 


TTATGGAAAC 


ACAGGGGAAG 


GTCAACGATC 


AACAGGCATT 


AGTTCTTTTG 


ATGATGTGTT 


3000 


TACCTTTTTC 


TTTTTCTTTG 


GCTGTTGGAA 


GTCCTATAAC 


TATTATCTTG 


TCTGAAGAAA 


3060 


AAGAAAAGTA 


CAATTTACAA 


ACTCTTCTGT 


TGAGTGGTGT 


TAAAGGCTCC 


GAATACATTT 


3120 


TATCAACTAT 


GTTTCTTCCT 


TTTTTGCTAA 


CTTTTGTGAT 


TATGGGAACT 


ACTCCTCTTA 


3180 


TTTTAGGAGT 


TACAATTGTA 


CATACTTTTA 


ATTATATTAC 


AATCGTTCTT 


CTAACCTCTT 


3240 


TATCCATCAT 


TTTATTCTAT 


TTATTGATAG 


GTTTAACCGC 


GAAGAGCCAA 


GTAGTAGCTC 


3300 


AGGTTATCAG 


TCTTCCTGCT 


ATGATTTTAG 


TTGCTTTCTT 


ACCGATGCTA 


TCTGGTTTGG 


3360 


ATAAGACAGT 


TGCGAAGATA 


ACAGATTATA 


GTTTTATGGG 


ACTATTTACT 


AAGTTTTTCA 


3420 
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CAAAATGGGA GGAATTTTCA TGGAATAAAA CTCTAATTCC TAATCTAACA CTACTTATTT 3480 

GGATTGTTCT TCTATTAACT TTAATTACGA TAACTATTAG GAAAAAGAAA ATTTCTTAAT 3540 

TGAGTTATTT TAATGATTAT AAACACAAGT GGGAAGGAAA AAATGAACTG ATCTTTTTGA 3600 

CAGCAATTCT ACAGAATAGT CTTATTGCTA TATTTTGATT TGAGTGTACG AAAAAAGAAA 3660 

AATAACAATA GTGCTCATAC TAATTGCAGA AGTTTTGGGT GATAAGATAA CTGATAAATT 3720 

- GCAATAAAAA ATGCAACATT TTTAAATCTC CTCTATAAGT GCTTCAAAAA GTGCTTCAAA 3780 

ACCTGTCTTG TAATCCAAGT ATTTTTGGGG ACGGTGATTA ATAAGCTAGC AAAGCATCAT 3840 

TAAGGATTTT TTCGGTAATT GTTGCCAAAT CGGTTTAAGA AAATACTCAC GAAGAAGTCC 3900 

ATTCGCATTC TCATTACTTC CCCTTTGCCA AGATGAATAG GCATCCGCAA AATAAAACAG 3960 

AATTCCCATT TGTTCAATTA AAGGGTAACA AGCAAACTCT TTTTCTCTGT CCGAAGTGAA 4020 

AGTCTTTAAC TATTCTTTTG GAAAGAGTCT TGTGAGGTGT TCAATAGCAG TCAACATGGA 4080 

TTTAGCTGTT TTTACTTGAC AAGTGCTAGT AGAAATAATA GAATAGTAAA AAACCTTTAA 4140 

AGCAGTCCAG AGAGGCAGCT AAGGTTAGAC GGTGAAAGGG TGGAGACTAC CCATTTTTCG 4200 

TGGAACCTTG CTGTTGGCAG GTTCCTTTTT TCGTGGCTTC TGTTGGCCAG ACTCTCTCAC 4260 

TAGTAAAGGT AAAAGGAGAA ACCTATGCGA GAACATCGTC CAATCATTGC TCTTGATTTT 4320 

CCTAGTTTTG AGGCGGTCAA GGAATTTTTA GCTCTTTTCC CAGCAGAAGA AAGCCTTTAT 4380 

CTCAAGGTAG GGATGGAGCT TTATTACGCA GCGGGGCCTG AGATTGTGTC CTACTTAAAA 4440 

GGTTTGGGTC ATAGTGTCTT TTTGGATCTC AAACTTCATG ACATTCCTAA TACAGTCAAG 4500 

TCAGCCATGA AGATCTTGTC TCAGCTTGGT GTCGATATGA CTAATGTCCA TGCGGCTGGT 4560 

GGTGTAGAGA TGATGAAGGC GGCGCGTGAA GGTCTTGGGA GTCAAGCCAA ATTGATCGCT 4620 

GTAACTCAGC TCACATCAAC GTCAGAAGCT CAGATGCAGG AGTTTCAAAA TATCCAAACC 4680 

AGTCTGCAAG AGTCTGTGAT TCACTATGCC AAGAAGACAG CTGAAGCTGG CTTGGATGGT 4740 

GTTGTTTGCT CGGCTCAGGA AGTACAAGTC ATCAAGCAGG CTACCAATCC AGATTTTATC 4800 

TGTCTGACAC CAGGGATTCG TCCAGCTGGT GTTGCAGTTG GAGATCAAAA ACGAGTCATG 4860 

ACACCTGCTG ATGCCTATCA AATCGGCAGT GACTATATCG TAGTGGGACG TCCCATTACC 4920 

CAAGCTGAGG ATCCTGTTGC AGCTTATCAT GCCATCAAGG ATGAATGGAC ACAGGACTGG 4980 

AATTAAAGAA CTAGATTAGA AAAATAAAAG GAGAATACCA TGACACTTGC TAAAGATATC 5040 

GCTAGCCACC TCTTGAAAAT CCAAGCCGTT TACCTCAAAC CAGAGGAACC CTTCACTTGG 5100 

GCATCTGGTA TCAAGTCACC GATTTACACT GATAATCGTG TGACACTAGC CTATCCAGAA 5160 
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ACTCGTACCC TAATTGAAAA TGGTTTTGTG GAAGCTATCA AAGAAGCCTT TCCTGAAGTA 5220 

GAAGTGATTG CAGGAACTGC AACAGCAGGG ATTCCACACG GAGCCATTAT TGCTGATAAG 5280 

ATGGACTTGC CTTTTGCCTA CATCCGTAGT AAACCAAAAG ACCACGGAGC TGGTAATCAA 5340 

ATCGAAGGTC GCGTAGCTCA AGGTCAAAAA ATGGTAGTGG TTGAAGACCT TATTTCAACG 5400 

GGTGGTTCAG TTCTPGAAGC TGTAGCAGCA GCCAAGCGAG AAGGAGCAGA TGTACTTGGA 5460 

GTTGTAGCGA TTTTCAGCTA CCAATTGCCA AAAGCAGATA AGAACTTTGC AGATGCTGGT 5520 

GTTAAACTTG TGACGCTTTC AAACTATAGC GAGCTTATCC ATCTAGCCCA AGAAGAAGGT 5580 

TACATCACGC CAGAGGGCCT TGATCTTCTA AAACGCTTTA AAGAAGACCA AGAAAATTGG 5640 

CAAGAAGGTT AGGTCAGTAA GATAAAGAGA GACGAGGCTA CCGAGTCTCT TTTACCATTT 5700 

TATTTAAAAT ATGACAG 5717 
(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5558 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 



CCTGGACTTT 


CTAAAATGAA 


ATCTTGCGAC 


CTGGATCAAG 


CCCTTCATGA 


GCATTTTTCA 


60 


GAAGAAGAAT 


TAGCTGGTCA 


CTTTCATGTC 


CTTCTATGGA 


CTTTTTTTAC 


AATGGCATTG 


120 


CTATCACACC 


CAATACCTAT 


CTAAGCGCCT 


GGTTCGTAAA 


CTTTATTGCA 


GCTCTTCCTC 


180 


TAAATTTCCT 


AATTGTTGAA 


CCAATTGCCC 


GTTTTATACT 


AAGTTCTTTT 


CAGAAACCAT 


240 


TTACTGGGGA 


AGAAGTTGAA 


GATTTTCAAG 


ATGATGATGA 


AATCCCAACT 


ATTATCTAAG 


300 


CCAGTTCTGT 


AAACTACTAA 


TATTTGAAAT 


CCACTTCCTT 


TTAGGGTGCA 


ATGGTTATAA 


360 


ATGAATTTTT 


GAGAGGATCA 


GAATGAAAAA 


ACTAGCAACC 


CTTCTTTTAC 


TGTCTACTGT 


420 


AGCCCTAGCT 


GGGTGTAGCA 


GCGTCCAACG 


CAGTCTGCGT 


GGTGATGATT 


ATGTTGATTC 


480 


CAGTCTTGCT 


GCTGAAGAAA 


GTTCCAAAGT 


AGCTGCCCAA 


TCTGCCAAGG 


AGTTAAACGA 


540 


TGCTTTAACA 


AACGAAAACG 


CCAATTTCCC 


ACAACTATCT 


AAGGAAGTTG 


CTGAAGATGA 


600 


AGCCGAAGTG 


ATTTTCCACA 


CAAGCCAAGG 


TGATATTCGC 


ATTAAACTCT 


TCCCTAAACT 


660 


CGCTCCTCTA 


GCGGTTGAAA 


ATTTCCTCAC 


TCACGCCAAA 


GAAGGCTACT 


ATAACGGTAT 


720 


TACCTTCCAC 


CGTGTCATCG 


ATGGCTTTAT 


GGTCCAAACT 


GGAGATCCAA 


AAGGGGACGG 


780 


TACAGGTGGT 


CAGTCCATCT 


GGCATGACAA 


GGATAAGACT 


AAAGACAAAG 


GAACTGGTTT 


840 
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CAAGAACGAG 


ATTACTCCTT 


ATTTGTATAA 


CATCCGTGGT 


GCTCTTGCTA 


TGGCTAATAC 


900 


TGGTCAACCA 


AACACCAATG 


GCAGCCAGTT 


CTTCATCAAC 


CAAAACTCTA 


CAGATACCTC 


960 


TTCTAAACTC 


CCTACAAGCA 


AGTATCCACA 


GAAAATTATT 


GAAGCCTACA 


AAGAAGGTGG 


1020 


AAACCCTAGT 


CTAGATGGCA 


AACACCCAGT 


CTTTGGTCAA 


GTGATTGACG 


GTATGGATGT 


1080 


TGTGGATAAG 


ATTGCTAAGG 


CCGAAAAAGA 


TGAAAAAGAC 


AAGCCAACTA 


CTGCTATCAC 


1140 


AATCGACAGC 


ATCGAAGTGG 


TGAAAGACTA 


CGATTTTAAA 


TCTTAAAAAC 


CAAAAAAATA 


1200 


CAGTATCCAC 


ATTCGGTACT 


GTATTTCTTT 


TACTCTCATT 


CTTAAGTTAA 


ATTATTAAAA 


1260 


TCCCATATTT 


GGTCTATCCA 


GCCTTCATAA 


AAGTCTGGCT 


CGTGGCAGAC 


CATAAGGATA 


1320 


GATCCCCTAT 


ATTCTTTGAG 


AGCGCGTTTG 


AGCTCATCCT 


TTGCATCCAC 


ATCCAAATGG 


1380 


TTGGTCGGCT 


CGTCCAGCAC 


TAAAACGTTG 


TTTTCACGAT 


TCATCAAGAG 


ACAGAAACGA 


1440 


ACCTTGGCTT 


GCTCTCCCCC 


TGATAATACT 


TGAATCTGGC 


TTTCAATATG 


TTTGGTTGTC 


1500 


AAACCACAAC 


GGGCAAGGGC 


TGCACGGACT 


TCTGCTTGAT 


TAAGGGCAGG 


AAAGGCATTC 


1560 


CAGACAGCTT 


CAAGAGGAGT 


TTGGCGATTA 


CCGCCTTCTA 


CTTCCTGCTC 


AAAATAACCA 


1620 


AGTTCTAAAT 


AATCTCCACG 


CTCCACTTCC 


CCAGCGATTG 


GCGAGATAAT 


GCCCAAGAGA 


1680 


CTCTTCAAGA 


GAGTTGTTTT 


TCCAATACCA 


TTAGCACCAA 


TAATCGCAAC 


CTTTTGATTG 


1740 


CGTTCGAAGG 


TAAGATTTAA 


AGGCTTAGTA 


AGAGGACGGT 


CGTAACCAAT 


TTGCAAGTTC 


1800 


TTGGCTTGGA 


AGATAAAGCG 


CCCTGGTGTA 


CGAGCTGGTT 


TGAAATCAAA 


GGATGGTTTT 


1860 


GGTTTCTCAC 


TTTGGAGTTC 


GATAATATCC 


ATCTTATCCA 


ATTTCTTTTG 


ACGAGACATA 


1920 


GCCATATTAC 


GAGTTGCAAC 


ACGGGCTTTA TTACGAGCCA 


CAAAGTCCTT 


GAGGTCTGCA 


1980 


ATCTCTTTCT 


GCTGGCGTTC 


GTAGGCTGCC 


TCTAGCTGAG 


ATTTCTTCAT 


AGCATAAACT 


2040 


TCTTGGAACT 


GGTAGTAGTC 


ACCAGAGTAA 


CGCGTCAGCT 


GTTGATTTTC 


CACATGATAG 


2100 


ACAATATTAA 


TAACGTCATT 


GAGGAATGGA ATATCGTGCG 


AAATGAGAAC 


AAAGGCATTC 


2160 


TCATAGTTTT 


GGAGATAGCG 


CTTGAGCCAA 


TCAATATGCT 


CAGCATCCAA 


GTAGTTGGTC 


2220 


GGCTCGTCCA 


ACAGCAAGAT 


ATCAGGCTTT 


TCAAGGAGAA 


GTTTTGCCAA 


AAGCACCTTG 


2280 


GTTCTTTGCC 


CACCTGACAA 


AGAAGTTACA 


TCCGTATCCA 


TGCCAAAGTC 


CATAACACCA 


2340 


AGAGCACGCG 


CTACTTCGTC 


AATCTTAGCA 


TCCAAGGTAT 


AGAAATCACG 


ACTCTCCAGA 


2400 


CGGTCTTGAA 


GTTCTCCTAC 


TTCTTCCATG 


AGAGCATCAA 


CATCCGCGCC 


GTCTTCAGCC 


2460 


ATTTTCATAT 


AGAGGTCATT 


GATACGAGCT 


TCAGCTTTGA 


AAAGCTCATC 


AAAAGCCGTA 


2520 


CGGAGAACAT 


CACGCACCGA 


CTGTCTTTCA 


GCAAGGACAG 


AGTGCTGATC 


CAAGTAACCA 


2580 
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GCCGTCACAT ATTTGGACCA CTCAACCTTT CCTTCATCTG GCAGCATTTT ACCAGTCACG 2640 

ATACTCATAA AGGTTGATTT TCCTTCACCA TTGGCACCGA CCAGGCCGAT ATGTTCTCCC 2700 

TTGAGGAGAC GGAAGGACAC ATCTTCAAAA ATTGCACGGT CACCAAAACC GTGACTCAGA 2760 

TTTTTAACTT CTAAAATACT CATTTTAATT CCTTACCTTG TTTTTATGTA ATCGTTTATA 2820 

AAGGAGCCAA GCCAGATAGC CACCCAAAGT GTTGGTCCAC AAATCATCAA TCTCAAAGAC 2880 

GCGATTGAAA TCAAAGAAAA AGTCCAAGAT TAATTGCGTA CACTCGATTC CAAGACTCAC 2940 

AAGAAAACTA AAAAGAAGGA CCTTTTTTGT TTTCCGCAAA TTTGGAAATA GATAAAGGAG 3000 

TTGGAAAATC AGAGGAAAAA ACAAGAAGAC ATTGAGGATA TTTTGTAAAA AAATCCAACA 3060 

TAATTGTCCA ATGTCACTCA CTTCGCCCAG TTTCCAGAGA GAATTGAAAG GAGTCAAAAG 3120 

AAAAACCAGG CGTCCAAGAT GCTGAATACC TGGAGTTCCC ACTCCCACGG TAGATTGTTC 3180 

TTGAGGAGTA AAGCAAAAAC AGACAATGCA AATGCTATAG AAAATGACTC CCCAGACCAA 3240 

AATATGATTA TAAGTCTTCT TCATCATTAA GGATTTACCG CTGCGACTGC CTTCTGGCGG 3300 

TCACGTTTCA TTGTGTTAGA GCGCAATTGT CCACAAGCTG CGTCAATATC TGTACCATGC 3360 

TCTTGACGAA CCACACAGTT GACCCCTTTT TTCTTAAGCG TATCATAGAA AGCCAACACG 3420 

CACTCTTTGG GACTACGGCT ATATTGGTCA TGCTCACTAA CTGGGTTATA AGGAATCAAG 3480 

TTTACATAAG ACAATTTCTT GATGTTCTTG AGCAATTCAG TCAATTCCAA GGCTTGTTCT 3540 

ACACCGTCGT TGACTTCATT AAGCATGATA TATTCAAAGG TTACACGACG GTTTGTTGTC 3600 

TCAATGTAGT ATTCAATAGC AGCAAAGAGT TTTTCAATCG GAAAGGCACG GTTAATCTTC 3660 

ATGATACTTG AACGAAGTTC ATTGTTAGGT GCGTGAAGAG ACACGGCAAG ATTGACCTGA 3720 

ACCCCTTCAT CAGCAAAGTC ACGAATTTTA TGAGCCAAAC CTGAGGTTGA AACCGTGATG 3780 

TGACGAGCAC CGATAGCCAT TCCTTTATCA TCATTGATAG TACGAAAGAA ATTCAAGACA 3840 

TTGTTGTAAT TATCAAAGGG CTCACCGATT CCCATGACAA CGATATGGCT GATGCGTTCA 3900 

TCCTGACCAC GCTCATCAAA GTATTTCTGA ACCAGCATGA TTTGCGCTAC GATTTCACCG 3960 

TTATTGAGGT CACGTTGCTT CTTAATCAAA CCAGAGGCAC AGAAGGTACA ACCGATATTA 4020 

CAGCCGACCT GAGTGGTCAC ACAGACAGAT AAACCATAGT GTTGACGCAT GAGTACAGTC 4080 

TCAATTAACA TACCGTCGGG CAATTCAAAG AGATATTTGA CTGTACCATC AGCAGACTCT 4140 

TGCACAATAC GTTGTTTCAA GGGATTGACC ACAAACTGGT CATTGAGCTT AGCAATCAAA 4200 

TCCTTGGAAA GGTTGGTCAT TTCTTCAAAT GACTGCACAC GTTTACGGTA GAGCCATTCC 4260 

CAGATTTGAT CTGCACGGAA TTTCTTTTCT CCCTGCTCCA ATACCCATTC CTGCATGGTT 4320 

TGATGTACCA AACTATGAAT TGAGGGTTTC ATTTCTTCTC CTTATTCTCT ACTCACTTCT 4380 
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GACGAATGAC AAAATGACGT TGTCCCTTGT CGTCTTTCTG 
CTGCATTCGA CTTTCGTTTA GTTTGAGTCG GTTTCTTTCC 
CTTCCGTCTT ACGCATTTTC TTGTCAAATG ATGCTCGCTT 
CAAAATAGGC ACAACCATAA CTACAATACT CTAAAAGGTA 
TTTCAAGTTT TTCTTCTGTT CGGTCATCCT TGTAAAAACC 
TGCTCCAGTC CCCCACGATA TAATCAAACT TGGTTAATAC 
AAGTCGTCAC ATCAAAGGCA TCCTTGATAT TTTCAACCAA 
TTTCGACCTT GTCCCCGTGT AAATGGAACT CCGGACCAGG 
ATTCAGGTGC AATTTCTTTT CGCATAGATA TCCTTTTTTC 
TCTACCATAA TTTCTAGCAG TTAGCACGTT TCTCATAAAA 
TTGTCAGACC AGAATCTTAT AACCTAAAAA GAGAAGAACA 
TTATTTAGCA GCTGCGTACA ATTCATCTAC TTTATTCCAG 
TTTGATGTAG TCAGGACGCA CGTTGCGGTA TTTCACGTAG 
CAAGCCCAAG ATTGGTTTTT TACCTTCTGA GATTGGTGTG 
CACTTCAAGT TTCCCTTCTT TGTTGACAAC CAACCATGCC 
TGTTGCTGCT GCAGTGAAGG CTGCTTGGAA TTCTTCAAAT 
TGCTGCTGCC AGTTCTGCTG AAGGAGCTGT TTTCTCGGGA 
AGCGTGGTTC AAGTGTCCGC CACCATTGTT GATAAGTGCT 
AGATTCTACA TCAGCAAGCA AGGCTTCAAG GTCTTCACCG 
AGCTGCATTG GCATTGTTGA CATAAGTTTG ATGGTGTT 
(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS* 

(A) LENGTH: 6735 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



ACGACGTCTA 
TTTTCTAGAA 
AGGGGCTTCA 
GTCTTGTAAA 
TCGTAGGCGA 
TTCTGAAAAA 
GGAAAAAGCT 
AAACTTGTTA 
ACGATTACTT 
ATGAAAAAAG 
ATTCTTCCCT 
TTGATTACTG 
TAAGCATGTT 
TCTTGGTTTG 
CAACCTGAAC 
GAACCAAATG 
GTCATCAATT 
TGACGGATAT 
ATTTCAGGGT 



TTTTTCTTAT 
GGTGTTTCTT 
TTTTCTAAGA 
CGACTGATTT 
AGCTGTTCGT 
CGCTGATTAA 
ATCCCTTCCG 
TAGTTGTATA 
AATACTTTAT 
TCTGACGATT 
CCAACTATCA 
AAAAGAAAGC 
CCCAAACGTC 
CTGTTGAAGT 
CAAAACGAGT 
TTGCATCGAT 
CCCAGAAAAG 
CAGCTGGGAT 
GTTTTTCTAA 



4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5558 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 
GGAATTGTAA ATATCATATT GTTTTTGCAC CCAAATATCG TCGTCAAATC ATTTATGGCA 
GATACAAAGC TAGTATCGGA AGAATCATAC GTGACTTATG TGAGCGTAAG GGTGTAATAA 
TCCATGAAGC GAATGCTTGT TCAGACCATA TTCACATGCT TATCAGTATT CCTCCGAAAC 



60 
120 
180 
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TTAGTGTTTC 


GTCCTTTATG 


GGCTATTTAA 


AGGGCAAGAG 


CAGTTTGATG 


ATTTTTGATA 


240 


AGCATGCGAA 


TTTAAAATAC 


AAATATGGCA 


ATCGCAAGTT 


TTGGTGTAGA 


GGCTATTATG 


300 


TAGATACGGT 


AGGCCGTAAT 


CAGAAAGTGA 


TAGCTGAATA 


TATTCAGAAT 


CAATTACAAG 


360 


AAGACAGAGT 


AGCAGACCAG 


CTCACGTTAT 


TCGAGTCAGT 


AGATCCGTTT 


ACTGGCGAAA 


420 


TAAATAAGAG 


GAAGTAACTA 


AGGTGCTTTA 


GCACCTGCTC 


GGGAAAGTGG 


TGCGCGAGGA 


480 


AGCTATTTCG 


GTGGGCCTTT 


GGCCCTGGCC 


GGTAGAAGCG 


GCTTATAGCC 


GCAGAACAAA 


540 


CCACCAGTTC 


ACACTGGTGG 


TTTTGATTTA 


AAAAACTTGA 


TACATAAAAA 


TAAAAGTCTA 


600 


TATAAAGGAT 


GGTAAAATTC 


CTGTTGTCCG 


ATTTGGACAA 


TATCCTAAAT 


AGTTACAATA 


660 


TATGGTCTAT 


ACTTTTTCTT 


AGGAGAAAGC 


TAGATGTACA 


GACGTTTGAG 


AGATTTGAGG 


720 


GAGGATCATG 


ATCTGCCCCA 


AAAGCAAATA 


GCTACAATAC 


TTTCGTTTAC 


AAATTCAGCT 


780 


TATGCCAAAA 


TTGAACGGGG 


TGAGCATGCG 


TTGACGGCTG 


ATGTATTGGT 


TAAACTCTCA 


840 


GATTTCTATG 


ACGTCAGTAC 


AGACTATTTA 


TTGGGATTAA 


CTGATTTTCC 


TGATAAAATT 


900 


CGCTTTAGAA 


AATAATCTCC 


TCAATTTCAT 


AGAGTTTGAA 


AATGAGTGAG 


ATTTTTTATT 


960 


TGCCCTTTGA 


CAACTGAATA 


GCCTAAAATG 


GTACTTTCCT 


CATTTGTGGA 


GCAAATTTGA 


1020 


ATGGCTCGCC 


ATGATAAGAG 


CGATTTTAAA 


ATCATCAATA 


AAATAGAGCG 


ATACTTTATA 


1080 


TGCCATGATA 


CAAATGATAT 


ACAATGATAC 


TTCTGACCGT 


TCAGCCTGCC 


AACGTAAAAG 


1140 


AGCAGCAAGT 


GAAATTCTTA 


TGATGACTTC 


ATCAGTCATG 


CCACGTTGAA 


TGTGTGAGTT 


1200 


TGTTAGATAA 


ACGCAATTAA 


TCCTCAAAAG 


GTTCCCCGAA 


CCTTTTGAGT 


TCTACAGACG 


1260 


CATCACGTGG 


AGTGTGTAAG 


CTTGTTGCTA 


AAAGCGTAAA 


AACCTTGGAA 


CGAAAGGAAT 


1320 


AATAGACTTT 


CTGCGAAACA 


AAAATATAAT 


ACAATAAAAC 


TATGAATGAT 


GAAGCAAGTA 


1380 


AACAATTGAG 


CGATAGCCGT 


TTCAAGATCC 


TTGTAGGTGT 


TCAGCGCACG 


ACTTTTGAAG 


1440 


AGATGTTAGC 


TGTGTTAAAA 


ACAGCTTATC 


AACGTAAACG 


CGCAAAAGGT 


GGACGAAAAA 


1500 


GCAAATTAAG 


CCTAGACGAT 


CTCCTTATGG 


TAACTATTCA 


ATACATGCGA 


GAATAGAGCA 


1560 


CTTATGAACA AATTGCGGCT 


GATTTTGGCA 


TTCACGAAAG 


CAACTTAATC 


CGTCGGAGTC 


1620 


AATGGGTTGA AGCAACTCTT ATTCAAAATG 


GTTTTACGAT 


TTCAAATTCT 


GCCTTAATTC 


1680 


TGTAAAAACA 


GTAAAATTCG 


AAGGATTGTA 


AGGTAAGAGT 


TTTTTTCTTT 


CTGAAAAAAT 


1740 


GGTATAATAG 


CAATCAAAAC 


TAGAAAATAA 


AACGGAATTT 


GGAACAGATT 


TGTCTGTATC 


1800 


CTAGTAGAGT 


GGTGATACTA 


TGAAGATTAG 


TAAGAGGCAC 


TTATTAAATT 


ATTCCATCTT 


1860 


GATTCCCTAC 


TTGCTTTTAT 


CTATTTTGGG 


CTTGATTGTG 


GTCTATTCGA 


CCACCAGTGC 


1920 


TATTTTAATT 


GAAGAAGGCA 


AGAGCGCCTT 


GCAGTTGGTT 


CGAAACCAAG 


GAATCTTTTG 


1980 
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GATTGTTAGT TTGATACTGA TTGCCTTAAT TTATAAATTG AGACTAGATT TTTTGAGAAA 2040 

TGAGCGACTA ATCATTTTAG TTATATTAAT AGAAATGCTT TTATTGTTCT TGGCTCGTTT 2100 

TATTGGTATT TCCGTAAACG GGGCATACGG TTGGATTTCG GTTGCAGGAA TAACTATTCA 2160 

GCCAGCTGAG TACTTAAAAA TCATTATTAT TTGGTATTTA GCTCACCGAT TCTCCAAACA 2220 

GCAAGAAGAA ATAGCTACTT ATGATTTTCA AGTTTTGACT CAAAATCAAT GGCTTCCCCG 2280 

- TGCTTTTAAT GATTGGCGAT TCGTTCTCCT AGTTCTGATT GGAAGTTTGG GAATTTTCCC 2340 

TGATTTAGGA AATGCGACTA TTTTAGTCTT GGTTTCCTTG ATTATGTATA CAGTTAGTGG 2400 

AATCGCTTAT CGCTGGTTTT CAACCATTCT GGCGCTCGTA TCTGCCGCTT CTGTCTTTGT 2460 

CTTGACCACT ATCAGCCTAA TCGGTGTTGA GACCTTTTCA AAAATTCCAG TATTCGGCTA 2520 

TGTAGCCAAG CGCTTTAGTG CCTTTTTTAA TCCTTTTGCC GATCGTGCTG ATGCAGGTCA 2580 

CCAGTTAGCT AATTCTTATT TTGCCATGGT CAATGGCGGT TGGTTTGGTC TAGGTCTTGG 2640 

AAACTCGATT GAAAAACGAG GTTATTTGCC AGAAGCTCAT ACAGACTTTG TCTTTTCTAT 2700 

CGTGATTGAA GAATTTGGCT TTGTTGGTGC CAGTCTTATT TTAGCTCTCT TGTTTTTCAT 2760 

GATTTTGCGG ATTATCTTGG TCGGTATCCG AGCGGAGAAT CCTTTCAATG CCATGGTTGC 2820 

ACTCGGTGTC GGAGGGATGA TGTTGGTTCA GGTATTTGTC AATATCGGAG GGATTTCGGG 2880 

CTTGATTCCA TCTACAGGAG TGACTTTCCC CTTCTTATCC CAGGGTGGAA ATAGTCTTCT 2940 

AGTCTTATCA GTGGCAGTAG CCTTTGTCTT AAATATTGAT GCCAGTGAAA AACGCGCTAA 3000 

ATTGTACCGA GAATTGGAAA ATCAACCAAT GAACCTTCTG TTGAAGTAGG ATAAAGAAAG 3060 

GATAGTTTAT GTCTCTTCAA AAATTAGAAA ATTATAGTAA TAAAAGTGTT GTGCAAGAAG 3120 

AAGTCTTGAT TCTAACAGAA TTACTGGAAG ATATTACTAA AAATATGCTT GCCCCAGAGA 3180 

CCTTTGAAAA AATAATACAG TTGAAAGAAT TATCAACGCA GGAAGATTAT CAAGGTCTAA 3240 

ACCGTCTAGT GACTAGCTTA TCAAATGATG AAATGGTCTA TATTTCACGC TATTTCTCTA 3300 

TCTTGCCTCT TTTGATTAAT ATTTCAGAGG ATGTGGATTT AGCTTATGAA ATCAATCATC 3360 

AAAATAATAT TGATCAGGAC TATTTAGGTA AATTATCTAC AACGATTAAA TTGGTAGCAG 3420 

AAAAGGAAAA TGCCGTTGAG ATCCTAGAAC ACTTGAATGT TGTCCCTGTT TTGACAGCCC 3480 

ATCCAACACA AGTGCAACGC AAAAGTATGT TGGATTTAAC AAATCATATT CATAGTCTTT 3540 

TGCGTAAATA CCGTGATGTT AAGTTGGGGT TGATCAATAA AGATAAATGG TACAATGATT 3600 

TGCGTCGTTA CATCGAAATT ATCATGCAGA CAGACATGAT TCGTGAGAAA AAATTAAAAG 3660 

TGACTAACGA AATCACGAAT GCTATGGAAT ATTATAACAG CTCCTTTTTG AAAGCTGTAC 3720 
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-A^GAC GACGGAGTAT AAGCGCTTAG CGCAA^GCA roarer TTAAAACAGG 
CTAAACCAAT CACCATGGGT ATGTGGATAG GTGG^ACCG 

CACCAAAOAC C^AAGCAG TC^CACTCA CTCAGTGTGA AGTCATCATG AACTACTATG 
ATAAAAAGAT TTACCAACTT TATCGTGAAT T^^c ^ 
GCAAGCAAGT CAGAGAAATG GCTCGTCAAT CCAAGGATAA CTCGATTTAC CGCGAAAAAG 
AGC^ACCG WWcno TTTGATATTC AATCAAAAAT TCAGGCAACT AAAACCTATC 
TGATTGAGGA TGAAGAAGTT GGGACTCGTT ATGAAACCGC CAATGATTTC TACAAGGATT 
— CA T TCGAGATTCT CTACTAGAAA ATAAGGGCGA GTCCTTGATT TCAGGTGATT • 
TTGTGGAATT A^CACGCA GTAGAGATAT ^G^ OTTAGCATCA 
GACAAGACTC TAGCGTCTAT GAAGCCTGTG TGGCAGAAC* CTO3 AAA TCA GCAGGAATTC 
ATTCTCGTTA TAGCGAC^ AGCGAAGAAG aaaag™* cc™*™ 
AAGAAGATCC CCGAATTCTT TC^CGACTC ACGCAGAAAA ATCAGAATTA TTAGCAAAAG 
AATTAGCTAT TTTTAAGACG OOWBWm TCAAA^ 

GTCAGACCAT CATTTCACAT GCAACCAGCC TTTCTGATAT GCTAGAATTA GCTATTCTGT 
TAAAAGAAGT AGGACTGGTG GATACGGAAA GGGCGCGTGT TCAGATTGTT CCCC™* 
AAACAATTGA AGACTTGGAT CATTCAGAGG AAACAATGAG AAAATATCTT ^AGCC 
TTGCCAAAAA ATGGATTGAC TCACGAAATA ACTACCAAGA AATCATGCTT GGCTACTCTG 
ACAGTAATAA AGATGGOGGT TACTTGTCAT CATGTTGGAC CCTCTACAAG GCTCAACAAC 
AATTGACTGC TATTGGAGAT GAATTTGGCG TTAAGGTTAC CTTC^CAT GGTCGTGGTG 
GTACTGTCGG TCGTGGTGGT GGGCCAACCT ATGAAGCCAT TACATCTCAA CCGCTCAAGT 
CTATCAAGGA TCGTATCCGC TTGACGGAGC AGGGTGAAGT AATTGGGAAT AAATACGGTA 
ACAAAGACGC CGCTTACTAT AACCTTGAAA TGCTAGTATC GGCAGCTATT AACCGTATGA 
TTACTCAGAA GAAGAGCGAT ACCAATACCC CAAATCGTTA TGAAACCATT ATGGATCAAG 
TAGTGGACCG TAGTTACGAT ATCTACCGTG ATTTGGTCTT TGGTAATGAG CATTTCTATG 
a^a™^ CGAGTCAAGT CCAATCAAGG CTA^CAAO ttttaatatt ggttctcotc 
CAGCCCCTCG TAAGACTATT ACTGAAATCG GTGGTTTGCG ^CCATCCCT TGGGTATTCT 
CATGGTCACA gagtcgtg^ ATOrT0CCM GA^ACGG GGTTGGTTCA AGCTTCAAGG 
AATTTATCAA TAAAAATCCA GAGAATATTG CTATCTTACG AGATATGTAC CAAAATTGGC 
CT^CA ATCGC^C. TCAAATGTTG ATATGGTTTT QKMMKA ^ ^ 
™C™* ATATGCTAAA C^CAAG ACGAGCAAGT TAAGGCCATC TATGAGACTA 
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TTTTAAATGA ATGGCAAGTT ACTAAGAACG TTATCTTGGC TATTGAAGGA CATGACGAAC 5580 

TCTTAGCTGA CAATCCATAT CTAAAAGCTA GTCTGGATTA CCGTATGCCT TACTTTAATA 5640 

TTCTCAACTA TATTCAGTTG GAGTTGATTA AACGCCAACG TCGTGGAGAA TTGTCCAGTG 5700 

f ATCAAGAACG ATTGATTCAT ATCACCATCA ACGGAATTGC GACAGGATTG CGTAATTCAG 5760 

GTTGATAATT TTCAAGAGTG AATGCTAAAA GTGAATATCA AAAAAATTCT AATAGACTAT 5820 

TGACAAGTAG TTTAAAAATG ATATAATTTA ACCATTCAGA AAAGTAATCA TACAAACTTT 5880 

TTAGAGAGTC TGTGGTAGCT GAAAACAGAT AAGTGGCAAT GATGAAAATT GGGCTGAATG 5940 

CTATTTAGAA TTTGAAATTA TAAAAATTCG GTAAGCACAC CTTACAGTGC ATCTCGTTAT 6000 

TGCGAGACTG AGCGATAGGG AAATTCCCTA TAATTGAGGT GGTACCGCGC ATCGACGTCC 6060 

TCACACAAGT TTTTTGTGTG AGGATTTTTT TGATGGAGGT TAGTATGGAA AGAAAACGAT 6120 

GGCGTCGCTT GTTTAGATAA GTGAAATATG TTAAAGGAAA TAAAAAGGAG AAACAGAATG 6180 

AAAAATAAAC GTTTAATTGG AATTATTGCT GCATTAGCAG TCTTAGTAGC AGGAAGCTTG 6240 

ATTTATTCTT CAATGAATAA ATCAGAAGCT CAGAATAATA AGGATGAGAA GAAAATAACC 6300 

AAGATTGGTG TGCTTCAATT TGTGAGCCAT CCATCCCTTG ATTTGATTTA TAAAGGGATC 6360 

CAAGATGGAC TTGCAGAAGA AGGATATAAA GATGATCAAG TTAAAATTGA TTTTATGAAC 6420 

TCAGAAGGTG ACCAAAGTAA GGTTGCGACA ATGAGTAAAC AATTGGTTGC AAATGGGAAT 6480 

GACCTTGTGG TTGGTATCGC AACACCAGCA GCCCAAGGGT TGGCTAGTGC AACAAAAGAC 6540 

CTACCGGTTA TCATGGCCGC TATTACAGAC CCAATTGGTG CTAACTTGGT TAAAGATTTG 6600 

AAAAAACCAG GTGGCAACGT TACAGGGGTA TCTGACCACA ATCCAGCTCA ACAACAAGTT 6660 

GAACTCATCA AGGCTCTGAC ACCGAATGTG AAAACAATCG GAGCTCTTTA CTCAAGTAGC 6720 

GAAGACAATT CAAAA 6735 
(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6516 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

CTAGAGGATC CCAGCAGGTA AATTGGCTTC AGCTGGCAAA AAAGTTGCCC TCGTTGAACG 60 

CAGCAAGGCT ATGTACGGTG GAACTTGTAT CAACATTGGT TGTATCCCAA CTAAAACCTT 120 
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GCTAGTTGCT 


GCTGAAAAGG 


ACTTGTCTTT 


TGAAGAAGTC 


ATTGCTACTA 


AAAACACGAT 


180 


CACTGGTCGC 


CTCAACGGTA AAAACTATGC 


GACTGTTGCT 


GGTACAGGCG 


TAGATATCTT 


240 


TGATGCGGAA 


GCTCACTTCC 


TTTCAAATAA 


AGTCATCGAA 


ATCCAAGCTG 


GTGATGAAAA 


300 


GAAAGAACTG 


ACTGCTGAAA 


CAATCGTCAT 


CAACACTGGT 


GCTGTTTCAA 


ACGTCTTGCC 


3 60 


AATCCCTGGA 


CTTGCTACAA 


GCAAAAACAT 


CTTTGACTCA 


ACAGGTATCC 


AAAGCTTGGA 


420 


CAAATTACCT GAAAAACTTG GAATCCTTGG TGGCGGAAAT ATCGGTCTTG AATTTGCCGG 


480 






GCAAGGTCAC 


AGTCCTAGAT 


GCCTTGGATA 


CATTCCTACC 


540 


TCGTGCAGAA 




CAGCTCTTGC 


TAAACAATAC 


ATGGAAGAAG 


ATGGCATTGA 


600 


n± x vjv> X 1 Win 




CTACTGAAAT 


CAAAAACGAT 


GGTGACCAAG 


TGCTTGTCGT 


660 


AACTGAAfiAP 


n a a a r*PT a c*f* 


GTTTCGACGC 


CCTTCTCTAC 


GCAACTGGAC 


GCAAACCAAA 


720 




^.i. 1 L^\Av_ lib 


AAAATACAGA 


TATTGAACTA 


ACTGAACGTG 


GTGCTATTAA 


780 


afiTapapa a & 


L. At- 1\3 1 t-AAA 


CAAACGTTCC 


TGGTGTCTTT 


GCAGTTGGAG 


ATGTCAACGG 


840 


1 uValA» 1 l^JVA 


1 X I AL.X XAwV 


TTTCACTTGA 


TGACTTCCGT 


GTTGTTTACA 


GCTACCTTGC 


900 


TGGAGATGGC 


AGCTATACAC 


TTGAAGACCG 


TCTCAATGTG 


CCAAATACTA 


TGTTCATCAC 


960 


ACCTGCACTT 


TCACAAGTTG 


GTTTGACTGA AAGCCAAGCA GCTGATTTGA AACTTCCATA 


1020 


CGCTGTTAAG 


GAAATCCCCG 


TTGCAGCAAT 


GCCTCGTGGT 


CACGTAAATG 


GAGACCTTCG 


1080 


CGGTGCCTTC 


AAAGCTGTTG 


TCAATACTGA 


AACAAAAGAA 


ATTCTTGGAG 


CAAGCATCTT 


1140 


CTCAGAAGGT 


TCTCAAGAAA 


TCATCAACAT 


CATCACTGTT 


GCTATGGACA 


ACAAGATTCC 


1200 


TTACACTTAC 


TTCACAAAAC 


AAATCTTCAC 


TCACCCAACC 


TTGGCTGAGA 


ACTTGAATGA 


1260 


CTTGTTTGCG 


ATTTAAGTTG 


AGATTTAATC 


GTATCGAACA GCCCTCTTTG 


GGCTGTTTTT 


1320 


ACTTCTGCGG 


AATCTCAAAT 


CTGTCTTTCT 


CCTCTTTTAT 


GATATAATAG 


AAACATGAAC 


1380 


TTAAAAACTA 


CTTTGGGCCT 


TCTTGCTGGG 


CGTTCTTCCC 


ACTTCGTTTT 


AAGCCGTCTT 


1440 


GGACGTGGAA 


GTACGCTCCC 


AGGGAAAGTC 


GCCCTTCAAT 


TTGATAAAGA 


TATTTTACAA 


1500 


AACCTAGCTA 


AGAACTACGA 


GATTGTCGTT 


GTCACTGGAA 


CAAATGGAAA 


AACCCTGACA 


1560 


ACTGCCCTCA 


CTGTCGGCAT 


TTTAAAAGAG 


GTTTATGGTC 


AAGTTCTAAC 


CAACCCAAGC 


1620 


GGTGCCAACA 


TGATTACAGG 


GATTGCAACA 


ACCTTCCTAA 


CAGCCAAATC 


TTCTAAAACT 


1680 


GGGAAAAATA TTGCCGTCCT 


CGAAATTGAC 


GAAGCCAGTC 


TATCTCGTAT 


CTGTGACTAT 


1740 


ATCCAGCCTA 


GTCTTTTTGT 


CATTACTAAT 


ATCTTCCGTG 


ACCAGATGGA 


CCGTTTCGGT 


1800 


GAAATCTATA 


CTACCTATAA 


CATGATATTG 


GATGCCATTC 


GGAAAGTTCC 


AACTGCTACT 


1860 


GTTCTCCTTA ACGGAGACAG 


TCCACTTTTC 


TACAAGCCAA 


CTATTCCAAA 


CCCTATAGAG 


1920 
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TATTTTGGTT TTGACTTGGA AAAGGGACCA GCCCAACTGG CTCACTACAA TACCGAAGGG 1980 

ATTCTCTGTC CTGACTGCCA AGGCATCCTC AAATATGAGC ATAATACCTA TGCAAACTTG 2040 

GGTGCCTATA TCTGTGAAGG TTGTGGATGT AAACGTCCTG ATCTCGACTA TCGTTTGACA 2100 

AAACTGGTTG AGTTGACCAA CAATCGCTCT CGCTTTGTCA TAGACGGCCA AGAATACGGT 2160 

ATCCAAATCG GCGGGCTCTA TAATATCTAT AACGCCCTAG CTGCTGTGGC CATCGCCCGT 2220 

TTCCTAGGTG CCGATTCGCA ACTCATCAAA CAGGGATTTG ACAAGAGCCG TGCTGTCTTT 2280 

GGACGCCAAG AAACCTTTCA TATCGGTGAC AAGGAATGTA CCCTTGTCTT GATTAAAAAT 2340 

CCAGTCGGTG CAACCCAAGC TATCGAAATG ATCAAACTAG CACCTTATCC ATTTAGCCTA 2400 

TCTGTCCTCC TTAATGCCAA CTATGCAGAT GGAATTGACA CTAGCTGGAT CTGGGATGCA 2460 

GACTTTGAAC AAATCACTGA CATGGACATT CCTGAAATCA ACGCTGGCGG TGTTCGTCAT 2520 

TCTGAAATCG CTCGTCGCCT CCGAGTGACT GGCTATCCAG CTGAGAAAAT CACTGAAACG 2580 

AGTAATCTGG AGCAAGTTCT CAAGACCATT GAGAATCAAG ACTGCAAGCA TGCCTATATT 2640 

CTGGCAACTT ATACTGCCAT GCTGGAATTT CGTGAACTGC TGGCTAGTCG TCAGATTGTT 2700 

AGAAAGGAGA TGAACTAATG GTTTATACTT CACTTTCCTC AAAAGATGGC AATTACCCCT 2760 

ATCAGCTCAA CATTGCCCAC CTCTACGGAA ATCTCATGAA TACtACGGGG ACAATGGAAA 2820 

CATCCTCATG CTCAAGTATG TGGCTGAAAA ACTGGGAGCC CATGTGACCG TTGACATCGT 2880 

TTCTCTCCAT GATGACTTTG ATGAAAATCA CTACGACATC GCCTTTTTCG GTGGTGGTCA 2940 

AGACTTTGAA CAAAGTATCA TTGCAGACGA CCTACCTGCT AAAAAAGAGA GCATTGACAA 3000 

CTACATCCAA AACGACGGTG TAGTTCTGGC TATCTGCGGT GGTTTCCAAC TATTGGGTCA 3060 

ATATTATGTT GAAGCTTCAG GAAAACGTAT CGAAGGGCTA GGGGTCATGG GACACTACAC 3120 

GCTCAACCAG ACCAATAACC GTTTTATCGG TGACATCAAG ATTCACAATG AAGATTTCGA 3180 

TGAAACCTAC TATGGATTTG AAAATCACCA AGGTCGTACC TTCCTCTCTG ATGACCAAAA 3240 

ACCGCTGGGA CAGGTTGTCT ATGGAAATGG AAACAACGAA GAAAAGGTCG GTGAAGGGGT 3300 

TCATTATAAG AATGTCTTTG GTTCCTACTT CCACGGGCCT ATCCTCTCTC GTAATGCCAA 3360 

TCTGGCTTAT CGCCTAGTTA CTACTGCCCT CAAGAAGAAA TATGGTCAGG ACATCCAACT 3420 

CCCTGCCTAT GAGGACATTC TCAGCCAAGA AATCGCTGAA GAGTACAGTG ACGTCAAAAG 3480 

CAAGGCTGAC TTTTCTTAAA CAAAGGAAAA TGATATCAAA GAACTCCGTT ATCTTGTCGG 3540 

AGTTTTTTGT CTTTTCTTTT ACCCTTCTCC CTTGCATTTT CTCTCATTTT TTGCCAAAAT 3600 

AGAGGGGTAG AAAGAAGGTA GCATATGTCT AAATTACAAC AAATCCTAAC ATATCTTGAA 3660 
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TCAGAAAAAC 


TAGACGTCGC 


TGTCGTATCT 


GACCCCGTCA 


CAATCAATTA 


CCTCACTGGT 


3720 


TTTTACAGTG 


ATCCCCATGA 


ACGCCAAATG 


TTCCTCTTTG 


TCCTAGCAGA 


TCAGGAACCT 


3780 


CTCCTCTTTG 


TCCCAGCTCT 


TGAAGTAGAA 


CGTGCAAGTA 


GCACCGTTTC 


CTTCCCAGTA 


3840 


GTGGGCTATG 


TCGATTCTGA 


AAATCCATGG 


CAAAAAATCA 


AACATGCTCT 


TCCACAACTT 


3900 


GACTTCAAAC 


GTGTCGCTGT 


TGAGTTTGAC 


AATCTCATCT 


TGACCAAATA 


CCATGGTTTG 


3960 


AAAACAGTTT 


TTGAGACTGC 


TGAGTTTGAC 


AACCTCACTC 


CTCGTATCCA 


ACGCATGCGC 


4020 


CTCATCAAAT 


CAGCTGATGA 


AGTGCAAAAA 


ATGATGGTTG 


CAGGTCTTTA 


TGCTGACAAG 


4080 


GCTGTTCATG 


TTGGTTTTGA 


CAATATTTCT 


CTTGATAAGA 


CTGAGACAGA 


TATCATCGCA 


4140 


CAAATCGACT 


TTGCCATGAA 


ACGTGAAGGT 


TATGAAATGA 


GCTTTGATAC 


CATGGTCTTG 


4200 


ACTGGTGATA 


ATGCTGCGAA 


TCCACACGGC 


ATTCCAGCAG 


CTAATAAGGT 


TGAAAATGAT 


4260 


GCTCTTCTCC 


TCTTTGACCT 


GGGTGTTCTG 


GTCAATGGCT 


ATGCGTCAGA 


TATGACTCGT 


4320 


ACAGTCGCTG 


TCGGCAAACC 


AGACCAATTC 


AAGAAAGATA 


TTTACAACTT 


GACTCTTGAA 


4380 


GCCCAACAAG 


CTGCTCTTGA 


CTTTATCAAG 


CCAGGTGTGA 


CTGCTCATGA 


AGTGGACCGC 


4440 


GCTGCCCGTG 


AGGTCATCGA 


AAAAGCTGGT 


TATGGTGAGT 


ACTTCAACCA 


CCGTCTCGGG 


4500 


CATGGTATCG 


GTATGGATGT 


CCATGAATTC 


CCATCTATCA 


TGGAAGGAAA 


CGACATGGTC 


4560 


ATCGAAGAAG 


GCATGTGCTT 


CTCTGTTGAA 


CCAGGTATCT 


ATATCCCTGG 


TAAAGTCGGT 


4620 


GTTCGTATTG 


AAGACTGCGG 


TGTTGTTACC 


AAGGATGGCT 


TCAACCTCTT 


TACAAGCACC 


4680 


AGCAAAGATT 


TGCTTTATTT 


TGATTAAACT 


ATATAGCCCC 


TATGCTTTCC 


TTTCAAAATA 


4740 


TCTAGGGGCT 


ATTTTATTGT 


CATTTTTCTG 


CTATTATGCT 


AAAGAAATTG 


GCTGCAATAA 


4800 


TCTAACCCTA 


AGTGTCTGGA 


ATGATAACGA 


GGGTGCTCTC 


CGCTTTTATC 


AAAGACAAGG 


4860 


GATGAAACCC 


CAAGAAACAA 


CAATGGAAAT 


GATAATTGAT 


TAAGAAGTCA 


TCTATCAAAA 


4920 


GATGTTAGAA 


AAAGTTCAAT 


TTCACTAGAA 


AATGAGGAAA 


ATCTCCCCAC 


AATAAAACGC 


4980 


ATAGTATCAG 


GTATTGTGTA 


CTGACCCCAA 


ACAGTTAGAC 


AATTAATTTA 


TCCGAAGGAT 


5040 


TTAGTTCTGT 


ACTGCACAGG 


ACTAAGTCCT 


TTTAGTTTTA 


CCTTAATTCG 


TTTGTTGTTG 


5100 


TAGTAATCAA 


TATAGTCTAT 


AATGACTTGT 


TCCAATTGGT 


TAAGTGATTT 


AAATGTTTTC 


5160 


TCATAGCCAT 


AAAACATTTC 


GGATTTTAAA 


ATGCCAAAGA 


AAGATTCCAT 


CATACCGTTG 


5220 


TCTTGGCTGT 


TTCCCTTGCG 


TGACATAGAT 


GCTTGAATTC 


CCTTATTCTC 


TAGGAACCGA 


5280 


TGATAAGAAT 


CGTGTTGGTA 


TTGCCAGCCT 


TGGTCACTAT 


GGAGAATCGT 


ATTCTCGTAG 


5340 


TGCTTCTCTT 


TGAATGCCTG 


TTCCAACATT 


GTTTGTACTT 


ATTCTAAATT 


AGGCGAACAA 


5400 


GAAAGATTAA 


AAGCAATAAT 


TTCGCTGTTA 


AAGCCATCTA 


AAACTGGTGA 


TAAGTAAAGC 


5460 
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TTTTGAGTAC TTGCTGGAAT GGCAAATTCA GTCACATCTG TGTAGCACTT TTCCATTGTT 5520 

TTAGAGCCTT CAAATTGGGC TTGAATGAGA TTCTCTGCCT TCTTACCAAC GTCTCCTTTA 5580 

TGAGAAGAAT ATTTTCGTTT CTTTCGCATT TTAGCTTGTA AATTGAGTAC TTTCATCAAG 5640 

CCTTGAACTC TTTTATGATT TACCAGATAA CCACGATTTC TTAGTTCTAA ATGAACCCGG 5700 

CGATAAGCAT AATTTCCCTT GTGTTCGATA AAGATGGATT GAATTTCAGT TTTAAGCTCT 5760 

TGGTCTTTAT CTGTTTTGTC TAGCTGTTTC AAGTGATAGT AGTAGGTCCA ACGAGCTAGT 5820 

TTAATGGCTT CTAGAAGAAG ATCTAACGAA AACTCAGTCA TTAATTCTTG AACAATTTCT 5880 

GTCTTTCTTC TTTCTCTTTT TCCTCCTTCA ATCGGAGTTC TCTTAACTTT TTTAGGATGG 5940 

CATTCTCCGC TCTCAGGTAC TCTCCCTCTT GTTTTCTCAA CAATAGTATA CCCGTTTTTC 6000 

CTGTATTGTG CTAGCCAGTT AAGAAGTATC GTACGACTTG GGAGACCGTA TTCAAGAGAA 6060 

ACTCTATCTT TAGTCCAGCC TTCATGTCAG ACTTTATTAA CCCCAATTAT TCACCCCAAA 6120 

TCTAAAAACC ATCCAGAATC CTTGCCTTAG CTTAGATCCT GGATGGTTTC TTTTTTCACC 6180 

CAATGGGTGT TTTTTACTAG ACAAAAAAGA GTTTCCCCTT TATGGTATAA GTGTAGAAAA 6240 

AAACACAAAA AGAAAGGAAA CTCACATGAA CAGTTTACCA AATCATCACT TCCAAAACAA 6300 

GTCTTTTTAC CAACTATCTT TCGATGGAGG TCATTTAACC CAGTATGGTG GTCTTATCTT 6360 

TTTTCAGGAA CTTTTTTCCC AGTTGAAACT AAAAGAGCGG ATTTCTAAGT ATTTAGTAAC 6420 

GAATGAmCAA CGCCGCTACT GTCGTTATTC GGATTCAGAT ATCCwTGTCC AGTTCCTCTT 6480 

TCAACTGTTA ACAGGTTATG GAACGGAATA TGCTTG 6516 
(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14654 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 

TTTTCAACCC ATATCGTGGC TCCTGAATAC TACTTACTGA CAACTATGCT ATCAGAGACT 60 

TCTCTACTTG TTTTCTATAT CATTTTCATC CATAGAAAAC AACTCATCCA CTTGGGACAT 120 

ATCTTTAGCT ATACTGTTCG ATACTCTCTC TTTTCACTTT CCTTTGTAGC AATTTATTTC 180 

CTGATTAATT TCGTGTATCC TGTAGATATG GTCATTAATT TGCCATTTTT GATTAATACT 240 

GGTTTGATTG TCTTGCTATC AGCTATCTCT TATATTAGTC TACTTGTCTT CACAAAAGAT 300 
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AGCATTTTCT ATGAATTTTT AAACCATGTC CTAGCCTTAA AAAATAAATT TAAAAAATCA 
TAGGAGTTTA AAATGAAACA ACTAACCGTT GAAGATGCCA AACAAATTGA ATTAGAAATT 
TTGGATTATA TTGATACTCT CTGTAAAAAG CACAATATCA ACTATATTAT TAACTACGGT 
ACTCTGATTG GGGCGGTTCG ACATGAGGGC TTTATCCCTT GGGACGACGA TATTGATCTG 
TCCATGCCTA GAGAAGACTA CCAACGATTT ATTAACATTT TTCAAAAGGA AAAAAGCAAG 
TATAAGCTCC TATCCTTAGA AACTGATAAG AACTACTTTA ACAACTTTAT CAAGATAACC 
GACAGTACGA CTAAAATTAT TGATACTCGA AATACAAAAA CCTATGAGTC TGGTATCTTT 
ATCGATATTT TCCCTATAGA TCGCTTTGAT GATCCTAAGG TCATTGATAC TTGTTATAAA 
CTGGAAAGCT TCAAACTGCT GTCTTTCAGT AAACATAAAA ATATTGTCTA TAAGGATAGC 
CTTTTAAAAG ATTGGATACG AACAGCCTTC TGGTTACTCC TTCGACCGGT TTCTCCTCGT 
TATTTTGCAA ATAAAATCGA GAAAGAAATT CAAAAATATA GTCGTGAAAA TGGGCAATAT 
ATGGCTTTTA TCCCTTCAAA ATTTAAGGAA AAGGAAGTCT TCCCAAGTGG TACCTTTGAT 
AAAACAATCG ATTTACCCTT TGAGAATTTA AGCCTTCCTG CACCTGAAAA ATTTGATACT 
ATTTTGACAC AATTTTATGG AGATTATATG ACCCTACCAC CAGAAGAAAA ACGCTTCTAC 
AGTCATGAAT TTCACGCTTA TAAATTGGAG GATTAGGATG CAATATTTAG AAAAAAAAGA 
AATTAAAGAA ATTCAACTAG CCCTGCTGGA CTATATTGAT GAGACTTGTA AGAAACATGA 
TATTCCTTAT TTTCTCAGTT ATGGAACCAT GCTTGGAGCC ATCCGCCACA AAGGTATGAT 
TCCTTGGGAT GATGATATTG ATATTTCCCT TTATCGTGAG GATTATGAGC GTTTACTGAA 
GATTATTGAA GAAGAAAATC ACCCTCGCTA CAAGGTTCTT TCCTACGATA CATCTTCTTG 
GTACTTCCAT AATTTCGCAT CGATTTTGGA CACTTCTACT GTTATAGAAG ACCATGTTAA 
GTACAAGCGT CATGATACCA GCCTTTTCAT CGATGTCTTC CCAATTGATC GATTTACAGA 
CTTGAGCATT GTCGACAAGA GCTATAAGTA TGTGGCTCTT CGTCAACTAG CTTATATCAA 
AAAATCACGA GCAGTTCACG GTGATAGCAA ACTAAAAGAT TTTCTTAGAT TATGTAGCTG 
GTACGCTCTC CGATTTGTCA ATCCTCGCTA CTTTTACAAG AAAATTGATC AACTAGTCAA 
AAATGCTGTA ACCAACACTC CTCAATATGA AGGAGGAGTT GGGATCGGTA AGGAAGGGAT 
GAAAGAAATC TTCCCAGTTG ATACCTTTAA AGAACTGATT TTAACTGAGT TTGAGGGCCG 
TATGTTGCCT GTTCCCAAAA AATATGACCA ATTTTTAACC CAGATGTATG GCGATTATAT 
GACACCACCA TCAAAAGAAA TGCAAGAGTG GTATAGTCAT AGCATTAAAG CTTATCGCAA 
AAACTGATTG AGGGGGATTA TACAAACTAC TAAGATAGAG GTTATTCAAA AACATAATTT 
TAGTAGAAAA TGAAATACAT ATTCCCACAA TAAAACGCAT CATATCAAGG TTTTTGAAAA 



360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
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ACCTTGATAT GATGCGTTTT ATAATTTTAA AGACTTTTTT CTATAGTAGA TTGAAATAAG 2160 

ATGCGAACAA ATCAATTAGA AAATTCAAAT TAATTTATAG AAATATTTTA GTATTCCTGT 2220 

GTACTGTTCT AAATTCAGTC TGCTATATCT TATTTTTCTA TTTAAATCGC TTCTGTAACA 2280 

AAGCTACGAC TTTCAAGTAC CTTAAGCATG GCATTAGCTG TATCTAGCGC TGTGAAGAGG 2340 

GGCACCCCGT GTTCAATGGC TGAACGACGA ATTTGCTCAC CATCTTCGTC AGCAGTTCGT 2400 

TTTGTTCCTA CTGTGTTAAT GATAGCTTGA ATTCTTCCTT TGCGTACAAA ACTTGGGATA 2460 

TCCTTATCGT CATCACCAAT CTTACCAACA GGTTGGGCTT GCAAGCCATG ACTAGCAAAG 2520 

AAGGCTGCTG TCCCTTCTGT CGCAAGGATT CCATAACCAA TGTTTTGGAA ACGACGAGCC 2580 

AAGTTCAAGG CTTCTTCTTT GGCATCATCA GCGATGGTAA AGACGACATT ACCAAAAGTT 2640 

GGCAAGTGTA GATAAGAAGC TTCAAAGGCT TTATAGAGAG CTTTTTCCAA AGTAGCATCA 2700 

GAACCCATAA CTTCACCTGT TGACTTCATT TCAGGACCGA GCAAGCTGTC TACCTTAGCT 2760 

AGTTTGGTAA AGGAGAAGAC AGGTGCCTTG ATATGAACAC GGGTGCTTTC AGGGTAAAGT 2820 

CCATTTTGGT AGCCAAGTTC TGATAAACTT TGACCAAGAA TGAGTTTGGT CGCTACTTGA 2880 

GCCATAGGAA TATTGGTTAC CTTAGATAGG AATGGAACAG TACGGCTGGC ACGTGGATTG 2940 

ACCTCAATAA CGTAGACTTT TTCATCCTTG ATAACAAACT GGATGTTCAT CATTCCAAGG 3000 

CAGTGAAGAC CGATTGCTAA GCGTTTGGTG TAGTCTGCGA TGGTCTCCTG AACCTTTTGC 3060 

GACAAGGTTT GTGGTGGGTA AACAGCCATT GAGTCACCTG AGTGGACACC AGCACGTTCG 3120 

ATATGCTCCA TGATACCAGG AATGAGTACA TTTTTACCAT CTGAAATGGC ATCAACTTCG 3180 

CACTCTTGCC CAACGATATA AGAGTCGACA AGAACTGGGT GGTCTGGACT AGCCTTAACA 3240 

GCAGTTCGCA TGTAAGAACG AAGGTCTTCT TCGTTTTCAA CGATTTCCAT GGCACGTCCA 3300 

CCAAGTACAT AAGATGGGCG GACAAGAACT GGGAAGCCAA TCTTGCGAGC TGCAAGAGCT 3360 

GCTTCTTCTT CATTGGTAGC CGTTTGTCCT GGTGGCTGTG GAATATCCAA TTCTTTGAGA 3420 

GCTTGCTCGA AGAGGTCACG GTCTTCGGCA CGATCTAGGT CAGCAACCTG TGTACCAAGG 3480 

ATGGTCACAC CTGCTTTTGC CAATGGCTCC GCAAGGTTGA TGGCTGTTTG ACCACCGAAC 3540 

TGAACGATAA CTCCCTTTGG TTGTTCCAAG TCAATGACGT TCATAACATC TTCGAATGTC 3600 

AATGGCTCAA AGTAAAGCTT ATCTGATACA GAGAAGTCTG TTGAAACGGT CTCTGGGTTT 3660 

GAGTTCATGA TGATAGCTTC ATAACCAGCT GCCTGGATAG CCTTAACAGA GTGAACGGTT 3720 

GCGTAGTCAA ACTCAACCCC TTGACCGATA CGGATTGGAC CTGAACCTAG GACAAGTACA 3780 

GATTCTTTAT CAGATCTGAT AGATTCATTT TCCCAACCAT AGGTTGAATA GAAATATGGC 3840 
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GTTTCGGAGT 


CGAACTCTGC 


CGCACAAGTG 


TCTACCATCT 


TATAAACTGG 


AACAATCTTG 


3900 


TTTTCCAAGC 


GAAGTTGGCG 


AACTTTATCA 


TCAGTCGTTC 


CCCAGAGTTC 


AGCAATCTTA 


3960 


CGGTCTGAAA 


AACCATTAAG 


TTTGGCTGTT 


TTCAAAACTT 


CTAAATCTTG 


TGGATGAGCA 


4020 


CCCAATTCTT 


GCTCAATTTC 


AAAGATATGC 


AAGAGTTTAT 


CAAGATAGAA 


GATATCAATT 


4080 


TTTGTAAGCT 


CTGCAATTTC 


TTCAGGTGTG 


TAGCCACGAC 


GAATGGCTTC 


TGATACGTAG 


4140 


AAGAGACGGT 


CATCTTGGGC 


TTTGACAACC 


TTTTCAATCA 


AGGCATCATC 


AGAAACTGCT 


4200 


GCAAGTTCAG 


GTATTTCATT 


GTGGTGCACC 


CCAATTTCAA 


GGGAGCGGCA 


GGCCTTGAGA 


4260 


AGAGATTCCT 


CGATGTTACG 


ACCGATTGCC 


ATGACTTCTC 


CAGTCGCCTT 


CATTTGTGTA 


4320 


CCGAGACGGC 


GTTCACCCTT 


TTCAAACTTG 


TCAAATGGGA AACGTGGAAT 


CTTAGCAACT 


4380 


ACGTAGTCAA 


GGGCTGGTTC 


AAACATGGCA 


TAGGTTGAAC 


CTGTAACTGG 


GTTTATAACC 


4440 


TCATCCAAGG 


TCAAACCTAC 


TGCAATCTTG 


GCAGCCAACT 


TAGCAATCGG 


ATATCCTGTC 


4500 


GCTTTAGAAG 


CAAGGGCTGA 


CGAACGTGAT 


ACACGAGGGT 


TTACTTCGAT 


AACATAATAC 


4560 


TTGAAGCTGT 


TAGGATCAAG 


AGCTAGCTGA 


ACATTACATC 


CACCTTCAAT 


CTTGAGGGCA 


4620 


CGAATAATGC 


TCAAGCTCGC 


ATCACGAAGC 


ATTTGGTTTT 


CATAGTCTGA 


CATGGTTTGC 


4680 


GCAGGGGCAA 


ATACAATGGA ATCCCCTGTG 


TGAATCCCAA 


CTGGGTCAAA 


GTTTTCCATG 


4740 


TTACAAACAA 


CCAAGGCATT 


GTCAGCTGAG 


TCACGCATCA 


CTTCGTATTC 


AATTTCCTTG 


4800 


AAACCGGCAA 


TCGAACGCTC 


AATCAAACAT 


TGGGTAACAG 


GTGACAATTT 


CAAACCATTT 


4860 


TCAGTGATTT 


CACGCAATTC 


TTTCTCGTTG 


GCACACATAC 


CACCACCAGT 


ACCACCAAGG 


4920 


GTAAAGGCTG 


GACGAACGAT 


GACTGGGTAG 


CCAATTGTCG 


CTGCAAAGGC 


AACTGCTTCT 


4980 


TCTACTGTGT 


TAACAATTTC 


AGATTCTGGA ATGGGTTGTT 


CAAGCTCTTC 


CATCAATTGT 


5040 


TTAAAGAGGT 


CACGGTCCTC 


CGCTTGGTCA ATGGCAGATA ATTTGGTACC 


CAGAAGTTCA 


5100 


ACGCCAAGCT CGTCTAGGAT ACCATTTTTA GATAATTCCA TGGCCATGTT GAGACCTGTC 


5160 


TGACCACCGA 


GTGTTGGTAG 


CAAGGCATCT 


GGACCTTCCT 


TACGAAGAAT 


ACGTGTCACA 


5220 


AACTCAAGTG 


TAATCGGTTC 


AATGTAAACC 


TTGTCAGCAA 


TTTCCTTGTC 


CGTCATGATG 


5280 


GTTGCAGGAT 


TTGAGTTAAC 


CAAAACAACC 


TCATAACCTT 


CCTCTTTCAA 


CGACAAGCAA 


5340 


GCCTGAGTCC 


CAGCGTAGTC 


AAACTCAGCA 


GCCTGACCAA 


TAATAATCGG 


ACCAGAACCA 


5400 


ATCACCATAA 


TTTTTTGAAT 


ATCAGTACGT 


TTAGGCATAT 


ATAAGATATT 


AAGGGTGTCA 


5460 


AGCGGACAAA 


GCTAAAATAG 


GAGTTATGAC 


GAAGAACTGT 


CAGTTCTAGG 


AATAACTATC 


5520 


TTTTTAGCAC 


CGTCCGTAGC 


CCGTATTCAG 


TTCAGCAAAT 


ACGGAGCACC 


CTTCTCCTTT 


5580 


CTATTCGTCG 


CCTCTCAGGG 


CGACATTAAA 


TAAGATACAA 


AGGACGAATA 


GAAAGCGATT 


5640 
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GAATTTTAGG 
TTCCGTCCGT 
TGTTTAAAAA 
CCAGGAGCTG 
CCTTCCACTG 
TCGCGGCTGA 
TCACGTACCG 
TTTGCCATTG 
ACACCGCGAA 
AACATAACTC 
ACTGTCACGT 
TCCACTAGCA 
TGTTTGATAT 
CTGTCCCCAA 
AGAGCACGCG 
AAGGTCATTT 
TTACAAGTTG 
GGATAAGTAA 
CCGGTCATCC 
TTGCCTTCAA 
CTCGTGGACG 
TAGTACGGAT 
CCGTGATTTT 
TGGGTGCATG 
TTGGGCATGG 
ACGGAGAAAC 
ATAGTCTGCA 
CAAGCGTTTC 
AATAGCAACT 



AAATCAAGGA 
GTTCAGTTAC 
CTTCCATCAT 
CATCTGGGTG 
ACTTGTCATT 
CTGCATAACC 
CATGGTTGAA 
CAAAGAGTTG 
TCATGTOGAG 
CGTCAGGATT 
TACAGTTGCG 
CCACGCTCAA 
TGTCTGTCGG 
CATGGGTCAA 
TATCAATTCC 
GGTTGCGCCA 
GAATAATGGA 
AGGTCAAGAT 
CTGTATTAAA 
AAACTGTGCC 
CTCACTGGCG 
TCTAAGATTG 
GGTGCTTCAA 
AGGATTGCTG 
TAGTCTTCTT 
ATCATGACAT 
AACTCTTGAC 
AAAATCTGCA 
TTAAGACCCT 



AGGATTGACA 
ATAAATTCTC 
CTCGATAAAC 
GTATTGAACA 
GATTTCTTCG 
ATGGTTCTGG 
TCCACGGTGG 
GTGTCCCATA 
TGCTTGTGGA 
GAGATGGAGA 
TTTAGAAAGT 
ACCAACTCCT 
TAAAACTGTT 
GGTTGCACGC 
TGAAATCCCT 
GTTGCTAGCT 
TTCATAATCA 
TTGTCCATTA 
GACGATTTCG 
ATCTTCTAAT 

TCTTTTAACG 
CCATTCGAAC 
CCAAGTGGTC 
TTTCTTTCAA 
TTGAAAATAC 
CAACCTGATC 
TTCTCCATTC 
TATTGGATTT 
CAAAGTGGCC 



785 
ATCCAAGTTG 
CGACGAGCTT 
TCGTCAAATA 
GAGAAAGCAG 
TGGGTAATAA 
CTGGTGAAGT 
CCAAACTTCA 
CAAATACCAA 
ACGTCTTCTG 
ATTTCTTCAG 
TCACGTAGGA 
GGAGCTGGAT 
GCTTGGAGCT 
ATAGTACCAT 
GGAATTTTCT 
CTACGCGCTT 
TCACGATTAA 
TAAGACTGGT 
CCTGTTACAT 
ACTAGAATTC 
TCTTGTGTTT 
AAAGACACCA 
TGCTATTTCT 
ACGATCGTAA 
AGCTCCACTA 
AATGATTTCA 
CTCAGGTCCA 
GGCAACGCGT 
AAATTCCTCA 



GTTTCTCTAC 
TTACTCGTTC 
GGTAGCTAGC 
GTTGGTATCT 
TCAAGTGCTC 
CTACTCGTCC 
TCTTATAGGT 
AGATTGGAAT 
GGTTACCTGG 
CCGTTGTCGA 
TTGAGTGCTT 
AAGACGTTTT 
GGTCCGTCAC 
GCTTACGGAT 
TGGCTTTCAA 
CTTCAAAAAC 
TACCATAATT 
CTGTAATGGA 
CAATATCTGC 
TTTTTGTCAT 
TAGTTGGCGT 
TTGGTCATTT 
ACATCACGAT 
CGTTCTTGAG 
TCATGGCGTT 
TCAATGGTTA 
GCGAAAAAGA 
GAGTGGTCCA 
TAAATGGTCA 



ATTCTGAGCT 
TTAGTTTGAT 
GTCGTGTGGC 
GTGGCGCACA 
TGGCAAATCC 
TGTTGCGATT 
CTTAGCCCCG 
TTTTCCTTGT 
ACCATTTGAC 
ATAAGGAACA 
GAGACCAAAG 
AGTAGAAACC 
ATGGTCCATA 
AATCTTGGTA 
AAATTCATCC 
AACGACTCCC 
TCCTACCAAA 
TTCTTGGTAG 
TCCGAAGGCC 
ATTTTCACCT 
TTCTACTCGC 
GTTGGACAAT 
TGATTGGAGC 
TCAAGCCATG 
CGTGTTGCAC 
CAAACTGTCC 
GTTCAGCTCC 
AGTCACCTGC 
TCAAATCAAG 



5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 
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CAAGCTCTGG 


CTAGGGTGTT 


GGCCCGAACC 


A I LTCCACCA 


TTGATGATGG 


AAGTCGTAAT 


7440 


CGTTGGACTA 


GCAATCAATT 


CTCTATAGTA 


GTCGACCTCT 


GGATGGCGAA 


TCACACAGAC 


7500 


ATCCACTCCT 


AAAGCAGACA 


GAGTCAAAAT 


VAs IXaTCATAA 


AGTGTCTCAC 


CCTTATTAAC 


7560 




TTCACATCAA AGTCAAGTCG 


TTCCAATCCA 


AGTTTAATCT 


CTGCGACTTC 


7620 




TGTGTCCGTG 


TAGAATCCTC 


AH f> Tk Ik ✓*! K Ik 

AAAGAAGAGA 


TTGGAAACAA 


TCGGATGGTC 


7680 




AGCTGGGCTC 


CATTTTTAAA 


CTCAATTCCT 


CGCTTGATCA 


ATTTCATTAC 


7740 




GTGAGGTCTT 


CCATGGACAC 


CACATGGTTC 


AATGCTTGTT 


GATTTTCTGA 


7800 


LA 1 At. 1 


CCTTTAACTT 


TCTAAGCTTC 


TTCAGTAATC 


AGAACTCTGT 


CTTGGTCATC 


78.60 


AAbTTCTGTC 


ATCTCTACGA 


TGATTTCTTC 


AGAACGACTG 


GTTGGGATAT 


TTTTTCCAAC 


7920 


GTAATCTGGA 


CGGATTGGCA 


ATTCTCTATG 


TCCACGATCG 


ACTAGAACTG 


CTAAACTCAC 


7980 


ACGCGCAGGA 


CGACCATGAC 


CGACAATATT 


ATCAATAGCA 


GCACGGATGG 


TACGACCTGT 


8040 


ATAGAGCACA 


TCATCCACCA 


AGATAACTTC 


GCGGTCTGTC 


ACATCGACAG 


AAACCAAAGA 


8100 


AGTATCTTCT 


CCACTTTTAA 


CATCATCACG 


GAAAGGTTTA 


GTATCCAATT 


CCACAACAGG 


8160 


AACTGAAAGA 


TTTTCTAACT 


GCTTCAAACG 


TTCTTGGATT 


CGGTGGGCAA 


TAAAGACACC 


8220 


ACGAGTTTTA 


ATACCAGCCA 


AGACGATCTT 


ATTCAAATCT 


TTGTTGCGTT 


CGATAATCTC 


8280 


ATAAGTAATA 


CGCGTAATCG 


CTCGTTTGAC 


GGTCAATTCG 


TCTACAACTT 


CTTTTGTTTT 


8340 


CATGACAAAC 


CTCCAAAAAG 


AAAAGTCTCC 


TTAAACAAGG 


AGACTTGAAA 


TTTATAGCCA 


8400 


AGCGAGCCCT 


ACTGCACACA 


GTATAGACTT 


CACCCTTCTA 


CTTTATCGCG 


CTCCTTGCCT 


8460 


GCCTCACGGG ACAGGTTTAA 


AGGAATATTT 


AGTTATCATT 


TACTATAGCA 


CAAAGCATGC 


8520 


TTAAAATCAA GCAAAAAGTT 


TCAATGTAGC 


n mnnvn a ^ a a a 

ATCTTACAAA 


TTGCTAAAAT 


CATATAATTG 


8580 


TGGGTACTGG 


TCACACTCTG 


GATTTTTTGG 


ATGGCAAATG 


GCTCTTCCAA 


AATAAATCAT 


8640 


GGCCTGATGG 


GCAGCTAACC 


ACTGCTCAGG 


LuuLAnuAl A 


JXCATljACV-t. 


U<_ I 1 I I vJCAL 


8700 


CTCAAGTGGC 


GTCGCTGATT 


TTTTGACAAT 


ATCGTGGTGT 


TTGCAAATAC 


GCTCCACATG 


8760 


AGTATCCACT 


GCAAAGGCTG 


GAATTCCAAA 


TCCTACACTC 


ATGACAACAT 


TGGCTGTCTT 


8820 


GCGACCAACA 


CCTGCCAAAC 


TCTCCAATTC 


TTCACGTGTC 


TGAGGGACTT 


GACCATCAAA 


8880 


ATCGTCTAGT 


AACTGTTGGG 


CACATTTTTT 


AAGGAATTTA 


GCTTTATTCC 


GATACAATCC 


8940 


CAAGCGAGAA ATATGTGAAG 


CAATCTCACT 


CTCTGTCGCT 


ACAGACATAG 


CTTGGGGTGT 


9000 


TGGAAAGGCA 


ACAAAGAGAC 


CTGGTGTGGC 


CTTATTTACC 


GCTGCATCTG 


TCGTCTGGGC 


9060 


TGATAACATG 


ACCGCAACCA 


GGAGTTCAAA 


ATGATTGGTA 


AAATCAAGAC 


TAGGCTTGGC 


9120 


ATCTGGGAAG 


AGGGCAATGA 


TTTCTTCTAG 


CACCTTTCGT 


GCTCGTTTTT 


TTGACAAGAC 


9180 
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CATTATTCAT 


CTCCGTCAAA 


TAGTCCTTGT 


AAGCCAGCAA 


AAGGACTGTT 


TTCTTCTTTC 


9240 


TTTACTGCTT 


TTTGAGCTTG 


GTATTCTTCC 


TCTGTCATGA 


TTTGCCAGTC 


ATTTCCTGAG 


9300 


ATAAATCCTT 


GACCAGCTTC 


TTCTTCAGCC 


GTCAAGACCT 


TGATAGGAAT 


GTTTAGCAGG 


9360 


ATATTGTCTG 


ATACACTCTC 


AGCAAGGTCA 


AGCTCCCCAT 


TTTCGATGGG 


CAAGACCAAG 


9420 


TCATCATCTA 


AAACTTCTTG 


ATCTAGCTGG 


TTAGTTGCGC 


CTTCCATGAA 


AACTTCCGTG 


9480 


- ACTGGAT AAG 


ATTCAACTAA 


CTCAACTGGC 


TCCATACTGC 


GACTCGACGC 


AAGAACAATG 


9540 


GTATAAGATA 


GTTGATAATC 


TAAGAAATAC 


ATACGGTCTT 


CATATTGTAC 


TTTCCCAACT 


9600 


GCAAGGATAT 


CTTTTACATC 


TAAAATTTCT 


TGATTACGTG 


CACGCAGGTC 


ATCAACTAAA 


9660 


TCTAACGTTT 


GTTCAAAGTT 


CAAACCTTCA GACTGCTTAC 


GAATTTCTTG 


AATATTTAAT 


9720 


TTCATACTTC 


CTCCATAAAG 


ATTTACTCTC 


TTGATTATAC 


CATGAAAAGG 


CTACAAATCA 


9780 


GCACACCAAA 


CTTTGTAATT 


AAAATTCAAA 


ATTTTAACAT 


ATTTACTATG 


ATAGTTTTAT 


9840 


TTTTTAGTGC 


TATACTATAG 


GGAAAGAGTA 


CATCAGATCA 


AGGAGGATGC 


TCACATGGAA 


9900 


GACAAGAAAC 


TCATTCAACT 


CCTATCCAAG 


TTAAATAAAA 


GCTACCAAAA 


CTGTAAACAG 


9960 


GGTACGGCAG 


ATGATATTCG ACTACAAGAG 


CTGCTAAACA 


CTACTATGCA 


AGAGCTCAAA 


10020 


AAAACGGAAC 


AGTTGAACAA 


CAGTATCTTA 


ATTGATCTTG 


AGAAATTTTA 


CCAACCTACC 


10080 


AGTCTTCTGA 


TTGGACTGGG 


TAGCCTAAAA 


CTAAACGATC 


AAGCACGCAC 


TGCTTGGCGA 


10140 


AACTATGATA 


AATTCCATTA 


CGATCATGTC 


AAACACGTAC 


TAAGTCTCTA 


TGGACCTGTT 


10200 


TTTGAATTTT 


AGAGCATAGA 


ATTTCCAGTT 


TTCTGTTGAC 


AAAATTTCCT 


TAAAGGTATA 


10260 


ATATAAAGAT 


ACTAATACTC 


GGAGGTAAGG 


GAGACATGAA 


CAACTAAGTC 


TATCAAATAA 


10320 


AGAACCTTTA 


TTTAGTAGAT 


CTTGTTTTTG 


TCTCTTTTTG 


TGTGCTCTTT 


TATGCTCTTT 


10380 


TTCTGGCATG 


TTAATAGAGT 


TTTTTTGACA TAGACTTTGG 


GCTCTACTAG 


GTAAAGTAGA 


10440 


GCTTTTTGTT 


ATGCACTATG 


AACATTCTAG 


AAAGGGAAAT 


CATATGATAA 


AAATCAATCA 


10500 


TCTAACCATC ACACAAAACA AAGATTTACG AGATCTTGTA TCTGACCTAA CCATGACCAT 


10560 


CCAAGACGGG 


GAAAAGGTTG 


CTATTATTGG TGAAGAAGGA AATGGCAAAT 


CAACCTTACT 


10620 


TAAAATTTTA 


ATGGGGGAAG 


CTTTGTCTGA 


TTTCACTATC 


AAGGGAAACA 


TCCAATCTGA 


10680 


CTATCAGTCA 


CTGGCCTACA 


TTCCTCAAAA 


AGTCCCTGAG 


GACCTAAAAA 


AGAAAACTTT 


10740 


ACACGACTAC 


TTCTTTTTAG 


ATTCTATTGA 


TTTAGACTAC 


AGTATCCTCT 


ATCGTTTGGC 


10800 


GGAGGAATTG 


CATTTTGATA 


GCAATCGTTT 


CGCAAGTGAC 


CAAGAGATTG 


GCAATCTATC 


10860 


AGGGGGCGAA 


GCTTTGAAAA 


TTCAGCTTAT 


CCATGAGTTA 


GCCAAACCCT 


TTGAGATTCT 


10920 
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nlUll Aur A X 


GAACCTTCAA ATGACCTAGA CCTTGAGACA GTTGATTGGC TAAAAGGCCA 


10980 


vjA 1 1 l_ AAAALa 


ACCAGGCAAA 


CCGTTATTTT CATTTCCCAT 


GATGAAGACT 




11040 


7» a cncc r/~» r r> 


ACTATTGTTC 


ACTTGCGACT GGTCAAACAC 


CGTAAAGAAG 


CGGAAAPGPT 


11100 


nkj 1 AbAb t. AT 


TTAGACTATG 


ATAGCTATAG TGAGCAGAGA 


AAGGCTAATT 


X XuLLnAALA 


11160 


AALj I t-AGCAA 


GCTGCTAACA 


ACCAAAGAGC CTACGATAAA 


ACCATGGAAA 


a jpj tccc ap 

AnLA I L.uOAlj 


11220 


AGTTAAGCAA 


AATGTAGAAA 


CTGCGCTTCG AGCTACCAAA 


GATAGTACTG 


L.I_(jV3 X vAjLv_ i 


11280 


ATTGGCTAAA 


AAGATGAAAA 


CTGTCCTCTC ACAAGAAAAA 


CGCTACGAAA 


AljOLAGCTCA 


11340 


GTCCATGACT 


CAAAAGCCAC 


TTGAAGAGGA ACAAATCCAA CTTTTCTTTT 


CAGACATCCA 


11400 


ACCATTACCA 


GOTTOTAAAfi 


TCTTAGTCCA ACTGGAAAAA 


GAAAATTTGT 


CCATTGACGA 


11460 


CCGAGTTTTG 


GTTCAAAAAf* 


TACAACTAAC TGTCCGTGGC 


CAAGAAAAAA 


TCGGTATTAT 


11520 


CGGGCCAAAT 


GGTGTTGGGA 


AATCAACTCT GTTAGCCAAG 


TTACAGAGAC 


TTCTGAATGA 


11580 


TAAAAGAGAG 


ATTTP A C*WC\ 

*i A A X •vitV' £ X \J 


GTTTTATGCC ACAAGATTAC 


CACAAAAAAC 


TGCAATTGGA 


11640 


TTTATCCCCA 


ATAGCCTATC 


TCAGTAAAAC TGGGGAAAAA 


GAGGAACTAC 


R /*■ R R R R ffipn X 

AC»AAAATCCA 


11700 


ATCTCACCTA 


GCTAGTCTCA 


ATTTCAGTTA TCCAGAAATG 


CAGCATCAAA 


X X4_VjL. 1 X X 


11760 


ATCTGGCGGA 


CAACAGGGAA 


AACTCCTGCT TTTGGATTTA 


GTCCTGCGCA 


nALLAAAL X X 


11820 


TtTTCCTGCTG 


GATGAACCCA 


CACGAAACTT TTCTCCCACT 


TCTCAACCCC 


AAA! LAuAAA 


11880 


ACTCTTTGCT 


ACCTATCCAG 


GCGGTCTCAT CACTGTTTCG 


CATGACCGTC 


rmmm/vrim R R ft 
1 1L 1 1 AAA 


11940 


AbAAbTCTGC 


TCGATCATCT 


ATCGCATGAC AGAACACGGT 


TTGAAGCTAG 


1 X AA XXX nun 


12000 


AvjATTTATAA 


ATTTGCAACA 


TAGCAAAAAT CCAGAGACGA 


CCTCTGGATT 


tl i 1 1 Al_ A X 4- 


12060 


TGTTTTAAAC 


GTTCAATCCG 


TTCTGAGATA GGTGGGTGGG 


TATAAAAGAG 


TTTTTYSfS A & f 

1 i t. X 1 V3VS/VH_ 


12120 


tXtXt-ACCTT 


TCTTAGGATC 


ATTGATATAA AGGGCACTGC 


TAGCATCATC 




12180 


CTCATAGGTT 


TGCTATTGTC 


CAACTTATCT AGGGCATTAA 


TCATTCCCTG 


GGGATTGCGA 


12240 


GTCAGCTCGA 


CACTAGATGC 


ATCTGCCAGA AATTCCCTCT 


GACGAGAAAT 


AGCGAGCTGA 


12300 


ACCAAGGTTG 


CAGCGAGAGG 


TGCCAGTACA ATAGCTAGTA GGGAAACCAC 


TAGCATAATG 


12360 


ATTTCAAGAC 


CATTTCCATC 


TCGGTCATCA TCACTTCGTC 


TGCGACCTGC 


TCCACCCCAC 


12420 


CACATCATAC 


GACCTGCCAT 


ACTAGAAAGC ATGGTGATAG 


CACTAGCAAG 


GGCAACTGCA 


12480 


ATAGTCGAAA 


TACGGATATC 


ATAATTACGA ATATGACTGA 


CTTCATGTCC 


CATAACAGCT 


12540 


TCTAGTTCTT 


CACGATTCAT 


GATAGCTAGT AGACCTGAAG 


TCGCAGCAAC 


AGCCGCATTT 


12600 


TGAGGATTAG 


AACCTGTCGC 


AAAGGCATTT AAGGCTGGAT 


CATCAATGAT 


GAAAACACGG 


12660 


GGCATAGGAA 


TCTGAGCGAC 


CAGAGCCATA TCTTCCACTA 


CATGGTAGAG 


GTCTGGTGCC 


12720 
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GTTTGCTCAT CCACCTCACG CGCTCCATTC ATGGACATGA CAATCTCTGT CGATTGAAAA 12780 

ATCATAGACA AAGCGTAGAT AAAGCCGATA ATCAGTGCAA TAACCAAACC ACCAAGTCCA 12840 

GATCTTATAA AGAGATAACC AACCGCATAA CCAACAAGAG CTAAGAGTAG GAAAAATACC 12900 

AGCAACAAAA TCCAGGTTTT TCGTTTATTG CTTGCAATTT GATCAAACAA CATCTTAGTC 12960 

ACCTAAACCG CTAAAATCAA CTTTAGGAAC CGACTTTTCC TCTTCAGGTG TTTGAAGGAA 13020 

' ATCTGCCGCT TTAAATCCAA ACATTCCAGC GATAATATTG CTCGGGAAAG TTTCTAATTT 13080 

TACATTGTAG TTGCTGACAA CACTGTTATA GAGTTGACGA GAGTAAGAAA TTTTATTTTC 13140 

TGTGTTTGTC AACTCCTCTT GCAATTTAAC AAAGTTAGCA CTAGCTTTCA AATCTGGATA 13200 

GCTTTCTGCA ACTGCAAAAA TACCTGAAAC CTGACGAGTG AGGGCATCAC TGGCTTTCAT 13260 

AGCTTCTGCT GGTGAAGTCG CTGCCGCCAC TTGGTTACGT AGTTCTGCCA CCTTTTCAAG 13 320 

GGTAGAACCT TCATATTTGG CATAACCTTT TACAGTCTCA ATCAAGTTTG GCAAGAGGTC 13380 

ATTGCGACGT TTCAACTGAA CATCAATCTG ACTCCAAGCC TCCTTGGTTT GCATACGATT 13440 

TTTAACCAAA CCGTTATAGC TAACAATCAC AAAAATAACA ATAAGAGCGA TAACTCCAAG 13500 

AATAATCCAA GTCATAATAT AAGTCCTTTC TGCTTTTAGA TTAGTACCAG TATATCAAAT 13560 

TTTCTATGAT TGTGGTAAAA TAAGATGATA CTAAAGAAGG AAATAACTAT GAAACCAAAA 13620 

ACATTTTACA ACTTGCTTGC CGAGCAGAAT CTTCCACTTT CGGACCAGCA AAAAGAACAA 13680 

TTTGAACGTT ATTTTGAGCT CTTGGTCGAG TGGAATGAGA AGATTAATTT GACGGCGATT 13740 

ACGGACAAGG AAGAAGTTTA TCTCAAACAT TTTTACGATT CGATTGCACC CATTCTTCAA 13800 

GGTTTGATTC CCAATGAAAC TATCAAACTT CTTGATATCG GGGCTGGGGC AGGATTTCCT 13860 

AGTCTACCAA TGAAAATTCT CTATCCGGAG TTAGATGTGA CCATTATTGA TTCACTCAAT 13920 

AAGCGCATCA ACTTCCTACA ACTCTTGGCT CAAGAACTGG ATTTGAACGG AGTTCATTTC 13980 

TACCACGGAC GTGCCGAAGA TTTTGCCCAA GACAAGAACT TCCGTGCTCA ATATGATTTT 14040 

GTAACAGCTC GTGCGGTTGC CCGTATGCAG GTCCTATCTG AATTGACTAT TCCCTACCTT 14100 

AAGGTTGGTG GCAAACTATT AGCACTCAAG GCTAGCAATG CGCCTGAGGA ATTATTAGAA 14160 

GCTAAGAATG CCCTCAATCT CCTTTTTAGT AAGGTCGAAG ACAATCTCAG TACGCCCTAC 14220 

CGAATAGAGA TCCGCGCTAT ATCACAGTGG TAGAAAAGAA AAAAGAAACA CCAAATAAAT 14280 

ATCCACGTAA GGCTGGTATG CCAAATAAAC GCCCACTTTA AATTTTTTAG TAAACAAATG 14340 

TTTACAAAAT CAGCCTCGCT CTTTTATTTC TAGGCTCGGG AAAAAATGAT TTACAAAATC 14400 

AGCCTCGCTC TTTTATTTCT AGGCTCGGGA AAAAATGATT TACAAAATCA TTTTTTTCTG 14460 
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CTATACTATC CTAAGCAAAG GTTTTTAATG TCATCCCGTG AGGTGACGAA GACGCAGAAA 
TATTTAAAAC TCTTTAAAAT CTAAATTTTA AAGAAGTCTT ACTCTGAGGG CCTATTGCTG 
TAAAATAATG GGCTCTTTTT TGATGCCCAA AAGTGAGGTT TATATGAAAC AAGAATCAAC 
TGTTGATTTG TTAC 

(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 6405 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



14520 
14580 
14640 
14654 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 
AGAAAAATCT GCTTTACAGA AAATAAAAAT AATAGGAGAA AATCTATGTC AGATTTGAAA 
AAATACGAAG GTGTCATTCC AGCCTTCTAC GCATGTTATG ATGATCAAGG AGAAGTAAGC 
CCAGAACGTA CGCGTGCCTT GGTTCAATAC TTCATTGATA AAGGTGTTCA AGGTCTTTAT 
GTCAATGGTT CTTCTGGTGA ATGTATCTAC CAAAGCGTTG AAGATCGCAA GTTGATTTTG 
GAAGAAGTCA TGGCGGTAGC AAAGGTAAAT TGACCATTAT TGCCCATGTT GCTTGCAATA 
ATACTAAAGA TAGTATGGAA CTTGCTCGCC ATGCTGAAAG CTTGGGAGTA GATGCTATTG 
CAACGATTCC ACCAATTTAT TTCCGCTTGC CAGAATACTC AGTTGCCAAA TACTGGAACG 
ATATCAGTTC TGCAGCTCCA AACACAGACT ACGTGATTTA CAACATTCCT CAATTGGCAG 
GGGTTGCTTT GACTCCAAGC CTTTACACAG AAATGTTGAA AAATCCTCGT GTTATCGGTG 
TGAAGAACTC TTCTATGCCA GTTCAAGATA TCCAAACCTT TGTCAGCCTT GGTGGAGAAG 
ACCATATCGT CTTTAATGGT CCTGATGAGC AGTTCCTAGG AGGACGCCTC ATGGGGGCTA 
GGGCTGGTAT CGGTGGTACT TATGGTGCTA TGCCAGAACT CTTCTTGAAA CTCAATCAGT 
TGATTGCGGA TAAGGACCTA GAAACAGCGC GTGAATTGCA GTATGCTATC AACGCAATCA 
TTGGTAAACT CACTTCTGCT CATGGAAATA TGTACGGTGT CATCAAAGAA GTCTTGAAAA 840 
TCAATGAAGG CTTGAATATT GGATCTGTTC GTTCACCATT GACACCAGTG ACTGAAGAAG 900 
ATCGTCCAGT TGTAGAAGCG GCTGCTGCCT TGATTCGTGA AACCAAGGAG CGCTTCCTCT 
AATCTAAAAG GAGGTATTTA TGACATATTA CGTTGCAATT GATATCGGTG GAACCAACAT 
CAAGTATGGT TTGGTTGATC AAGAGGGGCA ACTTCTTGAA TCGCATGAAA TGCCAACTGA 
GGCGCATAAG GGTGGACCTC ATATCTTACA AAAGACCAAA GATATCGTAG CTAGTTATTT 1140 
AGAAAAAGGC CCAGTAGCAG GTGTTGCCAT ATCTTCTGCT GGGATGGTGG ATCCGGATAA 1200 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



960 
1020 
1080 
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GGGTGAGATT 
GGAAATCGAA 
TCTTGCTGAG 
TGGAACCGGT 
TTCAGCCTGT 
* TACAAC AGCT 
TGGCCGTAGA 
CCGTATGGTT 
AGTGGTTATT 
TACAGCCTTG 
CCATCACCAA 
CTAGTTTGGC 
GTTTTTTATT 
AAAAGTAGGC 
GGATATAGGA 
CTTTATTGGG 
GACTATCTAA 
CATCCATGAT 
CACCTAGTCG 
CAAAGAAGTA 
CGTCAATCAA 
TCAGTGGACT 
AAATAAATTC 
GAGAAATATG 
CTTGCAAAAA 
CAATGATAGT 
GGAAGACTAG 
ATAAAAACGC 
ATTTTCTACT 



TTCTATGCTG 
GAAAGCTTTA 
GCAGTATCTG 
ATCGGTGGTT 
GAAGTCGGGT 
TTAGTGAAAT 
ATTTTCAAAG 
GACTATCTAG 
CTTGGTGGTG 
AAAGAGGCTT 
AATACAGCAG 
TCAGCCAAAC 
TTCCCAGTAG 
ATAAATCAAA 
GGGTTGGCTG 
AACACTTGTA 
AATAGAAGGA 
GCTGGTTGTC 
CATAAAACGT 
GATACGCTCA 
GTCTTGTGTT 
GTGCTTGGAG 
TCGGTAGCCA 
TAATTTTTGG 
ATAGCGAGCG 
TGCGATATCT 
ATCAGAGAAT 
ATCTCTGTTT 
CAAATTGTGG 



GGCCGCAAAT 
CTATTCCTTG 
GTTCAGGCAA 
GCTTGATTAT 
ATATGCATAT 
ATGTAGCTGA 
AAGCCACTGA 
GAAAAGGTCT 
GTATCATGGG 
TGGTACCAAG 
GGATGTTGGG 
TAGGATTTTC 
CTATTAAAGA 
TCGATAAAGA 
TGGGTGGCTA 
AAGAGTACAG 
GTTGAGCCTG 
CAGGCAAAGC 
AATTTCATTT 
GCATCTTCGA 
TGTTCCCTCA 
ACTTGGTTGG 
GTAAAGCCAC 
GTGACTTGTT 
ATTTCTTGTT 
GGTTTGTCCA 
AGTCACACTT 
TAAAAACGAA 
TACAATTAAG 



791 
CCCTAACTAC 
TGAGATTGAA 
GGGAGCAAGT 
GGATAGGAAA 
GCAGGATGGA 
AGCCCATGGA 
AGGAAACAAA 
GGCAAATATT 
GCAAGAGGCT 
TTTAGCAGAA 
TGCATATTAT 
TTACACGTTT 
TTTTTTCCTT 
AGAGCATAGG 
CAAGAACAGT 
TCTTTGCCCC 
AAAGTGAGAA 
CGTCTTGGTC 
CACGGGCGAC 
TTAGCTGGGC 
TATTGCTATA 
CTTGATTTTC 
ACTTTTTAGC 
GAGAAGATAA 
CTAGGTCTGT 
TAGGGAAAGC 
CATTATAACA 
AAAATCGAAA 
AGTAAGATTT 



GCAGGCACCC 
AATGATGTCA 
GTGACACTTT 
GTCTTCCATG 
GCTTTTCAAG 
GAAGATGTTG 
ATCTGCATGG 
TGCTACGTTG 
ATCCTCAAAC 
AAAACACGAT 
CATTTTAAGA 
TTGTCTACGA 
GCTTTCGCGA 
AAGTTGAGCG 
CTCTGTATAG 
CATCTCCTTA 
GCCCAGTACT 
TGTCAAAGCT 
GAGGCCAGAA 
AATTCGTTCT 
ACTTCTGAGG 
TGCCTGATGT 
AAAGCGGGTC 
ATCATCTGTA 
CATTTCTTCA 
TCCTTTACAT 
CATAATATAA 
AAGCTTCTCT 
TAAGTTAGAA 



AGTTCAAAAA 
ACTGTGCAGG 
GCTTGACCAT 
GTTTTAGCAA 
ACTTGGCTTC 
ATCAGTGGAA 
AAGGTATTGA 
CCAATCCAGA 
CTAAGATCCG 
TAGAATTTGC 
CAAAACAATC 
TAGCCGTTGA 
TTGATTTCCA 
GATATTCGTT 
GTCTGGCTAT 
GCATCTAATA 
AGACAATTTT 
TCGCAGACCA 
CTCCCTGTTC 
AGTTGGATTT 
ACTCGTTTGG 
TGGTATTGGA 
AAAGCAGCTT 
ATCGTTTCAG 
AAATGTGAAT 
GAGTCATACT 
GGATAGATAA 
CTTTTCCATA 
ATGAGACTGA 



1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 
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TTTGTATGAG 


AAAATTTAAC 


AGCCATTCGA TTCCGATTCG 


GCTTAATTTA 


TTGTTTTCAA 


3000 


TCGTCATTTT 


ACTCTTTATG 


ACCATTATTG GTCGTTTGTT 


GTATATGCAG 


GTTTTGAACA 


3060 


AGGATTTTTA CGAAAAAAAG CTAGCTTCAG CTAGTCAGAC CAAGATTACA AGCAGTTCAG 


3120 


CCCGTGGGGA 


AATTTATGAT 


GCTAGTGGAA AACCTTTGGT 


AGAAAATACG 


TTAAAGCAGG 


3180 


TTGTTTCCTT 


TACGCGTAGC 


AATAAAATGA CGGCTACAGA 


CTTAAAAGAA 


ACAGCTAAAA 


3240 


AGTTACTGAC 


TTATGTGAGC 


ATCAGTTCTC CAAATTTGAC 


AGAACGCCAG 


CTGGCGGATT 


3300 


ACTATTTGGC 


TGATCCTGAA 


ATCTATAAAA AAATAGTGGA AGCTCTCCCA AGTGAGAAAC 


3360 


GCTTGGATTC AGATGGCAAT 


CGTCTATCCG AATCAGAACT 


GTATAACAAT 


GCGGTCGATA 


3420 


GTGTACAAAC GAGTCAACTA AACTATACAG AGGATGAAAA GAAAGAAATC TATCTTTTTA 


3480 


GTCAGTTAAA 


TGCTGTTGGA AACTTTGCGA CAGGAACCAT 




Pr^TV*T& R am/' 

t-U 1 1_ I AAATG 


3540 


ATTCTCAGGT 




GPCTfTATTT CAAAfifJAfiAT 


flfT" "TWin ATT 


AGTATTTCTA 


3600 


CTTCTTGGGA 


TAGAAAGGTT 




lni nu 1 luuu 


ALs 1 Vj 1A1 LLA 


3660 


GTGAAAAAGC 


TGGTCTCCCA 


GCGGAAGAAG CAGAAGCCTA 


TCTTAAAAAA 




3720 


TAAATGACCG 


TGTAGGAACC 


TCCTATTTGG AAAAGCAATA 


TGAAGAGACC 


TTACAAGGAA 


3780 


AACGCTCGGT 


AAAAGAAATC 


CATCTGGATA AATATGGCAA 


TATGGAAAGC 


GTGGATACAA 


3840 


TTGAGGAAGG 


TAGTAAGGGA 


AACAATATCA AACTGACCAT 


TGATTTGGCT 


TTCCAAGATA 


3900 


GCGTGGATGC 


TTTACTGAAA 


AGTTATTTCA ATTCTGAGCT 


AGAAAATGGT 


GGAGCCAAGT 


3960 


ATTCTGAAGG 


TGTCTATGCA 


GTCGCCCTTA ACCCAAAAAC 


AGGTGCGGTT 


TTGTCTATGT 


4020 


CAGGGATTAA 


ACATGACTTG 


AAAACGGGAG AGTTGACGCC 


TGATTCCTTG 


GGAACGGTAA 


4080 


CCAATGTCTT 


TGTTCCAGGT 


TCGGTTGTCA AGGCGGCGAC 


CATCAGCTCA 


GGTTGGGAAA 


4140 


ATGGAGTCTT 


GTCAGGAAAC 


CAGACCTTGA CAGACCAGTC 


CATTGTCTTC 


CAAGGTTCAG 


4200 


CTCCCATCAA 


TTCTTGGTAT 


ACTCAGGCTT ACGGTTCATT 


CCCTATCACA 


GCGGTCCAAG 


4260 


CTCTGGAGTA 


TTCATCAAAT 


ACCTATATGG TCCAAACAGC 


CTTAGGTCTT 


ATGGGGCAAA 


4320 


CCTATCAACC 


CAATATGTTT 


GTCGGCACCA GCAATCTAGA 


GTCTGCTATG 


GAGAAACTGC 


4380 


GTTCAACCTT 


TGGCGAATAT 


GGCTTGGGTA CTGCGACAGG 


AATTGACCTA 


CCAGATGAAT 


4440 


CTACTGGATT 


TGTTCCCAAA 


GAGTATAGCT TTGCTAATTA 


CATTACTAAT 


GCCTTTGGGC 


4500 


AGTTTGATAA 


CTATACGCCG 


ATGCAGTTGG CTCAGTATGT 


AGCAACTATT 


GCAAATAATG 


4560 


GTGTTCGTGT 


GGCTCCTCGT 


ATTGTTGAAG GCATTTATGG 


TAATAATGAT 


AAGGGAGGAC 


4620 


TGGGTGACTT 


GATTCAGCAA 


CTGCAACCGA CAGAGATGAA 


TAAGGTCAAT 


ATATCCGACT 


4680 


CCGATATGAG 


CATCTTGCAC 


CAAGGTTTTT ATCAGGTTGC 


CCATGGTACT 


AGTGGATTGA 


4740 
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CAACTGGACG 


TGCCTTTTCA 


AATGGTGCCT 


TGGTATCCAT 


TAGCGGAAAA 


ACAGGTACAG 


4800 


CCGAAAGCTA 


TGTGGCAGAT 


GGTCAGCAAG 


CAACCAATAC 


CAATGCGGTG 


GCCTATGCCC 


4860 


CATCTGATAA 


TCCCCAAATC 


GCTGTCGCAG 


TGGTCTTTCC 


TCATAATACC 


AATCTAACAA 


4920 


ATGGTGTAGG 


ACCTTCCATT 


GCGCGTGACA 


TTATCAATCT 


GTATCAAAAA 


TACCATCCAA 


4980 


TGAATTAGAA 


AGGAAATTAT 


GCTTTATCCA 


ACACCTATTG 


CCAAGTTGAT 


TGACAGTTAT 


5040 


TCTAAGTTAC 


CAGGTATCGG 


GATTAAGACG 


GCTACGCGTC 


TGGCCTTTTA 


TACGATTGGG 


5100 


ATGTCTGCTG 


ATGATGTCAA 


TGAATTTGCA 


AAAAATCTCC 


TTTCTGCTAA 


GAGAGAATTG 


5160 


ACATATTGTT 


CTATTTGTGG 


ACGTTTGACA 


GACGACGATC 


CTTGTTCTAT 


CTGTACTGAT 


5220 


CCGACTCGTG 


ACCAGACAAC 


AATTTTAGTT 


CTTGAGGATA 


GTAGAGATGT 


GGCAGCCATG 


5280 


GAAAATATCC 


AAGAATACCA 


TGGACTCTAT 


CATGTCCTTC 


ATGGCCTCAT 


TTCTCCTATG 


5340 


AATGGTATCA 


GTCCGGACGA 


TATCAATCTC 


AAGAGCCTTA 


TGACTCGTCT 


TATGGATAGT 


5400 


GAGGTTTCAG 


AAGTGATTGT 


GGCGACTAAT 


GCTACAGCGG 


ATGGTGAAGC 


GACTTCCATG 


5460 


TATCTTTCAC 


GTTTGCTCAA 


GCCGGCTGGT 


ATCAAGGTTA 


CGCGTCTAGC 


ACGAGGTCTC 


5520 


GCTGTGGGAG 


CGGACATTGA GTATGCGGAC 


GAAGTGACAC 


TCTTACGAGC 


CATTGAAAAT. 


5580 


CGGACAGAGT 


TGTAAGTGTA 


GGCAAATTTA 


CGAACTCCAT 


TCATTTATAA 


AAAATCAAAG 


5640 


AGGCTGAAAA 


TCGTTCCTAT 


CGGCCTCTTT 


TTGTATAGTG 


TGATGAGTAG 


GCTCAGGTTC 


5700 


AAGTTTTAAA 


AAACCAAGCA 


AATATGATAT 


ACTAAAGAGC 


GAGTATTCTA 


GTAGAATTAG 


5760 


GACAAATAAT 


ATGAAACAAA 


CGATTATTCT 


TTTATATGGT 


GGACGGAGTG 


CGGAACGCGA 


5820 


AGTCTCTGTC 


CTTTCAGCTG 


AGAGTGTCAT 


GCGTGCGGTC 


GATTACGACC 


GTTTCACAGT 


5880 


CAAGACTTTC 


TTTATCAGTC 


AGTCAGGTGA 


CTTTATCAAA 


ACACAGGAAT 


TTAGTCATGC 


5940 


TCCGGGGCAA 


GAAGACCGTC 


TCATGACCAA 


TGAAACCATT 


GATTGGGATA 


AGAAAGTTGC 


6000 


ACCAAGTGCT 


ATCTACGAAG 


AAGGTGCAGT 


GGTCTTTCCA 


GTCCTTCACG 


GGCCAATGGG 


6060 


AGAAGATGGC 


TCTGTTCAAG 


GATTCTTGGA 


AGTTTTGAAA 


ATGCCTTACG 


TTGGTTGCAA 


6120 


CATTTTGTCA 


TCAAGTCTTG 


CCATGGATAA 


AATCACGACT 


AAGCGTGTTC 


TGGAATCTGC 


6180 


TGGTATTGCC 


CAAGTTCCTT 


ATGTGGCTAT 


CGTTGAAGGC 


GATGATGTGA 


CTGCTAAAAT 


6240 


CGCTGAAGTG 


GAAGAAAAAT 


TGGCTTATCC 


AGTCTTCACT 


AAGCCGTCAA 


ACATGGGGTC 


6300 


TAGTGTCGGT 


ATTTCTAAGT 


CTGAAAACCA 


AGAAGAACTC 


CGTCAAGCCT 


TAAAACTTGC 


6360 


CTTCCGATAT 


GACAGCCGTG 


TCTTGGTTGA 


GCAAGGAGTG 


AATGC 




6405 


(2) INFORMATION FOR SEQ ID NO: 108: 









) 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11309 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 108: 



CGAGCTCGGG 


TACCGGGATT 


TTAAGGAGTT 

X X l^f\\J\XrX\J X X 


TGATATGTAT 


A AfVT ATT 1 71 T< 


TA APPATTTT 
InALUil 111 


fin 

DU 


ATTAGTATTA 


TCTGTTGTGA 


TTGTGATTGC 




r* a a cc a 21 nr* & 


afiiarrn a to 




CAGCAATGTA 


TTTGATGCCA 


X A \m0*V\J\J X W*\ 


TTTGTTTG A A 

X X X V9 X X X wXul 


PGPAGTAAAG 


l- 1 I^VjIAju 111 


i on 


TGAAGCTGTA 


ATGCAGCGTT 

£\ X 1 X 


TGAPAGGGAT 


rfwiwp r f^iTf^TTT 
111 r\\j 1L1 1 1 


1 IV. 1\jVjU 1 flu 






AGCATTGACG 


GT ATT A TV AA 
va x *» x x n. x 


GTAGATAAGA 
o x nun a nnun 


AAATAATGGG 


\-/iv>v>/\t- 1 Abb 


TCTTTGCCTC 


JUU 


TTTTTATTTT 


TAAAGGATGT 


TTGAGAAGGT 




A a & & & tt a 


AA/VrvA AL» 


icn 
JoU 


AAAGAAAATA 


TGAAAGATAG 


AATAAAAGAA 


T ATTT A f A A G 


AC* AAGGfi A A A 


GGTGAPTTITT 




AATGATTTGG 


CTCAGGCTTT 


GGGAAAAGAC 


AGTTCCAAGG 


ATTTTT'GTG A 


nTTY2 ATT A A A 


inn 
sou 


ACCTTGTCCT 


TAATGGAAAG 


AAAGCACCAA 


ATT CG TTTTG 

*» X X \*VJ X X X X VJ 


AAGAAGATGG 


T AGTPTG AP A 




TTAGAAATTA 


AGAAAAAACA 


TGAGATTACC 


CTCAAGGGGA 


TTTTTCATGC 


PPATAA A A AT 


uUU 


GGCTTTGGCT 


TTGTTAGTCT 


GGAAGGCGAG 


GAGGACGACC 


TTTTTGTAGG 


GAAAAATGAT 


660 


GTCAACTATG 


CTATTGATGG 


TGATACCGTC 


GAGGTAGTGA 


TTAAGAAAGT 


cGPTGAprr^r' 


720 


AATAAGGGAA 


CAGCAGCAGA 


AGCCAAAATT 


ATTGATATCC 


TAGAACACAG 


TTTGACAACA 


780 


GTTGTCGGGC 


AAATCGTTCT 


GGATCAGGAA 


AAACCTAAGT 


ATGCTGGCTA 


TATTCGTTCA 


840 


AAAAATCAGA 


AAATCAGTCA 


ACCGATTTAT 


GTTAAGAAAC 


CAGCCCTAAA 


ATTAGAAGGA 


900 


ACAGAAGTTC 


TCAAAGTCTT 


TATCGATAAA 


TACCCAAGCA 


AGAAACATGA 


TTTCTTTGTC 


960 


GCGAGTGTTC 


TCGATGTAGT 


GGGACACTCA 


ACGGATGTCG 


GAATTGATGT 


TCTTGAGGTC 


1020 


TTGGAATCAA 


TGGACATTGT 


ATCCGAGTTT 


CCAGAAGCTG 


TTGTTAAGGA 


AGCAGAAAGT 


1080 


GTGCCTGATG 


CTCCGTCTCA 


AAAGGATATG 


GAAGGTCGTC 


TGGATCTAAG 


AGATGAAATT 


1140 


ACCTTTACCA 


TTGACGGTGC 


GGATGCCAAG 


GACTTGGACG 


ATGCAGTGCA 


TATCAAGGCT 


1200 


CTGAAAAATG 


GCAATCTGGA 


GTTTGGGGTT 


CACATCGCAG 


ATGTTTCTTA 


TTATGTGACC 


1260 


GAGGGGTCTG 


CCCTTGACAA 


GGAAGCCCTT AACCGTGCGA CTTCTGTTTA 


CGTGACAGAC 


1320 


CGAGTGGTGC 


CAATGCTTCC 


AGAACGACTA 


TCAAATGGCA 


TCTGCTCTCT 


CAATCCCCAA 


1380 


GTTGACCGCC 


TGACCCAGTC 


TGCTATTATG 


GAGATTGATA 


AACATGGTCG 


TGTGGTCAAC 


1440 


TATACCATTA 


CACAAACAGT 


TATCAAGACC 


AGTTTTCGTA 


TGACCTATAG 


CGATGTCAAT 


1500 
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GATATCCTAG CTGGCGATGA AGAAAAGAGA AAAGAATATC ATAAAATTGT ATCAAGTATC 1560 

GAACTCATGG CCAAGCTTCA TGAAACTTTA GAAAACATGC GTGTGAAACG TGGAGCTCTC 1620 

AATTTTGATA CCAATGAAGC GAAGATTTTA GTGGATAAAC AAGGTAAGCC TGTTGATATC 1680 

GTTCTTCGGC AGCGTGGTAT TGCCGAGCGG ATGATTGAGT CTTTTATGTT GATGGCTAAT 1740 

GAAACAGTTG CCGAACATTT CAGCAAGTTG GATTTGCCTT TTATCTATCG AATTCACGAG 1800 

-GAGCCTAAGG CTGAAAAGGT TCAGAAGTTT ATTGATTATG CTTCGAGTTT TGGCTTGCGC 1860 

ATTTATGGAA CTGCCAGTGA GATTAGTCAG GAGGCACTTC AAGACATCAT GCGTGCTGTT 1920 

GAGGGAGAAC CTTATGCAGA TGTATTGTCC ATGATGCTTC TTCGCTCTAT GCAGCAGGCT 1980 

CGTTATTCGG AGCACAATCA CGGCCACTAT GGACTAGCTG CTGACTATTA TACTCACTTT 2040 

ACCAGTCCAA TTCGTCGTTA TCCAGACCTT CTTGTTCACC GTATGATTCG GGATTACGGC 2100 

CGTTCTAAGG AAATAGCAGA GCATTTTGAA CAAGTGATTC CAGAGATTGC GACCCAGTCT 2160 

TCCAACCGTG AACGTCGTGC CATAGAAGCT GAGCGTGAAG TCGAAGCCAT GAAAAAGGCT 2220 

GAGTATATGG AAGAATACGT GGGTGAAGAG TATGATGCAG TTGTATCAAG TATTGTCAAA 2280 

TTCGGTCTCT TTGTCGAATT GCCAAACACA GTTGAAGGCT TGATTCACAT CACTAATCTG 2340 

CCTGAATTTT ATCATTTCAA TGAGCGTGAT TTGACTCTTC GTGGAGAAAA ATCAGGTATC 2400 

ACTTTCCGAG TGGGTCAGCA GATCCGTATC CGTGTTGAAA GAGCGGATAA AATGACTGGA 2460 

GAGATTGATT TTTCATTCGT ACCTAGTGAG TTTGATGTGA TTGAAAAAGG CTTGAAACAG 2520 

TCTAGTCGTA GTGGCAGAGG GCGTGATTCA AATCGTCGTT CGGATAAGAA GGAAGACAAG 2580 

AGAAAATCAG GACGCTCAAA TGATAAGCGT AAGCATTCAC AAAAAGACAA GAAGAAAAAA 2640 

GGAAAGAAAC CTTTTTACAA GGAAGTAGCT AAGAAAGGAG CCAAGCATGG CAAAGGGCGA 2700 

GGGAAAGGTC GTCGCACAAA ATAAAAAGGC ACGCCACGAC TATACAATCG TAGATACGCT 2760 

AGAGGCAGGG ATGGTCCTGA CTGGAACTGA AATCAAGAGT GTACGAGCTG CTCGAATTAA 2820 

TCTCAAGGAT GGCTTTGCTC AAGTGAAAAA TGGAGAAGTT TGGCTGAGCA ATGTTCATAT 2880 

CGCGCCTTAC GAAGAGGGCA ATATCTGGAA CCAGGAACCA GAACGTCGTC GTAAACTCCT 2940 

GCTCCATAAA AAGCAAATTC AAAAATTGGA ACAAGAGATC AAAGGGACAG GAATGACCTT 3000 

AGTTCCCCTT AAGGTCTATA TAAAAGATGG CTACGCTAAG CTTCTTTTAG GACTTGCCAA 3060 

AGGGAAGCAT GACTATGACA AACGGGAGTC TATCAAACGT CGTGAGCAAA ATCGAGATAT 3120 

CGCGCGTGTG ATGAAAGCTG TTAATCAGCG ATAAAAAGAG GAATTGAAAA TGGAAAAATT 3180 

AGTTGCCTAT AAACGCATGC CTTTGTGGAA TAAACAAACA ATGCCTGAAG CTGTTCAGCA 3240 
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AAAGCAOAA'P Ar*A AAAr ,f PTV* cr , 'ttf**TVTir*c*r*r % ptlxx AnvuKom 


GTCTTGAAGG 


GAGCTCTCAA 


3300 


A * i AAbbouAAbT TLTAGCTGAA 


CACCTCTTTG 


AAGCAGGGGC 


3360 


AGACAATCCA ATGGCCCAAC CTCAAGCCTG GCACCGAGTG 


GAAGCTGCCA 


CAGATGATGT 


3420 


GGAATGGTAC TTGGAATTTT ATTGTAAACC TGAGGATTAT 


TTTGCTAAAA 


AATACAATAC 


3480 


CAATCCTGTT CATTCAGAGG TCCTAGAGGC CATGCAGACA 


GTGAAACAAG 


GGAAAGCTTT 


3540 


GGATTTGGGT TGTGGTCAGG GGCGTAATTC TCTTTTTCTA 


GCCCAGCAAG 


ATTTTGATGT 


3600 


GACGGCTGTA GATCAAAATG GACTAGCTCT TGAAATCTTG 


CAAAGCATTG 


TGGAGCAGGA 


3660 


AGATTTGGAC ATGCCTGTTG GCCTTTACGA TATCAATTCA 


GCTAGCATTG 


AACAAGAATA 


3720 


TGATTTTATC GTTTCAACAG TTGTTCTCAT GTTTCTACAA 


GCGGACCGCA 


TTCCAGCTAT 


3780 


TATTCAAAAT ATGCAGGAGA AAACCAGTGT TGGTGGTTAC AACCTTATCG TTTGTGCCAT 


3840 


GGACACGGAG GATTATCCTT GCTCGGTTAA CTTCCCATTC 


ACCTTTAAAG 


AAGGAGAACT 


3900 


GGCAGACTAT TACAAGGATT GGGAATTGGT TAAGTACAAT 


GAAAATCCAG 


GCCATTTGCA 


3960 


CCGTCGCGAT GAGAATGGCA ATCGTATTCA ACTACGCTTT 


GCGACCTTAC 


TAGCTAAGAA 


4020 


AATCAAGTAA ACACACATGA AGATTAGGAA TTTTCCTGAT 


CTTTTTTCTT 


TTTTACGAAT 


4080 


GATATAGAAA AGGAGGGAAT TCATGTTTGT TGCGAGAGAT 


GCTAGGGGAG 


AATTGGTAAA 


4140 


TGTGTTAGAG GATAAACTTG AGAAGCAAGC ATACACCTGC 


CCAGCTTGTG 


GAGGCCAGCT 


4200 


CCATTTGCGT CAAGGACCAA GTGTACGGAC GCATTTTGCC 


CATAAATCCT 


TAAAAGACTG 


4260 


TGATTTTTTC TTTGAAAATG AAAGTCCAGA ACACCTGGCC 


AATAAGGAAT 


CCCTCTATCA 


4320 


CTGGTTGAAA AAAGAGACAA AGGTTCAATT AGAGTACCCG CTTTCAGAAC TTAAACAGAT 


4380 


TGCGGATGTA TTTGTAAATG GCAATCTAGC TCTAGAAGTT 


CAGTGTAGTC 


CCTTGCCTCA 


4440 


GAAAGTCCTT AAAGAGCGAA GTGAGGGCTA TCGTAGTCAG 


GGTTACCAAG 


TACTGTGGTT 


4500 


GCTGGGTCAA AAACTGTGGC TCAAGGAGCG TTTGACTCGT CTACAGCAAG GTTTTCTTTA 


4560 


TTTCAGTCAA AACATGGGCT TTTATGTTTG GGAATTAGAC AAGGAAAAAC AAGTTTTAAG 


4620 


ACTCAAATAC CTGATTTACC AGGATCTCCG CGGTAAACTC 


CATTATCAAA 


TCAAGGAATT 


4680 


TTCCTATGGT CAAGGTAGTT TATTGGAAAT ATTGCGTCTT 


CCCTATAAGA 


GACAAAAAAT 


4740 


ATCTCATTTT ACAGTTTCTG AGGACAAGGA CATCTGTCGC 


TATATCCGGC 


AACAACTTTA 


4800 


TTATCAAAAT CTCTTTTGGA TGAAAGAACA AGCAGAAGCC 


TATCAAAAGG 


GAGAAAATAT 


4860 


CCTGACTTAT GGACTGAAAG AATGGTATCC ACAAATTCGA 


CCAATAGTGG 


GCAAATTTTT 


4920 


CCAGATTGAA CAAGACTTGA CTAGCTATTA TCAGCACTTT 


TATACCTATT 


ACCAAAAAAA 


4980 


TCCTCAAAAT GATTGGCAAA AGCTTTATCC ACCAGCCTTT 


TATCAGCAAT 


ATTTCTTGAA 


5040 
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AAATATGGTA 
TGAAAAAGAT 
CTTAAAAGAT 
TAGTGCGGAT 
GAAGCTTTAC 
AGAGTACTAT 
TGAGCCTGAA 
AAAGCTGCAG 
TTCACAACGT 
AACCTTCGCT 
TAAAGAAGTT 
GGTTCGCCGT 
TGCCAAAACC 
CAAGAGTGCT 
TTTGGTAGCA 
AAAAATCTTG 
TGAATACAGT 
GGGTGAGGAT 
CGAAAATCAA 
TATGCTTCTC 
TCACAGTATG 
TATCTTTTTG 
GGAAGAAGTG 
CCGTGGAACA 
AGATCAAAAT 
CCAAGAGTAT 
CATTCCACAC 
CTCAGCCTTG 
CTACCTCAAG 



GAATAGAAAG 
ACATGGGATC 
TTAACAGAAC 
AACCTACTAG 
GCTTATGCTC 
GCCAAGGCCA 
TTTATGGAGA 
GTTTATCAAC 
GAAGAAGAAT 
ATCTTGGACA 
CAGCTATCTC 
GGTGCCTATC 
TTGCAAACCA 
CGTCATGCAG 
GCAGTTCGCA 
GGGATTTCAG 
TTTACCTACC 
TACTTGAGCC 
GGCAAGCGTT 
AACTGGCAAG 
CATTCAAGCT 
GCTGAGATTG 
GAAGACGACG 
GTTTTCCGCC 
GGGGAGGTCT 
TATGGTTTGA 
TTCTACTATA 
GCTGAAAAAA 
GCAGGTAAGT 



GATGGAGGAA 
TATCAACGAT 
AATTGGAGAC 
AAATCACTGA 
ATATGAAGAA 
TGACACTCTA 
TTAGCGAAAA 
ACTATTTTGA 
TATTGGCTGG 
ATGCGGATAT 
ATGGGACTTA 
AAGCTCTTTA 
ATGTTAAGGT 
CCCTCGCAGC 
AGCATTTGCC 
ATCTCAAGAT 
AAGAAGCCTT 
GTGTTAAACG 
CAGGTGCCTA 
ACAATCTGGA 
ATACTCGTGA 
CCTCAACTAC 
CAACACGCTT 
AAACTCAATT 
TGACAAGCGA 
GTAAGGAAGA 
ACTACTATGT 
TTGTCCATGG 
CGGACTATCC 



797 
TCTAATGGTA 
CTACCCAACT 
AGTAGCCCAG 
ATTTTCTCTT 
TGACCAGGAT 
CAGCCAGTTA 
GCAGTATGCT 
CAAGCTTTTG 
AGCTGGAGAA 
TGTGTTCCCT 
CACACGTTTG 
TGCGACTTAC 
GCAAAATTAC 
GAATTTTGTT 
ACTCTTACAT 
GTACGATGTC 
GAAAAAAGCA 
TGCCTTCAGC 
CTCTGGTGGT 
CAATCTCTTT 
AACTCAGCCT 
CAATGAAAAT 
TGCTATTCTC 
TGCTGAGTTT 
TTTCCTAAAT 
CAATCCTGAA 
ATATCAATAT 
TAGTCAAGAA 
ACTTAATGTC 



TTACAAAGAA 
GACCAGGCTT 
TATGAAGGCC 
GAAATGGAAC 
ACACGTGAAG 
GACCAAGCCT 
GACTTTTTAG 
CAAGGCAAGG 
ATCTTTGGTT 
TATGTCCTAG 
ATGGAGTCTA 
GAACAATTCC 
CGTGCTAAAG 
CCAGAAAGTG 
CGCTATCTTG 
TACACACCGC 
GAAGATGCTT 
GAGCGTTGGA 
TCTTATGATA 
ACTCTTGTTC 
TATGTTTACG 
ATCTTGACGG 
AATAACTTCC 
GAACACGCCA 
AAACTCTACG 
ATCCAATACG 
TCAACTGGCT 
GACCGTGACC 
ATGAGAAAAG 



ATGAAATAAA 
GGGAAGAAGC 
ATCTCTTGGA 
GCCAGATAGA 
CTAAGTATCA 
TTTCATTCTA 
AAGCTCAACC 
ATCACGTTCT 
CAGCAAGTGA 
ACGATGATGG 
AAAAACGTGA 
AACACACCTA 
TTCGTAACTA 
TTTATGACAA 
AGCTTCGTTC 
TTTCATCTGT 
TGGCAGTCTT 
TTGATGTTTA 
CCAATGCCTT 
ATGAAACAGG 
GGGATTACTC 
AGAAATTATT 
TAGATGGTTT 
TTCACCAAGC 
CAGACTTGAA 
AGTGGGCTCG 
TTGCGGCCGC 
GCTATATCGA 
CTGGTGTTGA 



5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 
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TATGGAGAAG 
GTTTGAAGCC 
ATGCTAACCA 
AGCGTCAAAA 
ATATTCCTAT 
AGCCTAAAAA 
AACATGCGCC 
CCAAGGAAAA 
CGGTGGATGT 
AGTCTAAATA 
TTGTCTTGGA 
GTGGTCAGCG 
CAGAACTCAC 
TAGCAGATGT 
AAAATAGATA 
GTGCCATCAC 
CAGACCTTAT 
AAAGCAACCC 
AACAATATGG 
AACAATATGG 
GTAAAGCTCA 
CTGAATTGAC 
CTCAAATCAT 
CAGAAGGTGC 
AAAATGGTGG 
AAGCCGCTTT 
AAGCCTACAG 
ATATTGATGA 
CAACATTTGT 
ACCAAGCCTT 



GAAGACTACC 
CTTGTTGAAA 
TAACATGCGT 
GCAGGTCACA 
TATTCCCCAT 
TATTCTGGAA 
AAATGCTAAG 
TTTTGCCCAG 
CTTATCTACA 
CATCGTCTTT 
TGATATTTTT 
AACCATTTAT 
CGCAACATTA 
TCAACTGTCT 
GAGTAACACT 
ACTATTATCA 
CAGCATGAAA 
TTCAGCCCAA 
CTCAGAGCTT 
CGAAAACTAC 
AATTCGTACA 
AGATGAAGCC 
CCGTCTTAAT 
TGATTTTGCT 
AGAAATTACC 
CGCTTTAGAT 
TAGCCAATAT 
CTACAAAGAA 
TCAAAGCATT 
CCAAAATATC 



TCAACGATGC 
AATTAGGATT 
CGTCCTGTCG 
GGTTTCTTGA 
GAAACGGTTG 
ATTGGGACGG 
ATTACAACTA 
TTTGACAGTC 
CTGACAGAGT 
CTGCCAGAAA 
CAAGGTGGTG 
CGAGGCCTTC 
GTGCCTTTAG 
GAAAGCGAAT 
TATCTCAAAG 
GTAGCAACTT 
GGGGATGTCA 
CAAGTCTTGT 
GATGATAAAG 
CAACGTGTCT 
AGTAAATTAG 
TATAAGAAAG 
AATGAAGATA 
CAATTAGCCA 
TTTGATTCTG 
GTGGATGGTG 
TACATTGTAA 
AAATTAAAAA 
ATCGGAAAAG 
TTTACCCAAT 



798 
CTTTGCAGTC 



GGCATAAAAT 
TCAAAGAAGA 
AAGAATTGGA 
CTTATTTCCG 
CTATCGGTTT 
TTGATCGTAA 
GCAAGCAAAT 
CTTATGATTT 
TCCTCAAACA 
ATGTTGCCAA 
AAAAATTATT 
GAGATGGTAT 
GATTTTCAGA 
GAGTAGACAT 
TAGCAGCTTG 
TTACAGAACA 
TAAATATGAC 
AGGTTGATGA 
TGTCACAAGC 
TTGAGTTGGC 
CCTTTGATGA 
AGGCCAAAGA 
AAGATAATTC 
CTTCAACAGA 
TTTCTGATGT 
AACTCACTAA 
CTGTTATCTT 
AATTGCAAGC 
ATATCGGTGG 



TTTGAACGCC 
GGTTGAATCG 
AATTGTAGAC 
AGACTTTGCC 
TTTTCTTATG 
TTCAGCTCTC 
TCCAGAAATG 
CACTCTCCTA 
CGTCTTTATG 
TTTGGAAGTT 
GGATATTATG 
TGATGCAACC 
TCTCATGCTT 
AAAATTTAAG 
GAAGAAAAAA 
TTCGAAAGGG 
TCAATTTTAT 
CATCCAAAAA 
TACTATTGCC 
AGGTATGACT 
AGTTAAGAAG 
GTACACTCCA 
AGTTCTCGAA 
AACTGATGAA 
AGTACCTGAG 
GATTACAGCA 
GAAAACAGAA 
GACTCAAAAA 
AGCCAATATC 
TGGAGATTCA 



GTTTAAATGA 
TATAGTAAGA 
TTGATGCGTC 
CGCAAGGAAA 
GAAACCATGC 
TTGATGGCTG 
ATTGGTTTTG 
GAGGGAGATG 
GATTCTGCCA 
GGTGGTGTGG 
GAAGTCCGTC 
TTAGACAATC 
CGTAAAAATG 
AAAAAATAGT 
TTATTGGCAG 
TCAGAAGGTG 
GAGCAAGTGA 
GTTTTTGAAA 
GAAGAAAAAA 
CTTGAAACAC 
GTAGCAGAAG 
GATGTAACGG 
AAAGCCAAGG 
AAAACAAAAG 
CAAGTCAAAA 
ACTGGCACAC 
AAATCATCTA 
CAAAATGATT 
AAGGTTAAGG 
AGCTCAAGCA 



6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

8520 

8580 
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GTAGTACATC AAACGAATAG TCCAAATCAA TGAGTCAGGG AAAAAACTCG ACTTCAGGAA 8640 

AAAATGAAGC AAACATTCCC ACAATAAAAC GCATAGTACA AGGTTTGTAC TGCCCCCCAA 8700 

AAAGTTAGAC AATTAATTTA TCCGAAGGAT TTAGTTCTGT ATTGCACAGA GCTAAGTCCT 8760 

TTTAGTTTTA TCTTAATTCT CTTATTGTTG TAATAATCAA TATAGTCTAT AATGGCTCGT 8820 

TCCAATTGAT TAAGTGATTT AAATGTTTTC TCATAGCCAT AAAACATTTC GGATTTTAAA 8880 

- ATGCCAAAGA AAGATTCCAT CCTACCGTTG TCTTGGCTGT TGCCCTTACG TGACATGGAT 8940 

GCTTGAATTC CCTTACTCTC TAGGAAGCGA TGATAAGAAT CGTGTTGATA TTGCCAGCCT 9000 

TGGTCACTAT GGAGAATCGT ATTCTCGTAG TGCTTCTCTT TGAATGCCTG TTCCAACATT 9060 

AACGATCAAT CAATTTAATC ATGTACCTAA GATTAGAATT GTTTATCCCA AATTTATTTG 9120 

AAAGCTTCTC TAAGCTATAT CCTTGTTTTC TAAGTTCATA GATCTGAACT TTATCATCAT 9180 

AAGTTAATTT CATAATAAAA ACACCCCAAA AGTTAGATTT TTTCTGTCTA ACTTTTGGGG 9240 

TGTAGTTCAT GTACACCTGA TATGATGCGT TTTATAATTT TAAAGACTTT TTGACCAGCC 9300 

TCATTTTTTT AACTTGATAC TCAGTGAAAA GCAAAGATTA AACTAGGAAG CTAGCTGTAG 9360 

GCTGCTCAAA GAACAGCTTT GAGGTTGTAG ATAAAACTTG TGAGGTCACC AACATATATA 9420 

ATGTGAAGCT GACGTGGTTT GAATAGATTT TAGAAGAGTA TGAGTCTGGA AGTTTTAATG 9480 

GATAATGCAA GATTCCATAG AATGGGTAAG CTAGAGTTCT TATGTGAAGA GTTTGGGCAT 9540 

AAACTTTTAC CTTTTCCTCC CTACTCATCT TAGTATAGAA AAGTGAATCT GAAATAGTAC 9600 

ATAACTGCTT CTAAAACATT CTTATAAATT GATTTAAATT CTCAAATCAT ATTATTCAGT 9660 

TCTTATTTCA TTTTGTTCTA CAATCCTGTT GAGAAGACAC GTGTTCATAT CAAAAAGGTA 9720 

TTGGCAAGTT GCAATACCTT TTTACGAGGC TCTGTTGTCT TATTTTTGTT TCAACTGACT 9780 

ATATCTCCTA TGGTTCTAGT TCAGAAGGCT AGGCTATAAT TATGATTGAT AAGAAGTATC 9840 

ATTCCAAGTA TTGGGAGTGA ATGTTTCAAA ATCATGGGTT TCTATAATGG TCAGGCTGGC 9900 

ATTTGCTAGA CCGCCATCTT TACGAAGAAG TGGTTCTTTA TAGCCTAGGA GAGTACGAAG 9960 

ACTGGCAGTA AGATTGGCGC CGTGTCCGAC AATTAGAATA CGTTCAGCTG GACTATCTTT 10020 

TAATGATTTG ATAAATTGGA TGGTCCGTTG AGTTGTACTA TAGAGGGATT CGGCTCCGAA 10080 

CATTCGAGTG TCAAATTGAG CAAGATTTGA ACGAAAAGCC TGGATTTGTT GCGGGTAAAT 10140 

AGCTTCCAAG GTTGCAATTT TCAAACCTTC TAACTTCCCA AGTTGCCATT CACGGAGATT 10200 

AGGAACGATT TCTAAAGAAC AGGGGGTATA GAGTTGACTT TGGATAATCT CAGCAGATTT 10260 

GACCGCTCGA GGTAAATCAC TTGAATAAAT CTGATCAAAA GGAATTTCCT TGAGATACTG 10320 
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ACCAAGTCGT TTTAGGGTTT CAATGGATTC AGGAAGAAGA GGAGAATCAC CACTAGCACC 10380 

TTGAAAACGA CCTTCTTGGT TCCAGAGGGT ACGACCGTGG CGGACAAAGT AGAGTTTCAT 10440 

TACTTGATGT CCTCCAAAAT ATCTACAAAG TCTGCCTTTA CAAAGCTAGC CAAGTCTTGT 10500 

GGCGCGACGA TAATGCTGTG TCCGACTTCG CCTGCAGAGA CAATCATTTG ATCCAAATCT 10560 

AGAGCAATTT TATCGATAAA AATGGGATAA TTGTGTTTCT GACGAATTCC GACAGGATTA 10620 

TTGGCTCCAT GAATGTAACC AGTTGTTTTT TCTAAGTCCT TTTGTGGAAT CATGCTCACT 10680 

TTTTTATTGC CAGAAATTTT AGCTAGTTTC TTTTCAGACA AGTGCTGAGT GATAGGGACA 10740 

ATTCCGATAA TCGGTCCGGT CTTGTCTCCC AAAAGCGCCA AGGTTTTGAA AATCTGATCT 10800 

CGTTCATAAC CTTGAGGAAG CTCTCCTTCT AGGGCATTGA TTTGAATCCC CTGATGAGGG 10860 

ATAGCTGCTT TAGATAGGAT TTGTTCCACC AATGTTTTTT TGATTTTAAC TTTTTTTGCC 10920 

ATTATTTATA TTTATCCTCC AATTGACTCA TCCAAATACC AAGCCAGATT CCCAGCGCAA 10980 

AGAAGAAGGC GATGATGACA TAACCGACAA GTGAAAGTCC TGTGTATTGG ATACTTTCAG 11040 

CGTTTCCTGC ATTTGGAATT AAGATCAAAA GGGTACTTGA TAGGACGATA CCGATGATGA 11100 

AATGATAGAC GAACTGTTTA CGGAGTTCTT CTAGTTCTCC GTCCGTCCAA GCGTAGGCCA 11160 

CTTCTTCTTT CTTGCCTTTA CCTTTGGACA TCTTGTAAAG AGGTGGGAGG GCAATATAGA 11220 

CATGACCTGC CTCGACTAGC GGACGCATGT AACGGTAGAA AAATGTCAAG AGCAAGGTCT 11280 

GGATATGGGC ACCGTCGGTA TCCGCATCG 11309 

(2) INFORMATION FOR SEQ ID NO: 109: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5548 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

CCATAGTCTA ACAAGTCTTT GTAAAGGTTT ATCCCTGATT CATGTAAAGA TTGTGTAAAG 60 

AATCAAAAAA AGCCACTTTT GAAAAATGGC TGCTCCTAAA AATAGCTTTA AAAATTATTA 120 

GTCCTGTGCG AAAGATTGGT TAGGAAGAAA AATCGTGAAG CAACTGCCTC TGCCAAGCTG 180 

ACTCGTCACC GTGACTTGGC CACCTAATAA TTGACTGAGT TCTTTGACAA TGGCAAGGCC 240 

AAGACCAGTG CCACCAGTTT GTCTGCTTCG ACCTTTATTA ACTCGGTAAA AACGTTCAAA 300 

AATACGATCC TGCTCTAATT GACTAATACC AATCCCTGTA TCTGATACAG AAATCTTAAT 360 

GCCTTCGTTC ACCTTTTGGG TCTTGACCTC AATTTTTCCC CCTTGTTCAG TGTAACGGAT 420 
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GGCATTGGAT 


AAAAGATTGA 


GTAAGATTTG 


GG AAAGTAAT 


TGACTATCTG 


ATACGAGGGT 


480 


GACATCATCT 


GGCACCTGCA 


CCTTTAGCTG 


TAAATCCTTC 


TTCTTGAGCT 


GAGGTTGCAA 


540 


GCTTTGAGTC 


AAATCCTGTA 


CAAATTCTGC 


CAAAGAAAGG 


GTCGTCCATT 


GTATAGGCAT 


600 


TTGTTGAGCC 


TTAGATAAGG 


TAAGAAGATG 


CTCAACAATA 


TGCTCAAGAC 


GCAAACTTTC 


660 


TTTGTAAATA ATGTCTAGAA AGTCATCCTT GAGCGCTTCT 


TCTTCAGCTG 


ACATCCCCTT 


720 


AATGGTTTCA 


GCAAAGCCCT 


TAATCGAAGT 


AACTGGTGTC 


CTCAATTCAT 


GGGAGGCATT 


780 


TGAGACAAAG 


GCTAAATTTA 


ACTTTTCATA 


AGTTCTAATC 


GTTGTTAAAT 


CATATAGCAA 


840 


GACGAGCACA 


GCTTCCACAG 


ATTGGGTGGG 


GCTAAAAACG 


GGAACTGCTG 


TCACTTCTAA 


900 


AATCAAGTCA 


CCCTCATGAA 


ACCCACTTAC 


TTCTTGTTTT 


AACCTTGTTT 


TTTGATCAAA 


960 


GGCTTGGTGA ACTAAATTCC 


GAATATCCAT 


CCGTTTGAGG 


TCATCAAGTG 


AACTTATGTC 


1020 


GCCGTCCACA 


TCGGGAAAAT 


AATGAGGCAG 


AGAGCGACTG 


GATAATAACA 


TCTGACCTTG 


1080 


AGCGGAAACT 


AAAAACGTCC 


CCATGGTTAG 


GTGCGACAGA 


AGAACCTCCA 


TTGTTTCGGC 


1140 


TAGATCCTTG 


TATTGCTGAT 


CCTGTTGGGA 


GACTTTGGTT 


TTTAGGCCAG 


ACACATACTG 


1200 


AGCCAAAGAC 


TTTAAGTCTT 


CTTGCCCTTT 


TTCTAAAAAG 


TATTCACTAC 


TGGTCAAGAG 


1260 


AGGTTGGTGC 


AAGGTCTCAA 


AAGCAACTTC 


CCATTTCCAA 


AGGCAAAAGA 


GCCAGTAGCC 


1320 


ACCTAGTCCC 


AAAGAAAGGG 


CTAGAAGAAA 


GAGACCGATG 


CCTTTACTGA 


TCCAAGTTAA 


1380 


TGCCATCCCT 


GCAATCAGAA 


TGAGGCTAAC 


ACTTAGATTG 


ACTAGCCAAA 


ATTGAAGGTA 


1440 


GCGTTTCATC 


TATAACTCCT 


TGAACTTATA 


ACCATAACCC 


CGAATGGTTC 


GAATAAATTG 


1500 


AGGGGCTTTA 


GGATTGTCTT 


CAATTTTTTC 


CCTCAACTTA 


CCAATATGAA 


CGTCCACCAA 


1560 


ACGTGTTTCC 


TGCCCAAAGT 


CATACCCCCA 


GATACGTTCC 


AAAAGACGCT 


CTCTAGTCAG 


1620 


TGTCATGTTG 


GGATGTTTCA 


TAAGATAGAG 


CAAGAGTTCA 


AATTCTTTTG 


GGGTCAAACT 


1680 


CAGTAACTTA 


TTCGCCTTGT 


AGACTTCATG 


ACGCTCAGGG 


TATACTTTCA 


AGGTCCCAAA 


1740 


TAGCCAAGAA 


TCGTCAGCGA 


TATTATCTGA 


ATCATCTCCT 


TCTTGTTCTC 


CTTTAGTTCG 


1800 


CCTGAGGACA 


GCCTTGACAC 


GCGCCAGCAA 


TTCTCTAGGG 


CTAAAAGGCT 


TGGTCAGGTA 


1860 


GTCATCAGCC 


CCTAATTCCA 


AGGCCAAAAC 


CTTATCAAAT 


TCATCACTTT 


TCGCAGAAAC 


1920 


CATCATAATT 


GGAGTTTTGA 


CGCCTTTGGC TCTCAGCCGC 


TTACAAACTT 


CCATGCCATC 


1980 


TAATTGTGGT 


AACATGATAT 


CAAGCAAGAT AAAATCAAAG 


GGTTCTGTTT 


CTGCCAAAGC 


2040 


TAAGGCCTTC 


CGTCCATTTG 


TCACCAATTG AGTAGAAAAG 


CCTTCCTTAC 


TTAAATGGTA 


2100 


GTCAAGCAAT 


TTCAGAATGT 


GTTCTTCATC 


ATCCACTAAT 


AAGACTTGTT 


TTGTCATCTA 


2160 
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TTATCTCCTA TTGGTAACAT TATAACACAA TTATCAGAAA TCCTAACATT GCTAAATCAG 2220 

ATTAAATTTG CCTATCAAGA CTAGTATCTG GTCAAACGCT CAATCATCTC CTTGTGCTCT 2280 

GGATAGGTCG CCAGTAGATC TACCCTTTCA AATAATTCAA AATCCTCAAA TTCAAAACCA 2340 

GGAGCAACAA GACAAGAAAC CAGAGCATCA TCCTTATCAA CTGTTGATCC CCAAATAGTG 2400 

CCCTTAGGAA CACAGTAGTG AAGTTGTTGC CCTTTGGATA TGTCCAGGCC TAAAGTGACT 2460 

GCTTCGTAGT GACCATCTGC TGTAATCATG TGAACAGTAA GTGGGGATCC TGCATGAAAA 2520 

TACCAGATTT CATCTGCTGT CAATCGGTGA AAATGTGAAG GATTCGTTTC TTCTAATAAG 2580 

AAATAAATAC TGGTATAAAG CGCCCTTCCC TTACCAGCAA GGTTTATAGT GTCTGAAGCT 2640 

TTTTTTGTTT GTCTAAAATA GCCACCTTCA ATATGGGGAG CTAACTCTAG AGTTCTTATC 2700 

AAGTCTTCTT TATCCGTCGG AGCCAATGGG TTGAAGTAAC TCTTGTTCAA AGTGGTTTTA 2760 

CGATTTCAAG AACTCCTCTC AGTTCTGAGG ACACGGTAAT GATTGATGCG ACGGAAGTAC 2820 

AAATCAATCG CCCTAAAAAA AGAATTAGCG AATGATTCTG GTAAAAAAAA TGCCACGCTA 2880 

TGAAGGCTCA AGCGATTGTC ACAAGTCAAG GGAGAATTGT TTCTTTGGAT ATCGCTGTGA 2 940 

ACTATTGTCA TGATATGAAG TTGTTCAAAA TGAGTCGCAG AAATATCGGA CAAGCTGGTA 3000 

AAATCTTGGC TGACAGTGGT TATCAAGGGC TCATGAAGAT ATATCCTCAA GCACAAACTC 3060 

CACGTAAATC CAGCAAACTC AAGCCACTAA CAGTTGAAGA TAAAGCCTAT AACCATGCGC 3120 

TATCCAAGGA GAGAAGCAAG GTTGAGAACA TCTTTGCCAA AGTAAAAACG TTTAAAATGA 3180 

TTTCAACAAC CTATCGAAAT CATCGTAAAC ACTTCGGATT ACGAATGAAT TTGATTGCTG 3240 

GCATTATCAA TCATGAACTA GGATTCTAGT TTTGCAGGAA GTCTATTATT TGGTTAGGTG 3300 

AATTAGTGAA GCGTTTAGGC AAGTGTCTCT GGTTACGACG TCATGGACTC TAAATCGATT 3360 

ATATTTAGGG GTCATGACTA GTGAAGCAGT TAGCTAGTTC GCATATAAGC GGCTAGCGTC 3420 

TAACAATTAG GAACTTTAGT TCCAATAACT TTAAGATTAC GACGTTTTAG GACATAAATC 3480 

GATCATATTT ATGTCCTAAA ACTAGTGAAG CGCCTAGCCA AAGTCCGAAT AGGATTTGGC 3540 

GTTAGTTACT TAGATTGCTT TGCAATCAAG TAACTTTGGC GATTTACATC TTCTCTGGCG 3600 

CTTCTACTCC AAGCAAGCGA AGGGCTTCTT TGAGAACGAC TGCGGTTGCG TAGCTGAGGG 3660 

CTAGACGGCT GTCGCGTTCT GGGCTTTCAT CCAAGATACG TGTATGTGCA TAGTATTTGT 3720 

TAAAGGATTG AGCCAGGCTA ATTGCAAATT TAGCAATGAT AGAAGGTTCA AAGTTATCTG 3780 

CCGCACGGTT GATAATACGT GGGAAGTCTT GAATGAGTTT AATGATTTCC CAGCTTTCAG 3840 

TATCATTCAA GCTATAGTTG CCAGCTGTTT CTGGTTTGAA ATCGGCTTTG CGTAAGATAG 3900 

ATTGGATACG AGCGTAGGCA TATTGAACGT AAGGTCCAGT TTCACCCTCG AAGGATACCA 3960 
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TAGCCTCTAG 


GTCGAAGTCG 


TATCCATTTG 


TACGGTCGGT 


TTTGAGGTCA 


TAGAATTTAA 


4020 


TGGCTCCAAT 


CCCAACAGCA 


TGTGCTACTT 


GGTCTTTGTT 


TTCTAGTTCA 


GGATTTTTAG 


4080 


CCTCGATTTG 


GACCTTGGCA 


CGGCTAACAG 


CCTCTGCAAC 


AGTAGGCTCT 


AGCAAGATGA 


4140 


CATTCCCTTT 


ACGAGTAGAG 


AGTTTCTTCC 


CTTCTTTTGT 


AACCAAACCA 


AAAGGAACGT 


4200 


GAGTAATGTC 


GTCACTCCAG 


TCGTAGCCCA 


TCTCTTGCAA 


GACAGCTTTG 


AGCTGTTTAA 


4260 


AGTGGGCAGA 


TTGTTCTTGA 


CCAACGACAT 


AGATAGATTT 


AGCAAATTGG 


TATTCGTTTT 


4320 


TACGGTAGAG 


GGCTGCAGCC 


AAGTCACGTG 


TGATATAGAG 


AGTTGCACCA 


TCAGACTTCT 


4380 


TGATGAGGGC 


TGGATGTTCA ATT C CAT ATT 


TCTCAAGATT 


CACAACTTGG 


GCACCTTCTG 


4440 


ATTCAAGAAG 


TAGTCCTTTT 


TCAGAAAGAA 


TGTCTACAAC 


TG CATC CATC 


TTATCATTGT 


4500 


AGAAGGCTTC 


TCCGTTATAG 


CTGTCAAATT 


CAACCTTCAA 


TTCATTGTAA AGGCGGTTAA 


4560 


ATTCCACTAA 


ACTTTCATCG 


CGGAACCATT 


GCCAAAGAGC 


GAGAGCTTCC 


TCATCTCCAT 


4620 


TTTCAAGTTT 


ACGGAACCAT 


TCGCGCGCTT 


CTTCATCCAA 


GCTAGGGTCA 


TTTTCAGCTT 


4680 


CAGCGTTGAT 


GCGGACATAG 


AGTTTAAGGA 


GTTCATCGAT 


TGGATGAGCT 


TTTACAGCTT 


4740 


CTTCGTCGCC 


CCATTTTTTG 


TAGGCAACAA 


TCAACATCCC 


AAATTGTTTA 


CCCCAGTCTC 


4800 


CCAAATGGTT 


GACCTTGACC 


GTTTGATAAC 


CGATTTTTTG 


GAAAATATGT 


GACAAGCTAT 


4860 


CTCCGATAAC 


AGTTGAACGC 


AGGTGGCCAA 


TAGAAAATGG 


TTTAGCGATA 


TTCGGACTAG 


4920 


ACATGTCGAT 


AACAACATTT 


TCTTGTTTAC 


CAATATTTTG 


GTCAGCATAG 


TGTTCTTTTT 


4980 


CAGTGGTAAC 


AGCTTGCAAT 


ACTTGAGCAG 


AAATGGCAGA 


TTTATCAAGG 


AAAAAGTTAA 


5040 


CGTAAGGTCC 


TGTTGCGACA 


ACTTTTTCAA 


AGGCTTGGCT 


GTTCATTTTT 


TCAGCCAGTT 


5100 


CAGCCGCAAT 


CATTTGTGGT 


GCTTTACGTT 


CGACTTTTGC 


AAGAGAAAAA 


GCAGGGAAAG 


5160 


CAATGTCTCC 


CATTTCTGAG 


TTTTTAGGGG 


TTTCCAGTAA 


CTTTAAAATA 


GCCTCTTGGT 


5220 


CCAGGCTATC 


AATGATGCTA 


GATAATTCGC 


TAGCAATCAA 


TTCTTTTGTA 


TTCATTAAGA 


5280 


GCTCCTTTTT 


GGACTTTTCT 


ACTATTTTAT 


CACAATTTTA 


AAGAAAGAAG 


AAAAAATTTT 


5340 


TGAAATCTCC 


TGTTTTTTTG 


GTATAATATG 


GTTATAAATA 


TAGTTATAAA 


TATGCACGCA 


5400 


AGAGGATTTT 


ATGAGAAAAA 


GAGATCGTCA 


TCAGTTAATA 


AAAAAAATGA 


TTACTGAGGA 


5460 


GAAATTAAGT 


ACACAAAAAG 


AAATTCAAGA 


TCGGTTGGAG 


GCGCACAATG 


TTTGTGTGAC 


5520 


GCAGACAACC 


TTGTCTCGTG 


ATTTGCGG 








5548 


(2) INFORMATION FOR SEQ ID NO: 110: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3132 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


110: 






TACCCGGTAG 


TCTTAGCAGA 


CACATCTAGC 


TCTGAAGATG 


CTTTAAACAT 


CTCTGATAAA 


60 


GAAAAAGTAG 


CAGAAAATAA 


AfiAflAAAPAT 


GAAAATATCC 


ATAGTGCTAT 


GGAAACTTCA 


120 


CAGGATTTTA 


AAGAftAAftAA 




ATTAAGGAAA 


AAGAAGTTGT 


TAGTAAAAAT 


180 


CCTGTGATAG 


APAATAAPAP 


TafipaaTviaa 


GAAGCAAAAA 


TCAAAGAAGA 


AAATTCCAAT 


240 


AAATCCCAAG 


fi Afi ATTAT A P 


UViAL lA- A ill 


GTGAATAAAA 


ACACAGAAAA 


TCCCAAAAAA 


300 


GAAGATAAAG 


TTGTCTATAT 


TGCTGAATTT 


AAAGATAAAG 


AATCTGGAGA 


AAAAGCAATC 


360 


AAGGAACTAT 


CCAGTCTTAA 


GAATACAAAA 


GTTTTATATA 


CTTATGATAG 


AATTTTTAAC 


420 


GGTAGTGCCA 


TAGAAACAAC 


TCCAGATAAC 


TTGGACAAAA 


TTAAACAAAT 


AGAAGGTATT 


480 


TCATCGGTTG 


AAAGGGCACA 


AAAAGTCCAA 


CCCATGATGA 


ATCATGCCAG 


AAAGGAAATT 


540 


GGAGTTGAGG 


AAGCTATTGA 


TTACCTAAAG 


TCTATCAATG 


CTCCGTTTGG 


GAAAAATTTT 


600 


GATGGTAGAG 


GTATGGTCAT 


TTCAAATATC 


GATACTGGAA 


CAGATTATAG 


ACATAAGGCT 


660 


ATGAGAATCG 


ATGATGATGC 


CAAAGCCTCA 


ATGAGATTTA 


AAAAAGAAGA 


CTTAAAAGGC 


720 


ACTGATAAAA 


ATTATTGGTT 


GAGTGATAAA 


ATCCCTCATG 


CGTTCAATTA 


TTATAATGGT 


780 


GGCAAAATCA 


CTGTAGAAAA 


ATATGATGAT 


GGAAGGGATT 


ATTTTGACCC 


ACATGGGATG 


840 


LATATTGCAG 


GGATTCTTGC 


TGGAAATGAT 


ACTGAACAAG 


ACATCAAAAA 


CTTTAACGGC 


900 


ATAGATGGAA 


TTGCACCTAA 


TGCACAAATT 


TTCTCTTACA 


AAATGTATTC 


TGACGCAGGA 


960 


TCTGGGTTTG 


CGGGTGATGA AACAATGTTT 


CATGCTATTG 


AAGATTCTAT 


CAAACACAAC 


1020 


GTTGATGTTG 


TTTCGGTATC 


ATCTGGTTTT 


ACAGGAACAG 


GTCTTGTAGG 


TGAGAAATAT 


1080 


TGGCAAGCTA 


TTCGGGCATT 


AAGAAAAGCA 


GGCATTCCAA 


TGGTTGTCGC 


TACGGGTAAC 


1140 


TATGCGACTT 


CTGCTTCAAG 


TTCTTCATGG 


GATTTAGTAG 


CAAATAATCA 


TCTGAAAATG 


1200 


ACCGACACTG 


GAAATGTAAC 


ACGAACTGCA 


GCACATGAAG 


ATGCGATAGC 


GGTCGCTTCT 


1260 


GCTAAAAATC 


AAACAGTTGA 


GTTTGATAAA 


GTTAACATAG 


GTGGAGAAAG 


TTTTAAATAC 


1320 


AGAAATATAG 


GGGCCTTTTT 


CGATAAGAGT 


AAAATCACAA 


CAAATGAAGA 


TGGAACAAAA 


1380 


GCTCCTAGTA 


AATTAAAATT 


TGTATATATA 


GGCAAGGGGC 


AAGACCAAGA 


TTTGATAGGT 


1440 


TTGGATCTTA 


GGGGCAAAAT 


TGCAGTAATG 


GATAGAATTT 


ATACAAAGGA 


TTTAAAAAAT 


1500 


GCTTTTAAAA AAGCTATGGA TAAGGGTGCA CGCGCCATTA 


TGGTTGTAAA 


TACTGTAAAT 


1560 
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TACTACAATA GAGATAATTG GACAGAGCTT CCAGCTATGG 


GATATGAAGC 


GGATGAAGGT 


1620 


ACTAAAAGTC AAGTGTTTTC AATTTCAGGA GATGATGGTG 


TAAAGCTATG 


GAACATGATT 


1680 


AATCCTGATA AAAAAACTGA AGTCAAAAGA AATAATAAAG 


AAGATTTTAA 


AGATAAATTG 


1740 


GAGCAATACT ATCCAATTGA TATGGAAAGT TTTAATTCCA 


ACAAACCGAA 


TGTAGGTGAC 


1800 


GAAAAAGAGA TTGACTTTAA GTTTGCACCT GACACAGACA 


AAGAACTCTA 


TAAAGAAGAT 


1860 


ATCATCGTTC CAGCAGGATC TACATCTTGG GGGCCAAGAA 


TAGATTTACT 


TTTAAAACCC 


1920 


GATGTTTCAG CACCTGGTAA AAATATTAAA TCCACGCTTA 


ATGTTATTAA 


TGGCAAATCA 


1980 


ACTTATGGCT ATATGTCAGG AACTAGTATG GCGACTCCAA 


TCGTGGCAGC 


TTCTACTGTT 


2040 


TTGATTAGAC CGAAATTAAA GGAAATGCTT GAAAGACCTG 


TATTGAAAAA 


TCTTAAGGGA 


2100 


GATGACAAAA TAGATCTTAC AAGTCTTACA AAAATTGCCC 


TACAAAATAC 


TGCGCGACCT 


2160 


ATGATGGATG CAACTTCTTG GAAAGAAAAA AGTCAATACT 


TTGCATCACC 


TAGACAACAG 


2220 


GGAGCAGGCC TAATTAATGT GGCCAATGCT TTGAGAAATG 


AAGTTGTAGC 


AACTTTCAAA 


2280 


AACACTGATT CTAAAGGTTT GGTAAACTCA TATGGTTCCA 


TTTCTCTTAA 


AGAAATAAAA 


2340 


GGTGATAAAA AATACTTTAC AATCAAGCTT CACAATACAT 


CAAACAGACC 


TTTGACTTTT 


2400 


AAAGTTTCAG CATCAGCGAT AACTACAGAT TCTCTAACTG 


ACAGATTAAA 


ACTTGATGAA 


2460 


ACATATAAAG ATGAAAAATC TCCAGATGGT AAGCAAATTG 


TTCCAGAAAT 


TCACCCAGAA 


2520 


AAAGTCAAAG GAGCAAATAT CACATTTGAG CATGATACTT 


TCACTATAGG 


CGCAAATTCT 


2580 


AGCTTTGATT TGAATGCGGT TATAAATGTT GGAGAGGCCA 


AAAACAAAAA 


TAAATTTGTA 


2640 


GAATCATTTA TTCATTTTGA GTCAGTGGAA GCGATGGAAG 


CTCTAAACTC 


CAGCGGGAAG 


2700 


AAAATAAACT TCCAACCTTC TTTGTCGATG CCTCTAATGG 


GATTTGCTGG 


GAATTGGAAC 


2760 


CACGAACCAA TCCTTGATAA ATGGGCTTGG GAAGAAGGGT 


CAAGATCAAA AACACTGGGA 


2820 


GGTTATGATG ATGATGGTAA ACCGAAAATT CCAGGAACCT 


TAAATAAGGG 


AATTGGTGGA 


2880 


GAACATGGTA TAGATAAATT TAATCCAGCA GGAGTTATAC 


AAAATAGAAA 


AGATAAAAAT 


2940 


ACAACATCCC TGGATCAAAA TCCAGAATTA TTTGCTTTCA 


ATAACGAAGG 


GATCAACGCT 


3000 


CCATCATCAA GTGGTTCTAA GATTGCTAAC ATTTATCCTT 


TAGATTCAAA 


TGGAAATCCT 


3060 


CAAGATGCTC AACTTGAAAG AGGATTAACA CCTTCTCCAC 


TTGTATTAAG 


AAGTGCAGAA 


3120 


GAAGGATTGA TT 






3132 


(2) INFORMATION FOR SEQ ID NO: 111: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14672 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 



CGAGATTTCT 


TTAAATGAAC 


TACGTGAAAT 


CTACCCATCA 


TCCAGATCTG 


GATATTCTCT 


60 


CCTATCTATA 


AGTAAAGTTT 


TAGGAGATTT 


TAATATAAGT 


TCTCATGCTT 


TTAAAGCTTC 


120 


GGTAAGAGAT 


TTAAAACCGC 


TCAGTTTCCC 


ACTCATTTGC 


TTCTGGGAGA 


GTTCTCATTT 


180 


TATTATTCTT 


GAAAAAATTA 


GTAAAAACAA 


GTTTTATATT 


TTAGATCCTG 


CAAAAGGCAG 


240 


GCAGAGAATG 


TCAATAAGTG 


AATTTGAAAG 


GCATTATTCA 


AATATCATTT 


TAACATTTAA 


300 


AAAGTTAGAT 


AGCTTTATGT 


CTCGTAAAGA 


TAATAAGAAG 


TCGCCTGTTT 


TAAAGTATTT 


360 


TTTTAAGTAT 


AGGAATAAGC 


TAGGGATTTT 


ATTTTTTGTA 


ACAGCATTAT 


TGTATGTAAT 


420 


ACAATCATTA 


GTACCTATAG 


CTAATAGATA 


CATAATTGAC 


ACGAATTTCA 


AGGACGATTC 


480 


GTATTCGTCT 


AGAATGTTAT 


TTACTATATT 


ATTTATATTT 


ACTGTTTCAT 


TCTCACTAAT 


540 


GTATTTATTA 


AGACAGATAT 


ATGTTGCATC 


CTTAAAATAT 


ATAATGGATA 


AAGAGATTAG 


600 


CTATGATTTT 


ATGAAACATT 


TGATATATTT 


ACCTTACAGT 


TTTTATGAAA 


AACGTACTTT 


660 


AGGGGATATA 


CTTTTTAGAG 


CTAACTCTAT 


TGTTTATATA 


AGAGAAATAC 


TATCAAATAA 


720 


TTTTATAGCA 


GCTATACTTG 


ATTTGTTAAT 


GATTGTGGTT 


TATGCTGTGG 


TTTTATTTAG 


780 


CTTTTCTAAG 


TACATGGTAA 


TCTTTTTAAT 


ATCACTAAGT 


CTAGCTCTAT 


CTATTGTAAT 


840 


GTATCCAATC 


ATAAAAATCT 


CAAAAAATTT 


AATTGATAAA 


AATATAAAAG 


AAAAGGTTAA 


900 


TGTTCAAAAT 


ATTACTTCCG 


AAGTAATTTC 


TAAAAATAGT 


GATATTAAGC 


TAACTGGAGA 


960 


AGAGGAATTT 


TGGATTAACA 


AATGGGATAA 


TTTTAATACA 


AAACAGCTCA 


TCATAGGTCG 


1020 


AAAACTTGAT 


ATACATTTAT 


CAATTGTTAG 


TAGTATAACG 


AATGTTTTAC 


AAATTATTCT 


1080 


CCCTGTTTTG 


ACCCTTATTG 


TAGGTGTAAA 


TATAAAAACA 


TTCGAACAAT 


TGACGTTAGG 


1140 


ACAAATTGTA GCAATAAGTA CAGTCTCACC 


ATACTTTATT 


TCTCCTATAA 


TTTCTTTAAG 


1200 


TGATAACTAT 


ATACAATTAA 


TGTTATTAAA 


GGGATATTTT 


TTAAGAATAG 


AGGATGTGTT 


1260 


TAATACTAAA TCCGAATTAA TTCCAGAAAG AGTCAGTCAA GATATAAAAT TTGATAAAAA 


1320 


AATAGAATTA 


AAAGATATTT 


GGTATAAATA 


TGGATTATTT 


GATGATTATG 


TTTTGAAAGG 


1380 


AATAAATGTT 


ACTATTAAAA AAGGAGAAAC TGTTGCTATT GTTGGAGAAT 


CAGGTTCAGG 


1440 


TAAGAGTACA 


TTAGCTAAAA TTTTATTAGG 


TTTATTAGAA 


CCTAATATTG 


GTTCAATAGA 


1500 


AGTTGATGGA GTAGAAAAAG 


AAGAAATTGG 


TCAAACATTG 


TATAGAAAGA 


TTTTTGGAGC 


1560 
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AGTGTTACAA AATTCAACCC TAAGTTATGG TACCTTAAGA GAGAATTTGA CATTTGGACA 1620 

CTTTGTTTCA GATGAAGAAT TAATGACAAA TCTAAATTCA ATTGGTCTTA GCAATGTAGT 1680 

TAAATCTTTA CCTCTTGGAT TAGAGACAAT CATCGCTGAA GAAGGTAATA ACTTTTCTGG 1740 

AGGGCAGCAG CAAATGATAC TTTTAGCTCG TTGTCTTTTG TCGAAACCTT CGGTAGTTGT 1800 

TTTGGACGAA GCAACAAGTA GTTTAGATAA TTTATCTCAA CAAATTACAA CTTCTTACTT 1860 

AAGTGAAATC GGTACCACTA AGATTTTAAT TGCCCATCGA CTAGATACTA TCAAGTCTGC 1920 

AGATAAGATC TTAGTAATGC ATAATGGTGA AATTGTAGAG ATTGGGACCC ATAGAGAACT 1980 

TCTTGAACTA GGAGGCATTT ATAAGCAATT GTATTCAAAT AATTAGTTTT TGATTAAAAG 2040 

GGTAAATTTA TGAAGATTAT GAAAAAAAAA TATTGGACTT TAGCGATATT ATTCTTTTGT 2100 

TTGTTCAATA ATTCTGTTAC TGCTCAAGAA ATACCTAAAA ATCTTGATGG CAATATAACT 2160 

CACACTCAGA CTAGCGAAAG TTTTTCTGAA TCTGATGAAA AACAGGTTGA CTATTCTAAT 2220 

AAAAATCAAG AAGAAGTAGA CCAAAATAAA TTTCGTATTC AAATCGATAA GACAGAATTA 2280 

TTTGTAACAA CAGATAAACA TTTAGAAAAA AACTGTTGTA AATTGGAACT TGAACCACAA 2340 

ATAAATAACG ATATTGTTAA CTCTGAAAGT AATAATTTAC TAGGCGAAGA TAATTTAGAT 2400 

AATAAAATTA AGGAAAATGT TTCTCATCTA GATAATAGAG GAGGAAATAT AGAGCATGAC 2460 

AAAGATAACT TAGAATCGTC GATTGTAAGA AAATATGAAT GGGATATAGA TAAAGTTACT 2520 

GGTGGAGGCG AAAGTTATAA ATTATATTCT AAAAGTAATT CTAAAGTTTC AATTGCTATT 2580 

TTAGATTCAG GAGTCGATTT ACAAAATACT GGATTACTGA AAAATCTTTC AAATCACTCA 2640 

AAAAACTATG TCCCCAATAA AGGATATTTA GGAAAAGAGG AGGGAGAGGA AGGAATAATA 2700 

TCAGATATTC AAGATAGATT AGGTCATGGT ACGGCTGTTG TAGCTCAAAT TGTAGGGGAT 2760 

GACAATATTA ATGGAGTAAA TCCTCACGTT AATATTAACG TCTATAGAAT ATTTGGTAAG 2820 

TCGTCAGCTA GTCCAGATTG GATTGTAAAA GCAATTTTTG ATGCTGTAGA TGATGGCAAT 2880 

GATATTATCA ATCTTAGTAC TGGACAATAT TTAATGATTG ATGGAGAATA TGAGGACGGA 2940 

ACAAATGATT TTGAAACATT TTTGAAGTAT AAAAAGGCTA TTGATTACGC GAATCAAAAA 3000 

GGAGTAATTA TAGTAGCTGC ATTAGGGAAT GACTCCCTAA ATGTATCAAA TCAGTCAGAT 3060 

TTATTGAAAC TTATTAGTTC ACGCAAAAAA GTAAGAAAAC CAGGATTAGT AGTTGATGTT 3120 

CCAAGTTATT TCTCATCTAC AATTTCGGTC GGAGGCATAG ATCGCTTAGG TAATTTATCA 3180 

GATTTTAGCA ATAAAGGGGA TTCTGATGCA ATATATGCGC CTGCAGGCTC AACATTATCT 3240 

CTTTCAGAAT TAGGACTTAA TAACTTTATT AATGCAGAAA AATATAAAGA AGATTGGATT 3300 
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TTTTCGGCAA CACTAGGAGG ATATACGTAT CTTTATGGAA ACTCATTTGC TGCTCCTAAA 3360 

GTTTCTGGTG CGATTGCAAT GATTATTGAT AAATACAAAT TAAAAGATCA GCCCTATAAT 3420 

TATATGTTTG TAAAAAAATT CTGGAAGAAA CATTACCAGT AAAAAATGGT ATAAAAGTGT 3480 

TAAATATACC AAACGTATTG AGATATGATT TGAATATGTT ACAATTAGAA TATAAAAATG 3540 

AACAAAGTTG GGATAGTTTC ATAGATAATG TTAATTTAAT TGAGTTGGAA GAGAGAATTC 3600 

AAACTACTAT TGGAATTAAA CAAATAAACA CACACAATAT TATTACTATT GCCCGAGAAG 3660 

GGTACTCTCA AAATTATTTA CCTAACACTT CAGAAAATAC ATATAATTCA TTACAAGTCA 3720 

GTTTAGTTGG AGTATTACTA CTTTTTATAA GTATGGTAAA TATTTTATGG GCTAAAAAAA 3780 

GTAAATGAAA ATAAAATTTG GAGCCCTCTG AAAAAGTAAG TCCTACAGTT CAACTAAAAT 3840 

GAGTCAAAAG ATGAATCACC TTGATGTAGG GGAGTTTGTC TTATTGCTGC CTGAACACCT 3900 

CCGTTCAGAG GAAGAACATT ATAAATCTGT TTTTGAAGAC GACTTAACCA GTCGCATATC 3960 

TAGTCAAGAT GAACGACAGC AAATGACTGC TACGGTAGGT TATTTAGAAT CAGGTCAGGA 4020 

TCGTTTTGTG TATAATACGA CCCCTATTTC TTACCAGCAG TTTTTGAAAG ATCCAATCAT 4080 

CATTGTTATA ACACCCCAAT CAACTGGTCC ACAGTCCATT TTGTTTTGGA TAGACGCAGT 4140 

ACAGAACTAC GTTCTCTTTA ATCAATTGTC TGATGCCCAG GAGCTTATCC AGAGACAAGG 4200 

CATTGAAAAT TGGGTCTCAG AAATGCAAAC AGGTTACCAC AACTACATCA CATTATTGGA 4260 

TAATATCCAG AGGGAACGTT GGGTAATGCT AGCAGGAGCT GTGCTTGGGA TTGCAACTTC 4320 

AATCTTGTTG TTTAACACTA TGAATAGGCT CTACTTTGAA GAATTTAGAC GTGCCATTTT 4380 

TATCAAACGC ATTGCAGGTC TCAGGTTCTT AGAAATCCAT CGCACTTATC TCTTTGCTCA 4440 

ACTGGGTGTG TTTTTACTGG GATTTGTTGC GAGTGTATTT CTTCAGGTAG AGATAGGAGT 4500 

TGCTTTCTTA GTCTTGTTAC TCTTTACTGG TCTATCTCTT TTACAGTTAC ATGTCCAAAT 4560 

GCAGAAAGAA AACAAGATGT CCATGCTTGT TTTGAAGGGA GGTTAATATG ATTGAACTTA 4620 

AACAGGTGAG TAAATCTTTT GGAGAACGAG AGTTATTTTC GAATCTTTCA ATGACATTTG 4680 

AGGCTGGAAA AGTCTATGCC TTAATTGGTT CAAGTGGTAG CGGAAAAACA ACCTTGATGA 4740 

ACATGATTGG GAAATTAGAA CCTTATGATG GGACGATTTT TTACCGAGGT AAAGACTTGG 4800 

CCAATTATAA ATCAAGTGAT TTTTTCCGTC ACGAATTGGG CTACCTCTTC CAGAACTTTG 4860 

GCTTAATTGA AAACCAAAGT ATTGAAGAAA ACCTTAAGCT AGGTCTCATT GGTCAAAAGT 4920 

TGAGTCGGTC GGAACAGCGG TTGAGGCAGA AGCAGGCTTT AGAACAGGTC GGCCTGGTTT 4980 

ATCTTGACCT AGATAAGCGC ATCTTTGAGT TATCGGGCGG AGAATCGCAA CGGGTTGCCT 5040 

TGGCAAAAAT TATCTTAAAG AATCCACCCT TTATTCTGGC AGATGAGCCA ACAGCTTCAA 5100 
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TAGACCCAGC 


AACCTCTCAG 


TTGATTATGG 


AGATTTTGCT 


ATCTCTTCGA 


GATGATAATA 


5160 


GGCTAATCAT 


TATCGCAACA 


CATAATCCGG 


CAATTTGGGA 


GATGGCTGAT 


GAAGTGTTCA 


5220 


CGATGGATCA 


TCTGAAATAA 


AAATCCTTGT 


TTTTAATTGC 


ACGATGAGTT 


ACTGAAATAT 


5280 


TATCATGAAT 


CAAGAATTGG 


AGTTAATTTA 


GAATTGTACT 


TAATTTAGAA 


TTGTACTTTA 


5340 


TTAATATTGA 


GGTAACTTTT 


TCTTGATAAA 


GGAAGAAATA 


ATGGAGAGGA 


AGTTAGAATG 


5400 


AAAAAATTCG 


ACAATTATAT 


TATTGAGAAG 


CCTTGCGATT 


CTAATTCAGA 


TAAACTGCAA 


5460 


AAAATCTTAA 


TAATTGAAAG 


TTTGGTAGAT 


GATATTTTGC 


AATTTTCTCT 


CAGAATCAAT 


5520 


AATAGTGTAG 


GAGAGATTTT 


CCTCCTACAA 


CCGTTTTAAA 


AGAAAACTAT 


CTTTATTCCA 


5580 


TGTTATTTTG 


AGGAAGATAT 


TGTGAAAGTC 


AAAGATGATG 


ATAAAGTTGA 


GTGGAATTTG 


5640 


TTAGAATTTC 


AAAAATTTAG 


AGCATTTTTG 


GCTTAGTAAT 


CTGTGTTGAA 


GGCTCAAAAC 


5700 


CTATGGTAAA 


AAAGTAGCTT 


TGAAAACGTA 


TTGCCTCCAA 


AGATTTAGTT 


AAATAATGAT 


5760 


TTAACACAAA 


AAGAAATTAT 


TGAAGTTCTG 


GAAAGATGTT 


GTTTCAGTAT 


TGAGAAAAGG 


5820 


TGGGAAAAAC 


TTGCGATTTT 


CACAGAGAAA 


GGAAGAAAAA 


GTATAGAAAT 


ATAGTCAATT 


5880 


GAAACAAGAA 


CAGGATAAAA 


GAACCTTTTG 


TGCCATATTT 


TTCTCCTTTC 


GCTTTACAAT 


5940 


TGGATTGAAC 


ACCTTTATTG 


TATCGCGTTT 


GGAGTTTTTT 


TGGTATAACC 


TTCGACGCAC 


6000 


ACCCGCATAG 


CGGGTGTTTT 


TTTTGTCTCG 


CACCTAACGG 


AGCGAGACAA 


ACTAATAGTC 


6060 


ACTTAATCAA 


AAAACGCACC 


ATATCAAAAA 


CTAAAAAGTT 


TGATATCATG 


CGTCATGTCT 


6120 


TAAACTAATT 


GACTATACTT 


TCTATTCAAA 


TGAGCTTTTA 


ACCAATTGAT 


TGAGCCAATC 


6180 


CACTCTTAAA 


ACCAAAGAGC 


AATTTCTCGC 


TTAGCTGACT 


CTTCTGAATC 


TGAACCATGT 


6240 


ACAACATTTT 


GGATAATCTC 


ATTTTCTCCA 


GCAGCTTTTG 


CAAAATCACC 


TCGAATAGTG 


6300 


CCTGGTAAAG 


CTTCTTCTGG 


ACGAGTTGCA 


CCCATCATGG 


TCCGCCAAGT 


TTCGATTACT 


6360 


TTGGGACCAG 


AAATGACACC 


CACAAGAACT 


GGACCTGAAG 


TCATGAATTC 


ACGAATCGGT 


6420 


GGGTAAAAAC 


TCTGACCAAC 


CAAGTCCTGA 


TAGTGCTGGT 


CAATCAACTC 


TTCTGAAACC 


6480 


TGTGAACGAA 


ACTCCAATTT 


TTCGATTGTA 


AATCCACGTT 


GTTCGATGCG 


CTTTAACACT 


6540 


TCACCCACTA 


GCCCTCTTTT 


TACACCATCT 


GGTTTGATGA 


TAAAGAATGT 


TTGTTCCATA 


6600 


CCCGTCTCCT 


TTGTCAGCTT 


CTTTCTTTTA 


TTTTACCACA 


TTTCGTGGAA 


AAATGGAGAA 


6660 


AGTTTTCAGA 


AGAGAGAATG 


AGAGAACCCT 


CGGGTTCTCT 


CATTCTCTCT 


TATTCTACTG 


6720 


TTTCTTCCAC 


AGTTTCAACG 


GCAGTATCCA 


CAACTACTTC 


TGTTGTTTCT 


TCATTTCCTT 


6780 


CTTCCTCTAC 


TGGAGGATTA 


AGGTATTCTT 


CTTCGTTGAC 


AGCATGTGGT 


TCAAGGTTAC 


6840 
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GGTAACGGGC CATACCAGTA CCAGCTGGGA TGATCTTACC GATGATAACA TTTTCTTTAA 6900 

GTCCAAGGAG ATGGTCTTTC TTACCACGGA TAGCTGCGTC AGTAAGGACA CGAGTTGTTT 6960 

CCTGGAAGGA AGCCGCTGAC AAGAAACTGT TTGTTTCAAG TGAGGCTTTG GTAATTCCCA 7020 

TAAGGACTGG GCGACCTGTC GCTGGAACTC CACCTGCGAT AAGGACATCT TTGTTGGCAT 7080 

CTGTAAAGTC ATTGATATCC ATGAGGGTAC CCATGAGAAG ATCTGTATCA CCTGGATCCA 7140 

TGACACGGAC TTTACGGATC ATTTGACGAA CCATTACCTC GATGTGTTTG TCACCGATTT 7200 

CTACCCCTTG GCTACGGTAA ACTTTTTGTA CTTCACCGAG AAGGTACGTT TCAACTGACA 7260 

AGACATCACG AACTGCAAGG AGACGTTTTG GTTGGATAGA ACCTTCTGTC AGAGCAGCAC 7320 

CACGCGCTAC TTGGCCCCCA ACTTCGACAC GCATACGAGC TGTAAATGGA ACGACATATT 7380 

CACCTTCGCC AGTTTCACCC TTAACAAAGA CTTTCTTGGT ACGAGTTGAT GCATCTTCTT 7440 

CGATAGCAGT AACTTGTCCT TTAACCTCTG TAATAACCGC TTCCCCTTTA GGATTGCGGG 7500 

CTTCAAAGAT TTCTTGGACA CGAGGAAGAC CCTGAGTGAT ATCGGTATTT GAGGCAACCC 7560 

CACCTGTGTG GAAGGTACGC ATTGTAAGCT GTGTACCAGG TTCCCCGATA GATTGGGCAG 7620 

CGATTGTACC AACTGCTTCA CCAACTTCAA CCGCATCACC AGTCGCCAAG TTGATACCGT 7 680 

AACAGTGACG GCAGACACCG TGACGAGTGT TACATGTAAA TACAGAACGG ATAGTCACTT 7740 

CTTCCACACC AGCATTGACA ATTTCACGCG CCTTGTCTTC TGTAATCAAT TCATTTGGAC 7800 

CAATAATCAC TGCACCAGTT TCTGGATGTT TAACAGTTTT CTTAGTGTAA CGACCGTTGA 7860 

GACGCTCTTC GAGAGACTCG ATCATCTCTT TTCCTTCTGC GATAGAACGG ATCAAGAGAC 7920 

CACGGTCAGT TCCACAGTCG TCCTCACGGA TGATAACGTC TTGGGCAACG TCGACCAAAC 7980 

GACGAGTCAA GTAACCTGAG TCGGCTGTCT TAAGGGCCGT ATCGGTCATA CCTTTACGAG 8040 

CACCGTGAGT TGAGAAGAAC ATTTCCAATA CCGACAAACC TTCGCGGAAG TTTGAAAGGA 8100 

TTGGCAATTC CATGATACGT CCATTCGGAG CAGCCATCAG ACCACGCATA CCGGCAAGCT 8160 

GTGAGAAGTT TGAGATGTTA CCACGGGCTC CAGAGTCCAT CATCATAACG ATTGGGTTCT 8220 

TAGGATCTTG GTTAGCAATC AAGCGTTTCT CAAGTTTTTC ACGGGCAGCA CGCCATTCAG 8280 

CTGTAACAGC ATTGTAACGC TCGTCGTCTG TGATCATACC ACGACGGAAT TGTTTGGTGA 8340 

TTTGTTCGAC ACGTTTGTGT GATTCTTCAA TGATTTCAGC CTTGTCATCA ACGACTGGGA 8400 

TATCGGCAAT ACCCACTGTC AATCCTGCAA GAGTTGAGTG GTGGTAACCG AGGTTCTTCA 8460 

TGCGGTCAAG TAGGGCAGAA GTTTCTGTCG TACGGAAACG TTTGAAGATT TCAGCGATGA 8520 

TATTTCCAAG GTTTTTCTTC TTGAATGGAG GGTTGAGCTC AAGATTGCTG ATAGCTTCCT 8580 

TGATATCTCC ACCAAGTGGC AAGAAGTATT TAGCTGGAAC ACCTTCTGTC AAGTTGGCAT 8640 
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TGTTTGGTTC 


TTGCAAGTAT 


GGTAGCCCCT 


CTGGCATGAT 


ATCGTTGAAG 


AGAATTTTAC 


8700 


CAACTGTTGT 


AAGCAAGACC 


TTATGTCTTT 


GCTCTTCTGT 


CCAAGGCTTG 


TTGAGGCTGT 


8760 


CTGTTGCGAT 


ACCAACACGT 


GAGTGGAGGT 


GAACATAACC 


ATTGCGGTAA 


GCCATAACCG 


8820 


CTTCGTCACG 


GTCTTTGAAG 


ACCATTCCTT 


CACCTTCGCG 


ACCAGCTTCT 


TCCATGGTCA 


8880 


AGTAGTAGTT 


ACCCAAAACC 


ATGTCCTGAG 


ATGGAGTAAC 


TACCGGTTTC 


CCATCTTTCG 


8940 


• GGTTCAAGAT GTGCTCAGCA GCTAGCATGA GGATACGAGC 


TTCTGCTTGT 


GCTTCTTCTG 


9000 


AAAGTGGTAC 


GTGGATGGCC 


ATTTGGTCCC 


CGTCAAAGTC 


AGCATTGTAG 


GCTTCACAGA 


9060 


CAAGTGGGTG 


CAAGCGAAGA 


GCCTTACCAT 


CAATCAAGAC 


TGGCTCGAAG 


GCTTGGATAC 


9120 


CCAAACGGTG 


AAGGGTCGGT 


GCGCGGTTCA 


AAAGCACTGG 


GTGTTCTTTA 


ATCACTTCTT 


9180 


CAAGGATATC 


CCAGATACGC 


TCATCTCCGC 


GTTCCACCAA 


GCGTTTAGCT 


GCTTTGACGT 


9240 


TTTGCACGAT 


ATCACGGGCA 


ACGATTTCAC 


GCATGACAAA 


TGGTTTAAAG 


AGTTCAATCG 


9300 


CCATTTCACG 


CGGCACACCA 


CATTGGTACA 


TCTTAAGAGT 


TGGACCAACG 


GCGATAACTG 


9360 


AACGTCCTGA 


GAAGTCAACA 


CGTTTACCGA 


GCAAGTTTTG 


ACGGAAGCGT 


CCTTGTTTAC 


9420 


CTTTAAGCAT 


GTGGCTCAAT 


GATTTCAATG 


GACGGCTACC 


TGGTCCTGTG 


ATTGGACGAC 


9480 


CACGACGACC 


ATTGTCAATC 


AAAGCGTCAA 


CTGCTTCTTG 


AAGCATACGC 


TTCTCATTTT 


9540 


GAACGATGAT 


ACCTGGTGCA TTTAACTCAA GCAAACGAGC 


CAAACGGTTG 


TTACGGTTGA 


9600 


TAACACGGCG 


GTAAAGGTCA TTCAAGTCAG ATGAGGCAAA 


ACGGCCACCA 


TCCAACTGCA 


9660 


ACATTGGACG 


AAGATCTGGT 


GGGATAACCG 


GAAGGATGTT 


AAGAATCATC 


CATTCAGGTT 


9720 


TGTTTCCAGA 


CTTGTAAAAG 


GCATCCAAAA 


CATCCAAACG 


ACGGATGGCT 


TTGACACGCT 


9780 


TTTGTCCAGT 


AGCTGTTTTC 


AATTCTTCTT 


TGAGTTCAGC 


AATTTCTTTT 


TCAAGATCTA 


9840 


CTTGCTTCAA 


AAGGTCTTGG 


ATGGCTTCCG 


CACCCATCTT 


GGCAACAAAT 


GAACCATAAC 


9900 


CATATTCACG 


CAAGCGCTCT 


CGGTATTCGC 


GCTCTGTCAT 


GATAGACTTG 


TGCTCAAGTG 


9960 


GTGTATCCTT 


AGGATCAATC 


ACCACATAAG 


CCGCAAAGTA 


GATAACTTCC 


TCGAGGGCAC 


10020 


GAGGGCTCAT 


ATCAAGGGTC 


AAGCCCATAC 


GGCTTGGAAT 


CCCCTTGAAG 


TACCAGATGT 


10080 


GAGATACAGG 


AGCTTTCAAT 


TCGATATGTC 


CCATACGCTC 


ACGACGAACT 


TTCGTACGCG 


10140 


TTACTTCAAC 


CCCACAGCGG 


TCACAAACAA 


TTCCTCTGTA 


ACGAATGCGT 


TTGTACTTAC 


10200 


CACAAGCACA 


TTCCCAGTCT 


TTTGTAGGAC 


CAAAGATCAC 


TTCATCAAAG 


AGTCCTTCAC 


10260 


GTTCTGGTTT 


CAAGGTACGA 


TAATTGATTG 


TTTCAGGTTT 


TTTGACTTCT 


CCATAAGACC 


10320 


ATGAACGGAC 


TTTACTTGGA 


GAAGCTAGGG 


TGATTTGCAT 


ACTTTTAAAA 


CGATTTACAT 


10380 
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TTCTTCCCTT 


TCTATTCTAA 


GTGAACTGCT 


TATTCTTGTT 


CAGCAGCTTC 


10440 


TTCTGTTGCT 


TCCGCTTTTG 


TTGCTTTCTC 


AGCTTCTTCA 


GCTTCAAAGG 


CTGCTTTAGC 


10500 


CTCTTGGGCT 


GCTTTTTCGC 


GGGCTTTTTC 


AAGGTCATCT 


ACGTGGATGA 


CATCTTCGTC 


10560 


CATTCCTTCA 


TCCAAGTCGC GAAGTTCCAC 


TTCTTGGTCA 


TCTTCGTCTA 


GGACACGCAT 


10620 


GTCAAGACCA 


AGAGATTGCA 


ATTCTTTGAC 


AAGAACTCGG 


AAGGATTCTG 


GAACACCTGG 


10680 


TTTTGGAATT 


GfiTTTVICPTT 

W I 1 I\jV_V_ 1 1 


TTGTAATAGC 


TTCATAGGCT 


TTCAAACGTC 


CGTTGATATC 


10740 


GTCCGACTTG 


TAAfilVAAftA 


TTTCTTGAAG 


GACATTTGAC 


GCACCGTAGG 


CTTCAAGAGC 


10800 


CCAAACCTCC 


ATPTP APfYl A 


AACGTTGTCC 


ACCAAACTGA 


GCCTTACCTC 


CGAGTGGTTG 


10860 


TTGGGTAACA 


ul lxinulAJu 


GTCCGACTGA ACGCGCGTGC 


AATTTATCAT 


CAACCATGTG 


10920 


GTGGAGTTTG 


ATCATGTACA 


TGACTCCGAC 


AunAALALub 


TTATCAAACG 


GTTCACCAGT 


10980 


ACGTCCATCG 


TAAAGfiATfV; 


TTTTGGCATC 


uL 1A1LLA 1 A 


CCTGCTTCTT 


TAACAGTTGA 


11040 


CCAAAGATCT 


TPAnAAPTTf: 


CTC CATC AAA 




GCGATGTGAA 


TACCAAGAGT 


11100 


ACGAGCTGCC 


ATACCAAGGT 


GAAGCTCCAT 




ATATTCATAC 


GTGATGGTAC 


11160 


CCCAAGTGGG 


TTCAACATGA 


TGTCGACTGG 


AGTTCCGTCT 


GGAAGGTAAG 


GCATGTCTTC 


11220 


TACAGGAACG 


ATACGAGAGA 


CAACCCCTTT 


GTTTCCGTGA 


CGTCCGGCCA 


TTTTATCTCC 


11280 


GACCTTAATC 


TTACGTTTTT 


GAGCGATGTA 


AACACGAACC 


AACATGTTAA 


CACCTGATTG 


11340 


CAACTCATCT 


CCATTTACAC 


GTGTAAAGAT 


CTTAACATCA 


CGAACGACAC 


CATCGGCACC 


11400 


GTGTGGTACA 


CGAAGAGAAG 


TATCACGCAC 


TTCACGAGAC 


TTGTCTCCAA 


AGATAGCGTG 


11460 


CAAGAGACGT 


TCTTCAGCTG 


AAAGATCTTT 


CTCACCCTTA 


GGTGTTACTT 


TACCTACAAG 


11520 


AATATCACCT 


TCTTTAACCT 


CAGCACCAAT 


ACGGATAATC 


CCCATTTCGT 


CAAGGTCTTT 


11580 


GAGGGCATCT 


TCACCAACGT 


TTGGAATTTC 


GCGAGTGATT 


TCTTCAGGCC 


CAAGCTTTGT 


11640 


ATCGCGCGTT 


TCTGATTCGT 


ATTCTTCAAG 


GTGAACAGAT 


GTGTAGACAT 


CGTCCTTCAC 


11700 


CAAGCGTTCG 


CTCATGATAA 


CGGCATCCTC 


GAAGTTGTAA 


CCTTCCCAAG 


TCATGTAGGC 


11760 


AACGATTGGG 


TTTTGTCCAA GCGCCATTTC 


TCCATTTTCC 


ATAGAAGGTC 


CGTCAGCGAT 


11820 


GAAATCGCCT 


TTTTCAACGA 


CATCACCAAC 


TTTTACGAGA 


GTGCGTTGGT 


TGTAAGCAGT 


11880 


ACCTGAGTTT 


GAACGACGGA ATTTTTGGAT 


GTGGTAAACA 


TCCAATGAAC 


CATCTTCACG 


11940 


ACGAACTTCT 


ACCTTGTCAG 


CATCTGCGTA 


AGTAACTTTA 


CCATCATACT 


GAGCAATCAC 


12000 


AGCCGCACCA 


GAATCGTGGG 


CTGCTTGGTA 


TTCCATACCA 


GTACCAACGT 


AAGGTGCCTG 


12060 


AGGATTAATC 


AATGGCACAG 


CCTGACGTTG 


CATATTGGCT 


CCCATGAGGG 


CACGGTTGGA 


12120 


GTCATCGTTT 


TCCAAGAAAG 


GAATACATGC 


TGTCGCAACG 


GCAACTACCT 


GTTTTGGTGA 


12180 
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AACGTCCATG 


TAGTCAACAA 


TATTAGCTGG 


ATACTCTTGG 


TTGACCCCTT 


GGTGACGTCC 


12240 


CATGACAATC 


TTCTCAGCAA 


AGGTTCCATC 


TTCATTCAGA 


CGAGAGTTAG 


CCTGAGCTAC 


12300 


AGTATATTCA 


TCTTCTTCAT 


CAGCTGTCAA 


CCAAACAATT 


TCGTTCGTGA 


CAACACCTGT 


12360 


TTCACGGTCA 


ACCTTACGGT 


ATGGTGTTTG 


AACAAAACCA 


TATTTGTTCA 


AGTGTCCATA 


12420 


AGATGACAAG 


TTATTGATCA 


AACCGATGTT 


AGGTCCTTCA 


GGTGTCTCGA 


TTGGACACAT 


12480 


ACGACCATAG 


TGAGTGTAGT 


GCACGTCACG 


TACTTCATAT 


CCAGCACGGT 


CACGAGTCAA 


12540 


ACCACCAGGT 


CCTAAGGCTG 


ACAAACGGCG 


TTTGTGAGAC 


AACTCAGAAA 


GCGGGTTGTG 


12600 


TTGGTCCATG 


AACTGTGACA 


ACTGTGATGA 


ACCAAAGAAT 


TCTTTAACTG 


CAGCTGTTAC 


12660 


AGGACGGATA 


TTGATAATTT 


GTTGTGGTGT 


CAAGACTTCA 


TTGTCCTGAA 


CAGACATACG 


12720 


TTCACGGACA 


TTACGTTCCA 


TACGAGAAAG 


TCCCAAACGT 


ACTTGGTTGG 


CAAGCAATTC 


12780 


ACCAACCGCA 


CGGATACGAC 


GATTTCCAAG 


GTGGTCGATA 


TCATCTACAC 


GGCCAAGTCC 


12840 


TTCAGCCAAG 


TTGAGGAAGT 


AGCTCATCTC 


AGCAAGGATA 


TCTGCAGGAG 


TCACCGTACG 


12900 


AACCTTGTCA 


TCTGGGTTAG 


CATTACCAAT 


GATCGTTACG 


ACGCGATCTG 


GATCAGTTGG 


12960 


AGCAATAACC 


TTGAATTTTT 


GAAGAACAAC 


AGGCTCAGTC 


ACAACGGCTG 


CATCGTTTGG 


13020 


GATGTAGACA ATCTTGTTCA AGTCGCCATC 


CAAATGGCTT 


TCAATGCTTT 


CAATCACGCT 


13080 


ACGAGTCATA ATCGTACCAG 


CTTCTACCAA 


GATTTCTCCA 


GTTTCAGGGT 


CTACCAATGG 


13140 


CTCTGCAATG 


GTTTGGTTGA 


GCAAACGTGT 


TTTAACATTG 


AGTTTTTTAT 


TGATTTTGTA 


13200 


ACGACCAACT 


GCTGCCAAGT 


CATAACGACG 


TGGGTCAAAG 


AAGCGAGCTA 


CAAGCAAGCT 


13260 


ACGTGAGCTT 


TCAGCCGTCT 


TAGGCTCACC 


TGGACGAAGG 


CGTTCGTAAA 


TTTCTTTCAA 


13320 


GGCTTCGTCT 


GTACGAGAGT 


CCATTGGATT 


CTTGTGGATA 


TCTTTTTCAA 


CAGTGTTGCG 


13380 


AACCAATTCG 


CTGTCACCAA 


AGATATCAAA 


GATTTCATCA 


TCACCTGAGA 


AACCAAGAGC 


13440 


ACGAACCAAG 


GTTGTAAATG 


GAATCTTACG 


AGTACGGTCG 


ATACGAGTGT 


AGGTGATATC 


13500 


TTTTGAGTCG 


CTTTCAAGTT 


CCAACCAAGC 


TCCACGGTTA 


GGGATAACAG 


TTGAACCATA 


13560 


GCCCACCTTA 


CCATTTTTGT 


CTACTTTGTC 


GTTAAAGTAA 


ACACCTGGTG 


AGCGGACCAA 


13620 


CTGAGAAACG 


ATAATACGTT 


CACCACCATT 


GATGATGAAA 


GTACCCATTT 


CTGTCATGAT 


13680 


TGGGAAATCA CCAAAGAAAA 


CTTCTTGGGT 


CTTGATTTCG 


CTTGTTTCTT 


TATTGATCAA 


13740 


ACGGAAGGTT 


ACAAAAATTG 


GTGCTGAGTA 


GCTAGCATCG 


TGGATACGAG 


CTTCTTCTAG 


13800 


CGTATATTTT 


GGTTCCTTGA 


TTTCATATCC 


AACAAATTCC 


AACTCCATTG 


TGTCTGTGAA 


13860 


GTTTGAAATT 


GGCAATACAT 


CTTCAAACAC 


TTCCTTAAGA 


CCGTGGTCTA 


GGAAAGCTTT 


13920 
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GAATGAGTCA GTTTGAATTT CAATCAAATT TGGTAAGTCA AGAACTTCTT TGATTCTTGA 13980 

AAAACTACGA CGGGTACGAT GTTTCCCGTA TTGAACGTCA TGTCCTGCCA AGATGATTCT 14040 

CCTTTGTAAA TAAGTTCCAA GCCTTGTCAA TCAGGCTTTT CTAATCGTCA TATGGTTGTA 14100 

AACCCCTTAT CACCGTGTCC TCTTGACGAA TTTTCAGAAT CTTTAAGCCT CTGTTACAAA 14160 

TGCTCAAAAT CTTGAAAAAA AGCACAAAAA GAGCAGCTAA ATCTGACTTT TTCAGAAGAT 14220 

TTAACTGCTG TGAGCCTTGT CTGGACAATA TTTCAGACAA AACCTACGAC AAATGATTAC 14280 

CCATATTATA CCCTATTTAG CTAGATTTTT CAAGGGGTTT CAGTAGGTTT TTGGTAAATT 14340 

TTTTCCCATA GAAAACTTGG CATCACATTC GAATCACGCT ATGGTACAAA AAACTGAAAA 14400 

AACTATTGAC TGAAAATCAT TTTCAAGGTA TAATAATAAA CGTTAAGGCG GTATAGCCAA 14460 

GTGGTAAGGC ACGGCTCTGC AAAAGCTTGA TCGTCGGTTC AAATCCGTCT ACCGCCTTCT 14520 

ATAACTTGAT TTATCAGGTT TCAAATGAAC AGAAAGCCCA ATTTGAAGGG CTTTTTTTAT 14580 

TTTCCCTCGA ATAAATACGT ATAACTTTAA AAACTTTTGG AGCGAGTTTG TGGCAGAGTT 14640 

CTTTCCATGG CATAATTCCC TTTTGAAATC AG 14672 
(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7902 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 

AGGAGACTAT TCAAGCCCAA ATTgAGTAGC CCAGCAAAGA CTGTATAGAC TGTGATACGT 60 

TTTTCATAGC CATTGGTAAA GAGAATTTGG GAACCAAGAA TGGTATCTAA GGCCAGGATA 120 

ATCGTACGAA AAGCGAAGAG AGAGGTCAAG ATGCCGCCTC CGATATATTT TTCACTACCG 180 

TAAAGTAGGA TGGCATTTGG TCCTAAAACC ATGAGTCCAA AACTCAGTGG AATGATAAAG 240 

AAGTTAAAGA TTCGACTACC TCTATTAACC AGAGAAACAT AGGCTTCTTT GTCTCCTTTC 300 

CCCAGATAGT AACTGAGACG AGGCACACTC ACTCCAATTG CACCTGTTAC AACCCCAGCT 360 

ATAACGGTCA CAATTCGCTG AGCTATGGTA TAGTAACTAA CGTTGACATC AATCCCTGTT 420 

TTAACGAGGA AGAGGCGATC TAAAAAAGTG AAGAGCATAT TGGCATTGGC AAAGACTAAC 480 

ATGGCTGTCA GAGGGAGAAA GAGTGGTTTA AAATCACTTA GGTGAATTTT AACAAGTTTG 540 

ATGTCTCTTT TAATCCAAAA ATAACTAATC AGGTAGTTAA TCAGCGTCGA TAAACTCATC 600 

ACAAGTGTAT AGACAACAAT ATCGTGTTCA TTTTTAACAA ATAAGAAAAT AGAGACCAGC 660 
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ATCAGGATAC GGATGAAGGC AGTTTTGTAA AAGAGAAAAC TGTAATTTTC CAGAGCTTCA 720 

TTGACCCATT CGATTGAAAA AATCTGGGCA ATGAGTTGAA TCCCCATAAC AAGGTAGACC 780 

TTTTTGACGA TTGGATTATC AGTAAAGAAG AGAGGATAGG CTAGGATATA GACAGCAGTG 840 

GTCAAAATCG TACAAGCGAT GCACAAATAA AAAAGACTAG AAAAGGTTCT GTTAAGATCT 900 

TTTTTGTTAT CCTTGACATT ACTGATAGCC CTTAAACCGT AGTTATAGAC ACCATAAGTT 960 

GCAAAGGGCA AGAAAAATGA CAAAATAGTG TCGACTGAGT TGAAGTAACC ATAGTCAGTT 1020 

CGGTCGAAGA CACGCGCGAC ATAGGTTCCA GTTAGGATGG GAAAAATAAT ATTCAAGACA 1080 

CGAATTCCCA TGTAAGATAG AGCATTTAAT TTTATACTTT TCATTCAATT TACCTCGTTT 1140 

TTCATTATAT CATAAAGTTA GCTAATAAGA AATGAAGGGC AGTAAGTCAA GTAATCACTT 1200 

TGAAGTTTCA AATCTTAAGT TTTAAGTTTT CTTTAAGGAA AGTATATTAT TCTGAAGGAC 1260 

TCTAAAATTT CGCAGCCATT TATTAGTAAT TGCTACAGAA TTCCTAGTCA TTACTAGAAA 1320 

TGGACTAGTT TCTTTGAATA ATAGAACTGC ATAATTCTCC TATTCTAGAA GGGGAGGACC 1380 

AGTATTTCTT TTATGATAGG ACTAGATTGT GGTATAATAG AGAGAATAAG TTTTTTTAGT 1440 

AAGACAAAGG AGAAAATAGA TGATTTATGC AGGAATTCTT GCCGGTGGAA CTGGCACACG 1500 

CATGGGGATC AGTAACTTGC CAAAACAATT TTTAGAGCTA GGTGATCGAC CTATTTTGAT 1560 

TCATACAATT GAAAAATTTG TCTTGGAGCC AAGTATTGAA AAAATTGTAG TTGGTGTTCA 1620 

TGGAGACTGG GTTTCTCATG CAGAAGATCT TGTAGATAAA TATCTTCCTC TTTATAAGGA 1680 

ACGTATCATC ATTACAAAGG GTGGTGCTGA CCGCAATACA AGTATTAAGA ACATCATTGA 1740 

AGCCATTGAT GCTTATCGTC CGCTTACTCC AGAGGATATC GTTGTTACCC ACGATTCTGT 1800 

TCGTCCATTT ATTACACTTC GCATGATTCA GGACAATATC CAACTTGCCC AAAATCATGA 1860 

CGCAGTGGAC ACAGTGGTAG AAGCGGTTGA TACTATCGTT GAAAGTACCA ATGGTCAATT 1920 

TATTACAGAT ATTCCAAATC GTGCTCACCT TTATCAAGGA CAAACACCTC AAACATTCCG 1980 

TTGCAAGGAC TTCATGGACC TTTATGGATC TCTTTCTGAT GAAGAGAAGG AAATCTTGAC 2040 

AGATGCATGT AAAATCTTTG TGATCAAAGG AAAAGATGTG GCTTTGGCCA AAGGTGAATA 2100 

CTCAAATCTG AAGATTACAA CCGTAACAGA TTTGAAGATT GCAAAAAGTA TGATTGAGAA 2160 

AGACTAGTAA AATGATTAAT CAAATTTATC AACTAACTAA GCCTAAGTTT ATCAATGTCA 2220 

AATATCAGGA AGAGGCTATT GACCAAGAGA ATCATATCCT TATCCGTCCC AACTACATGG 2280 

CTGTCTGTCA TGCGGATCAG CGTTACTATC AGGGAAAACG TGATCCCAAG ATTTTGAATA 2340 

AAAAGCTTCC AATGGCAATG ATTCACGAGT CATGTGGAAC CGTCATTTCT GACCCGACCG 2400 
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OAACCTACGA 


GGTTGGTCAA 


AAAGTTGTCA 


TGATTCCCAA 


TCAGTCTCCT 


ATGCAGAGTG 


2460 


7VCTV* JV » * mm 

ATGAAGAATT 


CTATGAAAAC 


TACATGACAG 


GGACCCATTT 


CTTGTCTAGT 


GGATTTGATG 


2520 


GCTTTATGAG 


AGAGTTTGTT 


TCTCTCCCTA AAGATCGTGT 


GGTGGCTTAT 


GATGCTATTG 


2580 


AAGATACGGT 


TGCAGCCATT 


APAGAGTTTfl 


1 Lnli 1 Vj 1 vjvjvj 


CATGCACGCT 


ATGAATCGTC 


2640 


TATTGACTCT 


TGCTCATAGC 


AAGPfiflttArtT 
ftnuv. uuunu v. 




TATTGGAGAT 


GGAAGTTTAG 


2700 


CTTTTGTGGT 


TGCCAATATT 




rxrtrrujif*r*f+ a a 
LI 1 1 (jlA. Av>A 


AGCAGAGATT 


GTGGTTATTG 


2760 


GTCGTCATTG 


GGAAAAGTTG 


GAACTCTTCT 


CATTTGCCAA 


AGAATGCTAT 


ATTACGGATA 


2820 


ATATTCCTGA 


AGATTTGGCC 


TTTGACCATG 


CTTTTGAATG 


TTGTGGTGGT 


GATGGTACTG 


2880 


GACCAGCTAT 


TAATGACTTG 


ATTCGCTACA 


TTCGTCCTCA 


GGGAACGATT 


CTCATGATGG 


2940 


GAGTTAGCGA 


ATATAAAGTC 


AAlUit-AArA 


CTCGCGATGC 


CTTAGAAAAG 


GGCTTGATTT 


3000 


TGGTTGGGTC 


ATCTCGTTCT 


\J\3 I CuvJA 1 l\j 


ATTTTGAAAA 


TGCTATCCAA 


ATGATGGAAG 


3060 


TCAAGAAATT 


TGCCAATCGT 


(-11 AAAAAT A 


TCCTTTATCT 


AGAAGAACCT 


GTAAGAGAAA 


3120 


TTAAAGATAT 


TCATCGTGTC 


1 i i vjv_~rV/i.l_.L.o 


A 1 I I AAAC AC 


AGCCTTTAAA 


ACAGTGTTTA 


3180 


AGTGGGAAGT 


ATAAGTACTG 




(aTGGAGAAAA 


TCATTAAAGA 


AAAAATTTCT 


3240 


TCCTTACTTA 


GTCAAGAAGA 




AGT. gttgaac 


AACTGGGTGG 


AATGACCAAT 


3300 


CAAAACTATT 


TGGCCAAAAC 


AAPA&ATAAG 




TTAAATTCTT 


TGGTAAAGGG 


3360 


ACAGAAAAGC 


TTATCAATCG 


A<**AAGATG A A 




TTGAACTACT 


AAAGGATTTA 


3420 


GGCTTAGATG 


TAAAAAATTA 


TCTTTTTGAT 

A W A. £ A. A x un A 


ATTGAAGfTG 


GTATCAAAGT 


AAATGAGTAT 


3480 


ATCGAATCTG 


CGATTACGCT 


TGATTCAACG 


TCAATCAAGA 


CCAAGTTCGA 


CAAAATTACT 


3540 


CCAATATTAC 


AAACTATTCA 


TACGTCTGCT 


AAGGAATTAA 


GAGGAGAATT 


TGCTCCTTTT 


3600 


GAAGAAATCA 


AAAAATACGA 


ATCCTTGATT 


GAAGAACAAA 


TTCCTTATGC 


CAACTATGAA 


3660 


TCTGTTAGAA 


ATGCAGTCTT 


CTCCTTAGAG 


AAAAGACTGG 


CTGACTTAGG 


TGTTGACAGA 


3720 


AAATCTTGTC 


ATATCGATTT 


GGTGCCTGAA 


AACTTTATCG 


AATCACCTCA 


AGGACGACTT 


3780 


TATTTGATTG 


ACTGGGAATA 


TTCATCAATG 


AATGATCCAA 


TGTGGGATTT 


GGCTGCCCTC 


3840 


TTTTTAGAGT 


CTGAATTCAC 


TTCCCAAGAG 


GAAGAAACTT 


TCTTATCTCA 


CTATGAGAGT 


3900 


GACCAAACAC 


CGGTTTCTCA 


TGAAAAGATT 


GCTATTTATA 


AAATTTTACA 


AGATACTATT 


3960 


TGGAGTCTAT 


GGACTGTCTA 


TAAGGAAGAG 


CAAGGTGAAG 


ATTTTGGTGA 


CTATGGTGTG 


4020 


AATCGTTACC AAAGAGCTAT 


TAAAGGTTTG 


GCTTCTTATG 


GAGGTTCAGA 


TGAAAAGTAA 


4080 


AAACGGAGTT 


CCTTTTGGCC 


TTCTCTCAGG 


TATTTTCTGG 


GGCTTGGGTC 


TAACGGTTAG 


4140 


TGCTTATATC 


TTTTCGATTT 


TTACAGATTT 


GTCACCCTTT 


GTGGTGGCTG 


CAACTCATGA 


4200 
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TTTTTTGAGC 
CTCAATTTTC 
TATCGGTATG 
TGTATCGGCT 
GATTTCGAAA 
' CTATAAGGTT 
TATTGCATGG 
AATCGAAGCC 
CTTCTCTCAT 
TTTTGCAGCC 
ACCAGCCAAG 
TGTTTTCTTG 
TGGAGTTTAT 
GCGGGATTGG 
GTTAATCAAA 
GACATCATCA 
GGTGTTCGTC 
CTTGTAAAAG 
AATATGTTCC 
ACCAACGAAT 
AGCAAGGCAG 
ATTGTCAGCT 
AATATGGTTA 
AGCATTTATG 
GAAAATTAAA 
TTTTACGAAT 
ATGAAATCTA 
AATACGGTGA 
GCCGTAATCC 



ATCTTTATCT 
TTAAATATTC 
CAGGCCAATC 
ATTTACCCTG 
AATACTGTAT 
GAACAGGTTA 
GGAAGTGAGA 
CTCTTAATCC 
CAGTCATTTA 
TTTGATATGA 
GCTACAGGCT 
GGTGCACCGC 
ATTATTATTA 
GAACTCGCTT 
AACCTTTGAT 
TCATTGTTGG 
TCGTTTTCAA 
AAGAATTGGC 
GCAATGATTT 
GGTTCTTGGT 
GTCGCATCCT 
TTATCGACAA 
AGGATAATAT 
AGATCGATAG 
GATTCCAACA 
AGATAGATGA 
TAAGTTAAAA 
AAATGTTGAT 
AGCCGTTTTT 



TACTAGCTTT 
GCAATGTCAG 
TTTATGCAGT 
CGATTTCAGT 
TTGGGATTGT 
ATTCTTTCTA 
GTGTTCTTAG 
GTCAAGTAAC 
CTGCAGTAGC 
TTTCCTACTT 
TGAACGTGAG 
TAGATATGCT 
AAGAATAAAG 
GCGTCCTATG 
TGAGTACCAA 
TTATCTTAAA 
TGATAAATAC 
CAACAGCTAT 
GACACGTTCG 
TTATGGAGAT 
TAGTGGTGTA 
GGCTTATGTA 
CAAAGAGCTA 
TGTCCAAGAC 
TCTGACAAAA 
GTAGAAAAAG 
AAATCAGGTT 
CAGGAGCTTT 
ATGGAACGTT 



817 
TCTCTTGGTA 
TGTTATCATC 
TAAGTATATC 
TCTATTGGCT 
CTTGATTATT 
CATTGGGATT 
CTCTTTTGCC 
TTCGTTCTTG 
CAATGGACAA 
GGCTTATTAT 
CTATGTAGTA 
GACCATTATG 
GAGATTCGTG 
ACTGAAAATA 
ATTGAGTTTC 
GAACAATTCG 
GCTGACTACA 
GTTATTGATG 
ACTTATTTTA 
GACTACAAGG 
TCCTTCTGGG 
AGTGGTGAAT 
GATGTCTATG 
TATCGTAAAT 
TAGTCGGATG 
AAATGGAGTT 
TGACCAATCA 
TGTTGGGTGA 
ATTTTCAGAT 



AAAGAAGGGA 
GGAGCCTTGC 
GGAAGTTCTT 
TTCTTCTTTT 
GGAGGGATTA 
CTTTGTGCTT 
ATGGAAAGTG 
TCCTATCTTG 
TTGCTAGGTC 
ATCGCTATCA 
TGGACGGTCT 
ACGTCACTTG 
TGAAAGCCAT 
CCCCTAAAGC 
TCAAAGAAAA 
ATTACTTGAA 
ATAACTTTTA 
CTGACAATTA 
GTGTTTATCG 
TTCAAGACAT 
ATGCTCCAAC 
TTGTTGATCT 
TTGAAGAATT 
TAGAAGAAAT 
TTTTTTGATT 
ATTTATGAAA 
ACAGATTTTG 
TATTGCAGAT 
AGACGATGCG 



AAGTTCGCCT 
TAGCAGGCCC 
TAGCTTCATC 
TGAAGCACAA 
TTGCTCAGAC 
TGGTTTGTGC 
AATTGAGTGA 
TGATTGTGCT 
TCATGATTGT 
ATCGCTTGCA 
TGTTTGCAGT 
TCGTCATTGC 
TATCTTAGCA 
CTTGGTTCAG 
AGGAATCAAT 
AGAGAAATAC 
CTCTCTCTAT 
TCTCTTTAAA 
TGAAGATTGT 
TATTGTTGAT 
TGCAGAAAAG 
CTATTGGGAC 
AGAAGGCAAT 
TCTTAAAAAC 
TTTTACGAAC 
ATCACAAACT 
AAAGTGCTAG 
ATCTCAGGTT 
CATTTGTCGA 



4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 
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AAC3 ACTTTT a 


a a a RfiMTirv^iv 
AAAAlllLCA 


TCTTTCTCTA TTTTAGATGA 


CTGTTATCCT 


TGGGATTTGA 


6000 


GTGAAATATft 
vj i w/vu\ inln 


IXjAI Gv-GCl- 1 


GTACTTTTAT 


TTTACAAGGG 


AAATCTTGAC 


CTCCTGAAAT 


6060 


TPPPf^a artr*T 


AGLGGTCGTG 


GGCAGTCGTG 


CTTGTAGCAA 


ACAGGGAGCT 


AAGTCAGTTG 


6120 


a a a & & fiv^ a t> 

/UinAAo 1 l_ A 1 


TCAAGGCTTG 


GAAAATGAAC 


TGGTTATTGT 


CAGTGGTCTG 


GCCAAGGGCA 


6180 


1 1 vjAt_Av~AGL. 


AGCTCATATG 


GCAGCTCTTC 


AGAATGGCGG 


AAAAACCATT 


GCAGTGATTG 


6240 


GAACAGGACT 


GGATGTGTTT 


TATCCTAAAG 


CCAATAAACG 


CTTGCAAGAC 


TACATCGGCA 


6300 




GGTTCTAAGT 


GAATATGGAC 


CTGGTGAACA 


ACCTCTGAAA 


TTTCATTTTC 


6360 


1 vjL.v_CG I AA 


TCGCATCATT 


GCTGGACTTT 


GTCGTGGTGT 


GATTGTAGCA 


GAGGCTAAGA 


6420 


TGCGTTCAGG 


TAGTCTCATT 


ACGTGTGAGC 


GAGCAATGGA 


AGAAGGACGC 


GATGTCTTTG 


6480 


CTATTCCTGG 


TAGCATTTTA 


GATGGACTAT 


CAGACGGTTG 


CCATCATTTG 


ATTCAAGAAG 


6540 


tjAGCAAAATT 


GGTCACCAGT 


GGGCAAGATG 


TTCTTGCGGA 


ATTTGAATTT 


TAAAAATGAC 


6600 


C f AAGCTAGA 


ATTCTAAGAA 


AAAATCAATT 


TTAAGAGAAA ATGAACCCAA CATTTCCATA 


6660 


A I AAAACGCA 


TATTAGCAAG 


TTTTTAACAC 


TTGATAATAT 


GCGTTTTTTC 


TAAGTGGATT 


6720 


A*j I AGAGT AG 


AGGATTTTTC 


TCATATAATA 


CTCTTCGAAA 


ATCTCTTCAA 


ACTACGTCAG 


6780 


CTTCCATCTG 


CAACCTCAAA 


ACAGTATTTT 


GAGCgaCTtC GTCAGTCTTA TCTACAACCT 


6840 


t-AAAGCAGTG 


CTTTGAGCAA 


CCTGTGGCTA 


GCTTCCTAGT 


TTGCGCTTTG 


ATTTTCATTG 


6900 


AG i AT AAGGG 


AAAGTATAGT 


GAATTGAAAT 


AAGATGTGAA 


CAACTCTATC 


AGGAAAGTCA 


6960 


IV IV i'i u 1 1 x x rtvnm « 

AA i^TAATTTA 


TAGAAATATT 


TTAGCAGCCA 


AGGTGTACTG 


TTATAGATTC 


AATTACACTA 


7020 


1 AA 1 1 1 AGTG 


TAATTGAGAA 


AGGAGAAATG 


ATTGTGATTG 


ATGTTGGCTA 


GGTTATGTTC 


7080 


AA I\»ATTCCT 


ACCGTCTCAA 


ATCTTGTCAG 


TAAGGAAAAA 


TAAATTCTTC 


AAAAGTAGAG 


7140 


x rnrn x /■* x 

A I IACAAGGC 


TTGTTTAAGA 


AAGAATTCAA 


AGACCTTGAC 


AAATAAAAAT 


AAAATGGTTA 


7200 


T*pa m a a a a a a 
1 XA1AAAAAA 


TGGTCTGAAA 


TAGATGATGA 


TACTTTTCGA 


AAATCTCTTC 


AAATACGTCA 


7260 


GCTCAGCTTT 


GCCTTGCTGT 


GTTTTGAGCA 


AGCTACGGTT 


AGCTTCCGAG 


TTTGATTT^C 


7320 


ATTTACTAGA 


AATGAAACTG 


ATGAGAGATA 


TCAGTAGACA 


TTTGAGTCAG 


GATATTATGG 


7380 


AAAATGATAA 


AAAGAGCTCG 


TGAGATTGGC 


ATATCAGACT 


ACTAAAGTAT 


TGAGTTTGTT 


7440 


AGGATTTTAG 


CGACTAGTTA 


GCTGGGAAAG 


GAAGATATTT 


GTGACAAATA 


ATAAACTGTA 


7500 


TTCGTTGATA 


GAATTTAGAA 


ATAAAATATA 


TGAAGAATTA 


GAACTTTCCA 


GAAGTGATTT 


7560 


AGCGATTTTA 


CTATGTGCCA 


TGCTTATCGC 


CTCTATCGGA 


TTAAATATGG 


ATTCGACTCC 


7620 


CGTGATTATT 


GGAGCCATGT 


TAATCTCTCC 


TTTGATGACA 


CCTATTCTGG 


GAGTGGGGCT 


7680 


CTCTCTAGCT 


ATATTTGATT 


TTAAATTGTT 


AAGAAAATCT 


TTTAAAATAT 


TAGCTATTCA 


7740 
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AATTCTTGCC AGTCTAATAG CTTCAACACT TTATTTTTAT CTTTCTCCCA TTTCGTATGC 7800 
TAGTTCGGAG ATTGTTGCTA GAACCTCTCC GACTATTTGG GATGTTCTCA TTGCTTTTGT 7860 
AGGAGGGATA GCAGGTATCA TTGGTGCTAG GAAAAAAGAG AC 7902 
(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18627 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 113: 



GAAGTTGAAA 


TGGCCAGCTG 


ATGAGCAATA 


TCGGTCATAG 


AAATCTTCTC 


AATCAACTTT 


60 


TGCGCAATTT 


TTTGGTTGAT 


AATACGAGGA 


ATTTGGTGAT 


TTTTCTTGAC 


GATAGAAGTT 


120 


TCAGCGACCA 


TCATTTTTGA 


ACAGTGATAG 


CACTTGAAAC 


GACGCTTTCT 


AAGTAGAATT 


180 


CTAGTAGGCA 


TACCAGTTGT 


CTCAAGGTAA 


GGAATCTTAG 


ACGGTTTTTG 


AAAGTCATAT 


240 


TTCTTCAATT 


GGTTTCCGCA 


CTCAGGGCAA 


GATGGGGCGT 


CGTAGTCCAG 


TTTGGCGATG 


300 


ATTTCCTTGT 


GTGTATCTTT 


ATTGATGATG 


TCTAAAATCT 


GGATATTAGG 


GTCTTTAATG 


360 


TCTAGTAATT 


TTGTGATAAA 


ATGTAATTGT 


TCCATATGAA 


TCTTTCTAAT 


GAGTTGTTTG 


420 


GTCGCTTTTC 


ATTATAGGTC 


ATATGGGACT 


TTTTTTCTAC 


AATAAAATAG 


GCTCCATAAT 


480 


ATCTATAAGG 


GATTTACCCA 


CTACAAATAT 


TATAGAGCCA 


AAAATCCTTT 


GTTTACTAAA 


540 


CAAGGGATTT 


TTCTTTTGTC 


TCTGCTCCTT 


TTTTGATATA 


ATAGTTCTAT 


GTTAAAATCA 


600 


GAAAAACAAT 


CACGTTATCA 


AATGTTAAAT 


GAAGAATTGT 


CCTTCCTATT 


GGAAGGCGAA 


660 


ACCAATGTTT 


TGGCTAATCT 


TTCCAACGCC 


AGTGCTCTCA 


TAAAATCACG 


TTTTCCTAAT 


720 


ACCGTATTTG 


CAGGCTTTTA 


TTTGTTCGAT 


GGAAAGGAAT 


TGGTTTTAGG 


CCCCTTCCAA 


780 


GGAGGTGTTT 


CCTGCATCCG 


TATTGCACTA 


GGCAAGGGTG 


TTTGTGGTGA 


GGCAGCTCAC 


840 


TTTCAGGAAA 


CTGTTATTGT 


TGGAGATGTG 


ACGACCTATC 


TCAACTATAT 


TTCTTGTGAT 


900 


AGTCTAGCTA AAAGTGAAAT 


TGTGGTGCCG 


ATGATGAAGA 


ATGGTCAGTT 


ACTTGGAGTT 


960 


CTGGATCTGG 


ATTCTTCAGA 


GATTGAGGAT 


TACGATGCTA 


TGGATCGAGA 


TTATTTGGAA 


1020 


CAATTTGTCG 


CTATTTTGCT 


TGAAAAGACA 


GCATGGGACT 


TTACGATGTT 


TGAGGAAAAA 


1080 


TCTTAATGTA 


TCAAGCACTT 


TATCGAAAAT 


ATAGAAGTCA 


AAACTTCTCC 


CAGTTAGTTG 


1140 


GTCAAGAAGT 


TGTGGCTAAG 


ACTCTTAAAC 


AAGCGGTGGA 


GCAAGAGAAA 


ATAAGTCACG 


1200 
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CTTATCTTTT TTCTGGTCCT CGTGGAACGG GAAAAACCAG TGTTGCTAAA 


ATCTTTGCCA 


1260 


AGGCTATGAA CTGTCCCAAT CAAGTGGGTG GCGAACCTTG CAATAACTGC 


TATATTTGTC 


1320 


AAGCAGTGAC GGACGGTAGT TTAGAAGATG TCATTGAAAT GGATGCAGCT 


TCTAATAATG 


1380 


GGGTAGATGA AATTCGCGAA ATTCGTGATA AATCTACCTA TGCGCCTAGC 


CTTGCTCGTT 


1440 


ATAAGGTTTA TATCATAGAT GAGGTTCACA TGCTGTCTAC AGGGGCTTTT 


AATGCCCTCC 


1500 


TAAAGACGCT GGAAGAACCA ACACAGAATG TAGTCTTTAT TTTGGCCACT 


ACTGAATTGC 


1560 


ACAAGATTCC TGCTACTATT CTATCCCGTG TGCAACGTTT TGAGTTTAAA 


TCAATTAAGA 


1620 


CACAGGATAT TAAGGAACAT ATTCACTATA TCTTAGAAAA AGAAAATATC 


AGTTCTGAAC 


1680 


CAGAGGCTGT GGAAATCATT GCCAGACGGG CGGAAGGTGG AATGCGGGAC 


GCCTTGTCTA 


1740 


TTTTGGATCA AGCCCTGAGT TTGACACAGG GAAATGAGCT GACGACTGCT 


ATCTCTGAAG 


1800 


AAATTACTGG CACCATTAGC CTATCAGCCT TGGATGATTA TGTGGCGGCC 


TTGTCTCAAC 


1860 


AGGATGTTCC CAAAGCTTTG TCTTGCTTGA ATCTTCTTTT TGACAATGGT 


AAGAGCATGA 


1920 


CTCGTTTTGT GACCGATCTT TTGCACTATT TAAGAGACTT GTTAATTGTT 


CAAACAGGGG 


1980 


GAGCAAATAC TCATCATAGT TCAGTCTTTG TAGAAAATTT GGCACTTCCT 


CAAAAAAATC 


2040 


TGTTTGAAAT GATTCGCTTA GCAACAGTGA GTTTAGCAGA TATTAAGTCT 


AGTTTGCAAC 


2100 


CCAAGATTTA TGCTGAAATG ATGACCGTCC GTTTGGCGGA AATCAAGTCC 


GAACCAGCTC 


2160 


TATCAGGAGC GGTTGAAAAT GAAATTGCTA CGCTGAGACA GGAAGTTGCC 


CGTCTCAAAC 


2220 


AAGAGCTTTC TAATGTAGGT GCGGTTCCTA AACAAGTTGC ACCAGCTCCT 


AGTCGACCAG 


2280 


CTACGGGCAA AACAGTCTAT CGTGTCGATC GCAATAAAGT GCAATCTATC 


TTACAAGAGG 


2340 


CCGTCGAAAA TCCTGATTTA GCACGTCAAA ATTTAATTCG TTTGCAGAAT 


GCCTGGGGAG 


2400 


AGGTAATTGA AAGTCTAGGT GGGCCGGACA AGGCTCTGCT AGTTGGTTCT 


CAACCGGTTG 


2460 


CTGCCAATGA ACACCATGCT ATTCTTGCTT TTGAGTCTAA CTTCAATGCT 


GGTCAAACTA 


2520 


TGAAACGAGA CAATCTCAAT ACCATGTTTG GTAATATCCT CAGTCAGGCG 


GCAGGTTTTT 


2580 


CACCTGAGAT TTTAGCTATT TCCATGGAGG AATGGAAAGA AGTTCGCGCA 


GCCTTTTCAG 


2640 


CCAAAGCCAA ATCTTCTCAA ACTGAAAAAG AAGTAGAAGA AAGCCTGATT 


CCAGAAGGAT 


2700 


TTGAATTTTT GGCTGATAAA GTGAAGGTAG AGGAAGACTA AAGAAAGATT 


TCATGATACA 


2760 


ATAAGTTTAT GAATAAACAA CAATTTATTA TTATGGCGCT GTTTACAGCT 


GCTGAGACCT 


2820 


ATTTTTTCAA TGAAGCCTGG ATGACTGGCC GCTATATTAT GGCAGCCTTT 


TGGGCAATTT 


2880 


TACTCTTTAG AAATTTCCGA GTCAGTTATG TGATGGGCAA AATCGTTGAT 


GTCATCGATC 


2940 


AGCATTTTAA TAGGAAAGAC TAGCCCTCAG CTTCCAGACA AAATCAAAGC 


CTTTTAGGCT 


3000 
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TTTTTTTGTT 


ATACTAGAAA 


AGTATATTTA 


TAGAATTTTT 


GCTCTATTTC 


TGGGGAAATC 


3060 


AGACGTTTTT 


CTAGTAAGTA 


CTGTAAAAGT 


TTTGAAAAAG 


AAAGGAACTA 


TCATGTCAGT 


3120 


ATTAGAGATC 


AAAGATCTTC 


ACGTTGAGAT 


TGAAGGAAAA 


GAAATTTTAA 


AAGGGGTTAA 


3180 


CCTGACCCTG 


AAAACAGGAG 


AAATTGCCGC 


TATCATGGGA 


CCAAATGGTA CAGGTAAATC 


3240 


GACTCTTTCT 


GCCGCTATCA 


TGGGAAATCC 


AAACTATGAA 


GTAACTAAAG 


GTGAAGTTTT 


3300 


GTTTGATGGC 


GTAAACATCC 


TTGAGTTGGA 


AGTGGATGAG 


CGTGCGCGTA 


TGGGACTTTT 


3360 


CCTTGCTATG 


CAATACCCAT 


CAGAAATCCC 


TGGAATTACC 


AATGCTGAGT 


TTCTTCGTGC 


3420 


CGCTATGAAT 


GCGGGTAAAG 


AAGATGATGA 


GAAGATTTCA 


GTTCGTGAGT 


TTATTACTAA 


3480 


GCTAGATGAA 


AAAATGGAAT 


TGCTCAACAT 


GAAAGAAGAA 


ATGGCAGAGC 


GTTACCTCAA 


3540 


CGAAGGCTTC 


TCTGGTGGTG 


AGAAAAAACG 


CAATGAAATT 


CTTCAACTTT 


TGATGTTGGA 


3600 


GCCAACATTT 


GCTCTTTTGG 


ACGAGATTGA 


CTCAGGTCTT 


GATATTGACG 


CTCTTAAAGT 


3660 


TGTGTCTAAA GGTGTCAATG 


CCATGCGTGG 


TGAAGGTTTT 


GGTGCTATGA 


TCATCACTCA 


3720 


CTACCAACGT 


CTTTTGAACT ATATCACACC 


TGATGTGGTA 


CACGTGATGA 


TGGAAGGTCG 


3780 


TGTTGTCCTT 


TCTGGTGGTC 


CAGAATTGGC 


TGCGCGTTTG 


GAACGTGAAG 


GATACGCAAA 


3840 


ATTAGCTGAA 


GAACTTGGCT 


ACGACTACAA 


GGAAGAATTG 


TAATTCCCTC 


GTATCTTTTA 


3900 


GGAGAAGTAA 


ATGACTAGAG 


AAAATATTAA 


ACTTTTTTCA 


GAAATGCACG 


CTGAACCAAG 


3960 


CTGGTTGGCT 


GATCTCCGTC 


AAAAAGCTTT 


TGACAAGATT 


GAGACTTTGG 


AATTACCAGT 


4020 


TATTGAGTGT 


GTCAAATTCC ACCGTTGGAA 


TCTGGGTGAT 


GGAACGATTA 


CAGAAAATGA 


4080 


GCCATCAGCA 


AATGTTCCAG 


ATTTCACAGC 


TTTAGATCAT 


CACTTGAAGT 


TGGTGCAAGT 


4140 


AGGAACTCAA 


ACTGTTTTCG 


AACAAACTCC 


AGTTGAGTTA 


GCTGAACAGG 


GTGTTGTCTT 


4200 


CACAGACTTT 


CACTCAGCTT 


TAGAAGAAAT 


TCCAGAGCTG 


ATCGAAGAAT 


TCTTCATGTC 


4260 


ATCTGTTAAG 


TATGATGATG 


ACAAGTTGGC 


GGCTTACCAC 


ACAGCTTACT 


TTAACAGTGG 


4320 


TGCTGTACTC 


TATATTCCAG 


ATAACGTAGA 


AATCACAGAG 


CCAATTGAAG 


GAATTTTCTA 


4380 


CCAAGATAGC 


GATAGCAATG 


TGCCGTTTAA 


CAAGCATATT 


ATGATTATCG 


TTGGTAAAAA 


4440 


TTCTAAGATT 


AGTTATCTGG 


AGCGTTTAGA 


GTCACGCGGT 


GAAGGAAGTG 


ACAAAGCAAC 


4500 


TGCCAATATC 


ACAGTGGAAG 


TGATTGCACG 


TTCTGGTGCG 


CAAGTCAAGT 


TTGCTGCTAT 


4560 


CGACCGTCTA 


GGTGAAAACG 


TCACTGCCTA 


CATTAGCCGT 


CGTGGTAAAT 


TAGGCAACGA 


4620 


TGCAAGTATT 


GACTGGGCTA 


TCGGTGTCAT 


GAACGAAGGA 


AATGTCGTTG 


CTGATTTTGA 


4680 


TAGTGACTTG 


ATTGGTAATG 


GTAGCCATGC 


TGACCTCAAG 


GTTGTAGCTC 


TTTCAAGTGG 


4740 
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TCGTCAGGTA CAAGGGATTG ATACTCGTGT AACTAACTAT GGCTGCAACT CAATCGGAAA 4800 

CATTCTACAA CATGGGGTTA TCCTTGAAAA AGCAACTTTG ACTTTCAATG GTATCGGCCA 4860 

CATCATCAAG GGTGCTAAGG GAGCAGATGC GCAACAAGAG AGCCGTGTTC TCATGCTTTC 4920 

AGACCAAGCG CGTTCAGATG CTAACCCAAT TCTTTTGATT GATGAAAATG ACGTAACTGC 4980 

AGGCCATGCA GCCTCTATTG GTCAGGTAGA TCCAGAAGAT ATGTACTACC TCATGAGTCG 5040 

TGGCTTGGAT AAGGCAACTG CAGAGCGTTT GGTTGTTCGT GGTTTCCTTG GATCTGTTAT 5100 

CGTGGAGATT CCAGTCAAGG AAGTTCGTGA TGAAATGATT GCAACTATCG AAGAGAAATT 5160 

GTCAAAACGC TAAGGGGCAG CCTATGTTAG ATGTAGAAGC GATTCGCAAG GATTTTCCAA 5220 

TTTTAGATCA GATTGTCAAT GATGAACCTC TGGTCTATCT GGACAATGCT GCGACGACAC 5280 

AAAAACCACT AGTAGTTCTG AAAGCTATTA ACAGCTACTA TGAGCAGGAC AATGCCAATG 5340 

TTCACCGTGG TGTCCATACC TTAGCGGAAC GAGCGACAGC TTCTTATGAA GCTGCTCGTG 5400 

AAACCATTCG TAAGTTTATT AATGCAGGCT CTACAAAGGA AGTTCTCTTT ACCAGAGGAA 5460 

CGACAACCAG CCTTAACTGG GTGGCACGCT TTGCTGAGGA AATTCTCACT GAGGGAGACC 5520 

AGGTCTTGAT TTCAGTAATG GAACACCATT CTAATATCAT TCCATGGCAG GAAGCTTGTC 5580 

GAAAGACTGG AGCAGAGCTT GTCTATGTCT ATCTTAAAGA CGGTGCCTTG GATATGGAGG 5640 

ATTTGCGAGC TAAATTGACT GATAAGGTTA AATTTGTTTC CCTAGCTCAT GCCTCCAATG 5700 

TTCTTGGTGT GGTCAATCCG ATCAAGGAAA TCACTCAATT AGCCCACCAA GTTGGGGCAA 5760 

TTATGGTAGT GGATGGTGCT CAATCTACAC CTCATATGAA GATTGATGTC CAGGACTTGG 5820 

ATCTGGACTT TTTCGCCTTT TCGGGTCACA AGATGGCTGG TCCGACTGGT ATCGGTGTCC 5880 

TTTACGGCAA AGAAAAGTAT CTTGAGCAAA TGTCTCCAGT AGAATTTGGC GGCGAGATGA 5940 

TTGATTTTGT CTACGAGCAA TTTGCTAGTT GGAAGGAATT GCCTTGGAAA TTTGAGGCTG 6000 

GAACGCCAAA TATGGCAGGA GCTATTGGAC TTGCGACTGC AGTTGATTAT CTGGAAAAGA 6060 

TTGGTATGGA TGCCGTTGAA GCTCATGAAC AGGAATTGAT TGCGTACGTC TATCCAAAAC 6120 

TGCAGGCAAT TGAGGGATTG ACCATTTACG GTTCTCAGGA TTTGGCTCAA CGTTCGGGTG 6180 

TTATTGCCTT TAACCTAGGT GATCTCCATC CTCACGATCT TGCGACGGCT CTGGATTATG 6240 

AAGGAGTGGC TGTTCGTGCT GGTCACCATT GTGCGCAACC CTTGCTTCAG TATTTGGAAG 6300 

TCCCAGCAAC AGCTCGTGCA AGTTTTTATA TCTACAATAC CAAGGCAGAT TGCGACAAAC 6360 

TAGTCGATGC CCTACAAAAG ACAAAGGAGT TTTTCAATGG CACTTTCTAA ACTAGATAGC 6420 

CTTTATATGG CAGTGGTAGC AGACCATTCG AAAAATCCAC ATCACCAAGG GAAGTTAGAA 6480 

GATGCTGAGC AAATCAGTCT CAACAATCCG ACTTGTGGGG ATGTCATCAA CCTCTCTGTC 6540 
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AAGTTTGATG 


CAGAGGACCG 


TTTGGAAGAT 


ATTGCTTTTC 


TAAATTCAGG 


ATGCACGATT 


6600 


TCAACTGCTT 


CTGCTAGTAT 


GATGACAGAT 


GCCGTTTTAG 


GAAAAACCAA 


ACAAGAAATT 


6660 


TTAGAACTGG 


CGACTATTTT 


TTCTGAAATG 


GTTCAAGGGC 


AAAAAGATGA 


GCGTCAAGAC 


6720 


CAACTTGGAG 


ACGCGGCATT 


CTTGTCAGGT 


GTTGCCAAAT 


TCCCTCAAAG 


AATCAAGTGT 


6780 


GCAACCCTAG 


CTTGGAATGC 


CCTTAAGAAA 


ACAATTGAAA 


ATCAAGAAAA 


ACAGTAAGAC 


6840 


AAGTTTCTTT 


TGTCTTATGA ATTATTAGAA ATGAAGAAAG 


AAAGGATACT 


ATGGCTGAAG 


6900 


AAAGAGTAGA ACCAAAACCA ATTGACCTTG 


GTGAATATAA 


ATTTGGTTTC 


CATGACGATG 


6960 


TAGAGCCTGT 


CTTATCGACA 


GGAAAAGGAC 


TCAACGAAGG 


TGTTATTCGT 


GAATTATCTG 


7020 


CTGCTAAGGG 


TGAGCCTGAG 


TGGATGTTGG 


AGTTCCGTTT 


GAAGTCTTAT 


GAAACCTTCA 


7080 


AAAAAATGCC 


CATGCAAACT 


TGGGGAGCAG 


ACTTGTCAGA 


GATTGACTTT 


GATGACTTAA 


7140 


TCTACTACCA AAAACCATCT 


GACAAACCAG 


CCCGTTCTTG 


GGATGATGTA 


CCTGAAAAGA 


7200 


TTAAAGAAAC 


CTTTGAACGT 


ATCGGGATTC 


CAGAAGCTGA 


ACGTGCTTAT 


TTAGCAGGGG 


7260 


CTTCTGCCCA 


GTACGAGTCA 


GAAGTGGTTT 


ACCACAACAT 


GAAGGAAGAG 


TTCCAAAAAT 


7320 


TAGGTATTAT 


CTTTACAGAT 


ACAGATTCCG 


CACTCAAGGA 


ATACCCAGAC 


TTATTTAAAC 


7380 


AATACTTTGC 


GAAGTTGGTA 


CCGCCGACAG 


ATAACAAGTT 


GGCAGCCCTC 


AACTCAGCAG 


7440 


TATGGTCGGG 


TGGAACTTTT 


ATCTACGTGC 


CAAAAGGTGT 


CAAGGTAGAT 


ATTCCACTTC 


7500 


AAACTTATTT 


CCGTATCAAT 


AACGAAAATA 


TAGGTCAGTT 


CGAACGTACC 


TTGATTATCG 


7560 


TTGATGAGGG 


AGCAAGCGTC 


TACTACGTAG 


AAGGATGTAC 


AGCACCAACA 


TATTCAAGCA 


7620 


ATAGCTTACA 


CGCTGCCATT 


GTAGAAATTT 


TTGCTTTGGA 


CGGAGCTTAT 


ATGCGTTATA 


7680 


CAACTATCCA 


AAACTGGTCT 


GATAACGTCT 


ATAACTTGGT 


AACAAAGCGT 


GCTAAGGCTC 


7740 


AAAAGGATGC 


CACTGTTGAG 


TGGATTGATG 


GAAACTTGGG 


TGCCAAAACG 


ACTATGAAAT 


7800 


ATCCATCTGT 


TTACCTTGAT 


GGAGAAGGAG 


CGCGTGGTAC 


CATGCTCTCT 


ATCGCCTTTG 


7860 


CTAATGCAGG 


GCAACACCAA 


GACACGGGTG 


CTAAGATGAT 


TCACAATGCT 


CCACATACCA 


7920 


GCTCGTCTAT 


TGTGTCTAAA 


TCCATCGCTA 


AAGGTGGAGG 


AAAGGTTGAC 


TACCGTGGAC 


7980 


AAGTCACCTT 


TAACAAGAAC 


TCTAAGAAAT 


CTGTTTCCCA 


CATTGAATGT 


GATACCATTA 


8040 


TCATGGATGA 


CTTGTCAGCA 


TCAGATACTA 


TTCCATTTAA 


TGAAATTCAC 


AACTCGCAAG 


8100 


TGGCTTTGGA ACACGAAGCC 


AAAGTATCTA 


AGATTTCAGA 


AGAGCAATTG 


TATTATCTCA 


8160 


TGAGCCGTGG 


ATTGTCAGAA 


TCTGAGGCAA 


CTGAAATGAT 


TGTCATGGGA 


TTTGTAGAAC 


8220 


CCTTTACAAA 


AGAACTTCCA 


ATGGAATACG 


CAGTTGAGCT 


GAACCGCTTG 


ATTAGCTATG 


8260 
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AAATGGAGGG 


ATCAGTTGGA 


TAAAATTTGA 


TTTTATACTC 


TTCGAAAATC 


TCTTCAAACC 


8340 


ACGTCAGCAT 


CGCCTTACCG 


TATGTATGGT 


TwCTGAtTCG 


TCAGTTTCAT 


CTACAACCTC 


8400 


AAAACAGTGT 


TTTGAGCAAC 


tGCGGCTAGC 


TTCCTAGTTT 


GTTCTTTGAT 


TTTGAGTATT 


8460 


AGATTTACTC 


AAAATCAAGG 


ATTTTGAAGA 


TGAACTTGTA 


TCAAAAAATC 


GCGGTTTAAA 


8520 


ATCGCGATTT 


TTTATAATTT 


CTCGTTAACA 


AAGCGGACAA 


ACTGATTCCA 


CCAAACTTTT 


8580 


AAGAAGAAGG 


CTTTTTCAAT 


TTTCTTGTCT 


GCTACCATTT 


CGAAACTAGG 


GCGCTCTGTG 


8640 


GTGATGTAAC 


CTTGACCAAT 


CAAGTCCTTG 


TCTTCATAAG 


TCAAATGGCC 


AACCACTGTT 


8700 


CCAGCTTCAA 


GTGGTGCTGG 


GATTGCTTTG 


GAATCAGGTG 


TGAATTGAAC 


AGATTGGGAA 


8760 


GATTGATTCC 


CAACACGTTC 


GATTAGATAG 


ATATCCTCTG 


GAGCCACTGC 


AGTTACTGTA 


8820 


TCTTCTTTTC 


CATCTTGTAC 


AGGGGCTTTG 


CTATCTTGAT 


AGGCATCGCC 


TTGTTGAACG 


8880 


ATTTTGCGAA 


GTGTAAATGT 


AGAAGAAATA 


TAATCCATTA 


GGGAAGATGT 


AGCTGTAAAT 


8940 


CGAGCGTAAG 


GATTATTGTC 


TTGATGATCT 


GCATTTAAAA 


CAACTGTGAT 


GACTCTCATG 


9000 


CCTTTTTCGA 


CAGTAGTACC 


AACAAAAGAC 


TCTCCAGCCT 


TATCTGTTGT 


TCCTGTTTTT 


9060 


AGCCCATCAA 


AACCACCACG 


GTAAGCAGGC 


ATACCTTCTA 


ACATGTAGTT 


GGTTGAAGTG 


9120 


ATTGTCATCC 


CAGCAAAAGT 


AGAAGAAGGT 


TTTTTGGTGA 


TTTCTAAGAC 


TTGTGGGTAT 


9180 


TTTTTGATGA GGTTGCGAGC AACGATAGCG ACATCATAAG 


CACTAAGCTT 


ATTTTCCTCA 


9240 


TCTTTTTTAG 


AACCTGGGTA 


AATGTTATCC 


CCTAGAGTTT 


CATTGTTAAG 


ACCTGTCGTA 


9300 


TTGACAACAG 


TGGCATCCTG 


AATTCCCCAT 


TCCAAGAGTT 


TTGCCCGCAT 


CATATCGACG 


9360 


AAATCTTTTT 


CTGAGCCAGC 


AATTTTCTCA 


GCTAGGGCAA 


TAGCGGCGCT 


GTTGGCACTA 


9420 


GATACCAGAG 


TTGCTTCAAG 


CAACTCTTCG 


ACAGTATAAT 


TACGGGCCTC 


CATAGGAATA 


9480 


TTACTGGCTT 


CAGAATTTGT 


CGTCAATTGA TAAGGATAAT 


CAGAAATATC 


TACAGGAGTG 


9540 


GAGAGGGTAA TACTTCCGTT TTCCAAAGCT TCATAGACCA GATAAACAGT AATCAATTTT 


9600 


GTTATGGAAG 


CAATTTCGAC 


AGGTTGCGTT 


GCATCCTTCT 


CATAGAGAAT 


TTTACCAGTA 


9660 


TTTGCCTCAA 


CAGCAATCGC 


ATGTTTAGCG 


GCAATGGTAA 


AATCTTGAGC 


AACAGCAGTA 


9720 


GAAGCACCCC 


CTAAAAGAGA 


GACAGTTAAC 


AAAGTTAAAA 


ATATTTTTTT 


CATAGTAGTC 


9780 


TTATTCTATC 


ATAAAGAAAA 


AAAATATTCT 


TGCTTTAATA 


ATTCATCTGT 


TAAGCTTTTT 


9840 


GAAAATATGG 


TAAAATAAAG 


TAAGGGAGGT 


AACTCATGTT 


TCGTAGAAAT 


AAATTATTTT 


9900 


TTTGGACCAC 


AGAAATTTTA 


CTCTTAACCA 


TCATCTTTTA 


CCTATGGAGA 


CAGATGGGGT 


9960 


CTTTGATTAA 


CCCTTTTGTT 


AGCGTGCTTA 


ATACAATTAT 


GATTCCATTT 


TTATTAGGGG 


10020 


GCTTTTTTTA 


TTATTTGACA 


AACCCTATTG 


TTACTTTCTT 


AAATAAAGTC 


TGTAAACTCA 


10080 
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ATCGTTTGCT 


TGGTATTTTA 


ATTACCTTGT 


GTACTTTGGT 


CTGGGGAATG 


GTCATAGGTG 


10140 


TTGTCTATCT 


CTTACCTATT 


TTGATTAATC 


AGTTATCTAG 


TTTGATTATA 


TCTAGTCAAA 


10200 


CTATTTATAG 


TCGAGTACAA 


GACTTAATCA 


TAGACTTATC 


TAATTATCCT 


GCGCTCCAGA 


10260 


ATTTGGATGT 


AGAAGCTACA 


ATTCAGCAGT 


TAAACTTATC 


CTATGTTGAT 


ATTCTTCAAA 


10320 


ATATCCTAAA 


TAGCGTATCA 


AATAGTGTGG 


GGAGCGTCTT 


GTCAGCTCTT 


ATCAGTACTG 


10380 


1 TTTTGATTTT 


GATTATGACT 


CCAGTTTTTT 


TGGTTTATTT 


CTTATTAGAT 


GGACATAAAT 


10440 


TCTTGCCCAT 


GCTTGAAAGA 


ACGATTCTAA 


AGAGGGATCG 


CTTGCATATT 


GCAGGCTTAT 


10500 


TAAAGAATTT 


AAATGCGACG 


ATTGCTCGCT 


ATATTAGTGG 


AGTTTCGATT 


GACGCAATCA 


10560 


TTATAGGTTG 


TTTGGCTTAT 


ATTGGCTATA 


GTATTATTGG 


TTTAAAATAT 


GCTTTAGTTT 


10620 


TTGCCATTTT 


TTCTGGTGTA 


GCCAATTTAA 


TTCCTTATGT 


GGGGCCAAGT 


ATTGGTTTGA 


10680 


TTCCTATGAT 


CATCGCAAAT 


ATATTCACTG 


ATCCCCATAG 


ACTGCTGATT 


GCAGTGATTT 


10740 


ATATGCTTGT 


TGTTCAGCAG 


GTAGATGGCA 


ATATCTTATA 


TCCTCGAATC 


GTAGGAAGTG 


10800 


TTATGAAGGT 


TCATCCAATC 


ACGATTTTAG 


TTTTACTTTT 


GTTGTCAAGC 


AATATCTATG 


10860 


GTGTAGTTGG 


AATGATTGTC 


GCAGTGCCAA 


CCTATTCTAT 


CTTGAAAGAA 


ATTTCTAAGT 


10920 


TCTTATCCCA 


TTTGTATGAA 


AATCATAAAA 


TAATGAAAGA 


ACGAGAAAGA 


GAATTAGCTA 


10980 


AGTAAAAGTC 


AGGAGAACCC 


TGATTTTTCT 


TTACTGGAAG 


TGGCCTTTAG 


ATTAGAAGAC 


11040 


TGAAAATAAG 


TTAAAGTCTT 


AAACTAATTT 


TCACAGCTAA 


GAATAGTAGA 


AGTTAATCTG 


11100 


ATAAAAATCG 


AAAAAACCAG 


TGGAATTCTG 


TGTCAGGGTA 


AGTTCCACTG 


GTTTTCATAG 


11160 


TCTATTAAAG 


TTCGAATGAA 


ACCTATTTAT 


AGTAGATTGA 


AACTAGAATA 


GTACACCTCT 


11220 


AATTCTAAAA 


CATTGTTAGA 


AATCGATTTG 


ACTGTCCTGA 


TCTATTCGTT 


CTATTCTTAT 


11280 


TTCATTTTAC 


TATATTTTGG 


TGCAATAAGT 


GAAAAGTAGT 


CCGAATAATA 


TAAGGATTOA 


11340 


TTTTATAGTT TTTAAACTCA 


AATGAATTGA 


AATAAAGAGA 


GTACGAAAAT 


TCTCATCTGA 


11400 


AAGTATTTTA 


GAATAATTCT 


CTTCGTGAAT 


TTCTTCAAAA 


CAGATAGCTT 


CATCTTAGGT 


11460 


ATGTGATTTC 


TTTTTGCATT 


TTTGAGTTAG 


ATAAGGTATA 


ATGATTTTAT 


TGTCTTTTGG 


11520 


GGTCGTTACG 


GATTCGACAG 


GCATTATGAG 


GCATATTTTG 


CGACTCGTGT 


GGCGACGTAA 


11580 


ACGCTCAGTT 


AAATATAACT 


GCAAAAAATA 


ACACTTCTTA 


CGCTCTAGCT 


GCCTAAAAAC 


11640 


CAGCAGGCGT 


GACCCGATTT 


GGATTGCTCG 


TGTTCAATGA 


CAGGTCTTAT 


TATTAGCGAG 


11700 


ATACGATTAA 


GCCTTGTCTA 


GCGGTTTGAT 


AAGAGATTGA 


TAGACTCGCA 


GTTTCTAGAC 


11760 


TTGAGTTATG 


TGTCGAGGGG 


CTGTTAAAAT 


AATACATAAC 


CTATGGTTGT 


AGACAAATAT 


11820 
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GTTGGCAGGT 


"ill uunLu J. 


GGGTTCGACT 


CCCACCGGCT 


CCATTATTCC 


TTTGCATTCT 


11880 


TTTGCATTCC 


TTGfiTAAAAP 


GTTGTTAAAT 


CAACGTTTTT 


TATTTTTATC 


TTTGGTATTC 


11940 


CTTTGCATTP 

^— All. VJ^« i* 1 S V* 


111 luLlAAA 


AAGGGAGTCA 


CAAACAGACC 


CTATTTTAAA 


AAAGGATAGA 


12000 


AAAAA(3fiATA 


LAALA 1 i l\j 1 


CGCATCCTAA AAATAATCTT TTTTCGACGG AAGACATGGG 


12060 


rtl 1a_V»AAv_LL. 


ACGCACGCTA 


TTACACGCCT 


ACCGCGTTTC 


CAACACGGCC 


TCTTAAGCCT 


12120 


L, 1 iuAb 1 AA 1 


CTTCCAATAC 


TTACTCAAAT 


AGTCTACCAT 


AAAGGCTCTT 


ATCTTGCAAT 


12180 


rUi/i/vii ILIA 


TV A 7V rn TV Jk^TV TV 

(j AAA I AAGAA 


AAATGATAGA 


TTTTGAAAGA 


AAATGATAAA 


AAATGCTTGA 


12240 


Li 1 L-LjAAALjA 


AAGTATGATA 


GAATGAATAG 


TGTAAACGAT 


AACAGGAGGT 




12300 


TAAAAACAGA 


ACGTAAACAA 


CTAATTTTAG 


AGGAGTTAAA 


TCAACATCAT 




12360 


m tv /*< tv tv t\ 7i tv mm 
lAuAAAAAI 1 


AGTTAGTTTG 


CTAGAAACGT 


CAGAATCAAC 


GGTTCGAAGA 


GACTTGGATG 


12420 


AO JrTGGAAGC 


GGAAAACAAG 


CTTCGTCGTG 


TGCATGGTGG 


AGCAGAACTC 


CCCTACTCCT 


12480 


1 AL, AL»L» AALtA 


AGAAACCATT 


CAAGAAAAAT 


CTGTCAAAAA 


CCTTCAAGAA 


AAGAAATTGC 


12S40 




AGCAGCCTCT 


CTCATTAAAG 


AAAAAGATGT 


CATCTTTATC 


GATGCTGGAA 


12600 


LAAL.AACTGC 


TTTTTTGATT 


CATGAATTGG 


TCAATAAGAA 


TGTTACAGTT 


GTGACCAACT 


12660 


CCATTCACCA 


TGCCGCTCAG 


TTGGTTGAAA 


AGCAGAwTCC 


AACTGTCATG 


GTTGGAGGAA 


12720 


ACGTCAAGAC 


GGCGACAGAT 


GCTAGTATCG 


GGGGCGTTGC 


TCTTAACCAG 


ATTAACCAAT 


12780 


TGCACTTTGA 


CCGTGCCTTT 


ATCGGAATAA 


ATGGTGTTGA 


CGATGGCTAT 


TATACGACTC 


12840 


CTGATATGGA 


GGAGGGAGCT 


GTGAAAAGAG 


CTATTTTGGA 


GAATGCCAAG 


CAGACCTACG 


12900 


TCTTGGTGGA 


TTCGTCAAAA 


ATTGGACAAA 


CTTGCTTTGC 


CAAGGTAGCC 


CCACTCAAAC 


12960 


GCGCTATCGT 


TATCACTAGT 


CAAGGGCATG 


AGCTCTTGCA 


GGTTATTAAG 


GAGAAAACGG 


13020 


AGGTAATAGA 


AGTATGATTT 


ATACAGTCAC 


ACTCAATCCA 


TCCATTGACT 


ATATCGTTCG 


13080 


TTTGGACCAA GTCAAAGTTG 


GTAGTGTCAA 


TCGTATGGAC 


AGTGATGATA 


AGTTTGCTGG 


13140 


TGGGAAAGGA 


ATCAATGTCA 


GCCGTGTCTT 


GAAACGTTTG 


AATATACCAA 


ATACAGCGAC 


13200 


GGGATTTATC 


GGTGGCTTTA 


CTGGTAAATT 


TATCACAGAT 


ACTTTAGCAG 


AGGAAGAAAT 


13260 


CGAGACACGT 


TTTGTCCAGG 


TGGCAGAAGA 


TACTCGTATC 


AATGTTAAAA 


TCAAAGCAGA 


13320 


CCAAGAAACA 


GAAATCAACG 


GAACGGGTCC 


AACTGTTGAA 


TCGGTTCAGC 


TAGAAGAATT 


13380 


GAAAGCTATT 


TTATCTAGTC 


TGACAGCAGA AGATACAGTT 


GTCTTTGCAG 


GTTCAAGTGC 


13440 


TAAAAATCTA 


GGCAATGTTA 


TCTATAAGGA 


TTTGATTTCC 


TTGACGCGCC 


AGACTGGTGC 


13500 


GCAAGTGGTC 


TGTGACTTTG 


AAGGACAGAC 


CTTAATTGAT 


AGTTTGGACT 


ACCAGCCTCT 


13560 


TCTTGTAAAA CCAAACAATC 


ATGAACTTGG 


AGCGATTTTT 


GGGGTTAAAC 


TCGAAAGTTT 


13620 
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AGATGAAATT GAGAAATACG CTCGTGAGTT ACTGGCTAAG GGTGCTCAAA ATGTTATTAT 13680 

CTCTATGGCT GGTGATGGTG CCCTTCTTGT CACATCTGAG GGAGCTTACT TCGCTAAACC 13740 

AATCAAAGGA ACAGTCAAAA ATTCAGTTGG AGCTGGTGAT TCTATGGTTG CTGGATTCAC 13800 

AGGTGAATTT GTCAAATCAA AAGACGTAGT AGAAGCCTTC AAATGGGGAG TGGCTTGCGG 13860 

AACGGCAACT ACCTTCTCAG ATGACTTGGC AACGGCGGAA TTTATTAAAG AAACATATGG 13920 

AAAAGTTGAG GTAGAAAAAC GATGAAAATT CAAGACCTAT TGAGAAAAGA TGTCATGTTG 13980 

CTAGATTTGC AGGCAACTGA AAAAACAGCT GTCATCGACG AGATGATTAA AAATTTGACA 14040 

GACCACGGTT ATGTAACAGA TTTTGAAACA TTTAAAGAAG GAATTTTGGC GCGTGAAGCT 14100 

TTGACTTCTA CTGGTTTGGG TGATGGAATC GCAATGCCTC ACAGCAAAAA CGCTGCTGTC 14160 

AAAGAAGCGA CAGTTCTATT TGCTAAGTCA AATAAGGGTG TTGACTACGA GAGCTTGGAT 14220 

GGACAAGCAA CTGACCTCTT CTTCATGATT GCAGCTCCAG AAGGTGCCAA TGATACTCAC 14280 

TTGGCAGCCT TGGCAGAATT GTCTCAATAC TTGATGAAAG ACGGTTTTGC AGACAAACTT 14340 

CGTCAAGCAA CATCTGCAGA CCAAGTTATC GAACTTTTTG ACCAAGCTTC AGAAAAAACT 14400 

GAGGAACTTG TTCAAGCACC TGCTAATGAC TCTGGTGACT TTATCGTAGC TGTTACAGCT 14460 

TGTACAACAG GTATTGCCCA CACTTACATG GCCCAAGAAG CCCTTCAAAA AGTAGCTGCT 14 520 

GAAATGGGGG TTGGTATCAA GGTCGAAACC AACGGTGCTA GCGGTGTTGG AAATCAACTA 14580 

ACTGCAGAAG ATATCCGTAA GGCTAAAGCT ATTATCATTG CAGCAGACAA GGCCGTTGAA 14640 

ATGGATCGAT TTGATGGAAA ACCATTGATC AATCGTCCAG TTGCTGACGG TATCCGTAAG 14700 

ACAGAAGAGC . TAATTAACTT GGCTCTTTCA GGAGATACTG AAGTCTACCG TGCCGCTAAT 14760 

GGTGCCAAAG CTGCAACAGC CTCTAACGAA AAACAAAGCC TTGGTGGTGC CTTGTACAAA 14820 

CACTTGATGA GTGGTGTATC TCAAATGTTA CCATTCGTTA TCGGTGGTGG TATCATGATT 14880 

GCCCTTGCCT TCTTGATTGA CGGTGCTTTG GGTGTTCCAA ATGAAAACCT TGGCAATCTT 14940 

GGTTCTTACC ATGAGTTAGC TTCTATGTTC ATGAAAATTG GTGGAGCTGC CTTTGGTTTG 15000 

ATGCTTCCAG TCTTTGCGGG TTATGTTGCC TACTCTATTG CTGAAAAACC GGGTTTGGTA 15060 

GCAGGTTTCG TGGCTGGTGC TATTGCCAAA GAAGGTTTTG CCTTTGGTAA AATTCCTTAT 15120 

GCCGCAGGTG GTGAAGCAAC TTCAACTCTT GCAGGTGTCT CATCTGGTTT CCTAGGTGCC 15180 

CTTGTTGGTG GATTTATCGC AGGTGCCTTG GTTCTTGCCA TCAAGAAATA CGTTAAAGTT 15240 

CCTCGTTCAC TCGAAGGTGC TAAATCAATC CTTCTATTGC CACTTCTTGG AACAATCTTG 15300 

ACAGGATTTG TTATGCTAGC TGTGAATATC CCAATGGCTG CAATCAACAC TGCTATGAAT 15360 
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GACTTCCTAG 


GCGGTCTTGG 


AGGAGGTTCA 


GCTGTCCTTC 


TTGGTATCGT 


CCTTGGTGGA 


15420 


ATGATGGCTG 


TTGACATGGG 


TGGACCAGTT 


AATAAAGCAG 


CTTATGTCTT 


TGGTACAGGT 


15480 


ACGCTTGCAG 


CAACTGTTTC 


TTCAGGTGGT 


TCTGTAGCCA 


TGGCAGCAGT 


TATGGCTGGA 


15540 


GGAATGGTGC 


CACCACTTGC 


AATCTTTGTC 


GCAACTCTTC 


TTTTCAAAGA 


TAAATTTACT 


15600 


AAGGAAGAAC 


GTAACTCTGG 


TTTGACAAAC 


ATCATCATGG 


GCTTGTCATT 


TATCACTGAG 


15660 


GGAGCGATTC 


CATTTGGTGC 


CGCTGACCCA 


GCTCGTGCGA 


TTCCAAGCTT 


CATCCTTGGT 


15720 


TCAGCAGTAG CAGGTGGACT 


CGTTGGTCTT 


ACTGGTATCA 


AACTCATGGC 


GCCACACGGA 


15780 


GGAATCTTCG 


TTATCGCCCT 


TACTTCAAAT 


GCTCTCCTTT 


ACCTCGTTTC 


TGTCTTGGTA 


15840 


GGAGCAATCG 


TAAGTGGTGT 


GGTTTATGGT 


TACCTACGCA 


AACCACAAGC 


ATAAAAAATA 


15900 


GAAAAATGAA AAGATTGGAC 


CGTTTGGTGC 


AGTCTTTTTC 


TCTTCCCGAA 


ATGCCTGTGA 


15960 


AATATGGTAT 


AATAGAAGAA 


TGGCAAACAA 


GAATACAAGT 


ACAACAAGAC 


GGAGACCGTC 


16020 


TAAAGCAGAA 


CTGGAAAGAA 


AAGAAGCGAT 


TCAACGAATG 


TTGATTTCGT 


TAGGAATTGC 


16080 


GATTTTATTG 


ATTTTCGCAG 


CCTTCAAATT 


AGGGGCTGCA 


GGTATAACCC 


TTTATAATTT 


16140 


AATTCGCTTG 


CTAGTGGGTA 


GCCTAGCTTA 


TCTGGCGATA 


TTCGGCCTAT 


TAATCTATCT 


16200 


CTTCTTTTTC 


AAGTGGATAC 


GAAAACAGGA 


AGGACTCTTA 


TCTGGCTTTT 


TCACCATATT 


16260 


TGCTGGCTTA 


CTCTTGATTT 


TTGAGGCCTA 


CTTGGTTTGG 


AAATATGGTT 


TGGACAAGTC 


16320 


CGTTCTAAAA GGGACCATGG 


CTCAGGTTGT 


GACAGATCTG 


ACTGGTTTTC 


GAACGACTAG 


16380 


CTTTGCTGGA 


GGGGGCTTGA 


TCGGGGTCGC 


TCTTTATATT 


CCAACAGCCT 


TTCTCTTTTC 


16440 


AAATATCGGA 


ACTTACTTTA 


TTGGTTCTAT 


CTTGATTTTA 


GTGGGTTCTC 


TCCTAGTCAG 


16500 


CCCTTGGTCT 


GTTTACGATA 


TTGCTGAATT 


TTTCAGTAGA 


GGCTTTGCCA 


AATGGTGGGA 


16560 


AGGGCACGAG 


CGTCGAAAAG 


AGGAACGCTT 


TGTCAAACAA 


GAAGAAAAAG 


CTCGCCAAAA 


16620 


GGCTGAGAAA 


GAGGCTAGAT 


TAGAACAAGA 


AGAGACTGAA 


AAAGCCTTAC 


TCGATTTGCC 


16680 


TCCTGTTGAT 


ATGGAAACGG 


GTGAAATTCT 


GACAGAGGAA 


GCTGTTCAAA 


ATCTTCCACC 


16740 


TATTCCAGAA 


GAAAAGTGGG 


TGGAACCAGA 


AATCATCCTG 


CCTCAAGCTG 


AACTTAAATT 


16800 


CCCTGAACAG 


GAAGATGACT 


CAGATGACGA 


AGATGTTCAG 


GTCGATTTTT 


CAGCCAAAGA 


16860 


AGCCCTTGAA 


TACAAACTTC 


CAAGCTTACA 


ACTCTTTGCA 


CCAGATAAAC 


CAAAAGATCA 


16920 


GTCTAAAGAG 


AAGAAAATTG 


TCAGAGAAAA 


TATCAAAATC 


TTAGAAGCAA 


CCTTTGCTAG 


16980 


CTTTGGTATT 


AAGGTAACAG 


TTGAACGGGC 


CGAAATTGGG 


CCATCAGTGA 


CCAAGTATGA 


17040 


AGTCAAGCCG 


GCTGTTGGTG 


TAAGGGTCAA 


CCGCATTTCC 


AATCTATCAG 


ATGACCTCGC 


17100 


TCTAGCCTTG 


GCTGCCAAAG 


ATGTCCGGAT 


TGAAGCACCA 


ATCCCTGGGA 


AATCCCTAAT 


17160 
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CGGAATTGAA 


GTGCCCAACT 


CCGATATTGC 


CACTGTATCT 


TTCCGAGAAC 


TATGGGAACA 


17220 


ATCGCAAACG 


AAAGCAGAAA ATTTCTTGGA 


AATTCCTTTA 


GGGAAGGCTG 


TTAATCGAAC 


17280 


CGCAAGAGCT 


TTTGACCTTT 


CTAAAATGCC 


CCACTTGCTA 


GTTGCAGGTT 


CAACGGGTTC 


17340 


AGGGAAGTCA 


GTAGCAGTTA 


ACGGCATTAT 


TGCTAGCATT 


CTCATGAAGG 


CGAGACCAGA 


17400 


TCAAGTTAAA 


TTTATGATGG 


TCGATCCCAA 


GATGGTTGAG 


TTATCTGTTT 


ACAATGATAT 


17460 


TCCCCACCTC 


TTGATTCCAG 


TCGTGACCAA 


TCCACGCAAA 


GCCAGCAAGG 


CTCTGCAAAA 


17520 


GGTTGTGGAT 


GAAATGGAAA 


ACCGTTATGA 


ACTCTTTGCC 


AAGGTGGGAG 


TTCGGAATAT 


17580 


TGCAGGTTTT 


AATGCCAAGG 


TAGAAGAGTT 


CAATTCCCAG 


TCTGAGTACA 


AGCAAATTCC 


17640 


GCTACCATTC 


ATTGTCGTGA 


TTGTGGATGA 


GTTGGCTGAC 


CTCATGATGG 


TGGCCAGCAA 


17700 


GGAAGTGGAA 


GATGCTATCA 


TCCGTCTTGG 


GCAGAAGGCG 


CGTGCTGCAG 


GTATCCACAT 


17760 


GATTCTTGCA 


ACTCAGCGTC 


CATCTGTTGA 


TGTCATCTCT 


GGTTTGATTA 


AGGCCAATGT 


17820 


TCCATCTCGT 


GTAGCATTTG 


CGGTTTCATC 


AGGAACAGAC 


TCCCGTACGA 


TTTTGGATGA 


17880 


AAATGGAGCA 


GAAAAACTTC 


TTGGTCGAGG 


AGACATGCTC 


TTTAAACCGA 


TTGATGAAAA 


17940 


TCATCCAGTT 


CGTCTCCAAG 


GCTCCTTTAT 


CTCGGATGAC 


GATGTTGAGC 


GCATTGTGAA 


18000 


CTTCATCAAG 


ACTCAGGCAG 


ATGCAGACTA 


CGATGAGAGT 


TTTGATCCAG 


GTGAGGTTTC 


18060 


TGAAAATGAA 


GGAGAATTTT 


CGGATGGAGA 


TGCTGGTGGT 


GATCCGCTTT 


TTGAAGAAGC 


18120 


TAAGTCTTTG 


GTTATCGAAA 


CACAGAAAGC 


CAGTGCGTCT 


ATGATTCAGC 


GTCGTTTATC 


18180 


AGTTGGATTT 


AACCGTGCGA 


CCCGTCTCAT 


GGAAGAACTG 


GAGATAGCAG 


GTGTCATCGG 


18240 


TCCAGCTGAA 


GGTACCAAAC 


CTCGAAAAGT 


GTTACAACAA 


TAAAAAAATA 


GCTTCTTTCC 


18300 


AAGTTTGGAG 


GGAAGCTATT 


TTAGTGGCTA 


TTGATTGCTT 


TTATTTTCTG 


AAGTTGGCGC 


18360 


ATTGGACTGT 


TTTTCGTTTT 


CAGTAGCAGG 


TTTACTTGAA 


GCAGGAGTAG 


AAGAGTCCTG 


18420 


AGTTGCTGTT 


TTCTGATCTT 


CTTTTTTCTC 


TTCCTTGACG 


CTAGATTTTG 


GTGTTTCCTC 


18480 


TTGCTGTGTT 


TTTTCTTGAC 


TAGTGTTAGT 


CTCTTTAGTT 


GGACTGGTGT 


TTTCCTTAGG 


18540 


GGATTCCTTT 


TGGATTTCTT 


TGACAATGGT 


TGTCGTCTGG 


CTTGTCGTAG 


GTTCTTTTTT 


18600 


AATATTTTTG 


TTATTATCCA 


AGGCGTT 








18627 


(2) INFORMATION FOR SEQ ID NO: 114: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2560 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 

TAAAATACGT TACCTTGCTT CTGCACGTTC AGCAGGTAAG TCATTGAAAT TTAAAGATCA 60 

AGATATTACA ATTGAAGAAA CGACTGAAAC AGCTTTTGAA GGAGTTGATA TTGCTCTCTT 120 

TTCAGCAGGT AGTTCTACAT CAGCTAAGTA TGCACCATAC GCAGTAAAAG CTGGCGTGGT 180 

AGTAGTAGAT AATACATCTT ATTTCCGTCA AAATCCAGAT GTTCCTTTGG TTGTTCCAGA 240 

GGTCAATGCT CATGCACTTG ATGCTCACAA CGGAATCATT GCCTGCCCTA ATTGTTCAAC 300 

AATTCAAATG ATGGTGGCTC TTGAGCCGGT TCGCCAAAAA TGGGGCTTGG ACCGTATCAT 360 

TGTTTCAACT TATCAAGCCG TTTCAGGTGC TGGTATGGGA GCAATTCTTG AGACACAACG 420 

TGAACTTCGT GAAGTCTTGA ATGATGGTGT GAAACCACGT GATTTGCATG CGGAAATCTT 480 

GCCTTCAGGT GGTGACAAGA AACATTATCC TATCGCCTTT AACGCTCTTC CACAAATTGA 540 

TGTTTTCACT GATAATGATT ACACGTACGA AGAGATGAAG ATGACCAAGG AAACTAAGAA 600 

AATTATGGAA GATGATAGCA TTGCAGTATC TGCAACATGT GTGCGTATTC CAGTCTTGTC 660 

AGCTCACTCT GAGTCTGTTT ATATCGAAAC AAAAGAAGTG GCTCCAATCG AAGAAGTAAA 720 

AGCAGCTATC GCAGCCTTCC CAGGTGCTGT TCTTGAAGAT GATGTAGCTC ATCAAATCTA 780 

TCCTCAAGCT ATCAATGCAG TTGGTTCGCG TGATACCTTT GTTGGTCGTA TCCGTAAAGA 840 

CTTGGATGCA GAAAAAGGAA TTCACATGTG GGTTGTTTCA GATAACCTTC TCAAAGGTGC 900 

TGCTTGGAAC TCAGTTCAGA TTGCTGAAAC TCTTCATGAA CGTGGATTGG TTCGTCCAAC 960 

AGCCGAATTG AAATTTGAAT TAAAATAGTC ATATCGTTTA GGAGTTCAGA TGAACTCCTT 1020 

CTTTGAAATA GAGAGGTGTT TTCGTGTCTT ATCAAGATTT AAAAAAATGT AAAATCATTA 1080 

CAGCCTTTAT TACCCCCTTC CATGAGGATG GTTCCATTAA CTTTGATGCT ATTCCAGCCT 1140 

TGATTGAGCA TTTATTGGCC CATCATACGG ATGGAATTCT TCTCGCAGGA ACGACTGCTG 1200 

AGAGTCCAAC TTTGACCCAC GATGAGGAGT TGGAGTTGTT TGCGGCTGTA CAAAAGGTTG 1260 

TCAATGGACG CGTTCCTTTG ATTGCGGGTG TAGGTACTAA TGATACGCGT GACTCTATTG 1320 

AGTTTGTCAA AGAAGTAGCG GAATTTGGTG GTTTCGCAGC TGGGCTTGCT ATTGTTCCTT 1380 

ACTACAACAA ACCTTCTCAA GAAGGGATGT ATCAGCACTT TAAGACTATT GCAGATGCTT 1440 

CTGACCTACC AATTATTATC TATAACATTC CAGGGCGTGT AGTTGTCGAA TTGACTCCAG 1500 

AAACCATGCT TCGCTTGGCT GACCATCCAA ATATTATCGG TGTCAAAGAA TGTACTAGCT 1560 

TGGCTAATAT GGCTTACTTG ATTGAGCACA AGCCTGAAGA GTTCTTGATT TATACAGGTG 1620 

AGGATGGAGA TGCTTTCCAT GCCATGAACC TTGGGGCGGA TGGGGTTATT TCTGTTGCCT 1680 
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CTCATACAAA 


TGGGGATGAA 


ATGCACGAGA 


TGTTTACTGC 


GATTGCAGAA 


AGCGATATGA 


1740 


AGAAAGCCGC 


AGCAATTCAG 


CGTAAATTCA 


TTCCTAAGGT 


TAATGCTCTC 


TTCTCTTATC 


1800 


CAAGTCCTGC 


TCCAGTTAAG 


GCAATTCTTA ACTATATGGG 


ATTTGAAGCT 


GGACCCACTC 


1860 


GTCTACCTCT 


TGTTCCAGCA 


CCAGAAGAAG 


ATGCCAAACG 


CATTATCAAG 


GTTGTCGTAG 


1920 


ATGGCGACTA 


CGAAGCAACT 


AAGGCAACTG 


TAACAGGGGT 


CTTAAGACCA 


GATTACTAAT 


1980 


" AAAGACAATA 


AAATCCGGCT 


CTTTGTCAAC 


TGTAGTGGGT 


TGAAGTCAGC 


TAAGCTCGAG 


2040 


AAAGGACAAA 


TTTTGTCCTT 


TCTTTTTTGA 


TATTCAGAGC 


GATAAAAATP 


v>u J i i i i i \jn 




AGTTTTCAAA 


GTTCCGAAAA 


CCAAAGGCAT TGCGCTTGAT 


AAGTTTGATG 


AGATTATTGG 


2160 


TCGCTTCCAA 


TTTGGCGTTT 


GAATAGGGTA 


GTTGAAGGGT 


GTTGACGATT 


TTCTTTTTGT 


2220 


CCTTTAGAAA 


GGTTTTAAAG 


ACAGTCTGAA AAATAGGATG 


AACCTGCTTC 


AGATTGTCCT 


2280 


CAATGAGTCC 


GAAAAATTTC 


TCCGGTTCCT 


TATTCTGAAA 


GTGAAACAGC 


AAGAGTTGAT 


2340 


AGAGCTGATA 


GTGATGTTTC 


AAGTTTTGTG 


AATAGCTCAA 


AAGCTTGTTT 


AAAATCTCTT 


2400 


TATTGGTTAA 


GTGCATACGA 


AAAGTAGGAC 


GATAAAATCG 


CTTATCACTC 


AGTTTACGGC 


2460 


TATCCTGTTG 


AATGAGTTTC 


CAGTAGCGCT 


TGATAGCCTT 


GTATTCGGGA 


TTTTCGATGA 


2520 


AACTGATTCA 


TGATTTGGAC 


ACGCACACGA 


CTCATAGCAC 






2560 



(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11303 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 

TATTGGATTT CCCTTGCAAT CAGTTTATGG GACAAGCACC CGGCAGCGCA GAGGAAATCA 60 

ACGCCTTCTG TAGCCTACAT TTTCAAACCA CCTTCCCACG TTTTGCCAAG ATTAAGGTCA 120 

ACGGTAAGGA AGCAGACCCT CTCTATGTCT GGTTACAAGA CCAGAAATCC GGCCCACTAG 180 

GAAAACGAGT CGAATGGAAT TTCGCTAAGT TTCTCATCGG TCGAGATGGG CAAGTCTTTG 240 

AACGCTTTTC TTCAAAAACA GACCCAAAAC AAATTGAAGA GGCGATACAA ACTCTACTAT 300 

AATTCACAAT CTCACTATGA TTAGGTTTCC TTTAACCTGA TGAATAGTGA GATTTTTTGA 360 

TGGGCTTTGA CTTAAATAGA AAAACACCCC ATGATATGAA ACATGAAGTG TTGTAAAGTC 420 

TATGTTGTAG GTGCTTATTT CACAATTTCA ATGTGACCAG TGATAACGAA TACCATACAG 480 
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AATCTTCATA TACACTAAAC 


AAATGACTTT 


CTAATTATTT 


CAATTAGTTT 


TGGCTAGTAA 


540 


ATATCATTTC CAACAAACGC 


CCTCTCAATT 


CCTTATCCTG 


ATGATGCAAG 


ATATTCATTA 


600 


AGTCATGAGA GTTTTTCGCA 


TTGATGAATT 


GATTTAACAA 


TCTATCTTTT 


AATTCATATG 


660 


GAAGAGAAGC TGTCTTTAGT 


AGTCTAAAAA 


CTTCGTCATT 


TAAAGATGTC 


CTTTTATTAT 


720 


CTTTCCATTC AAATTTAGCT 


GTATCATTCT 


TATTTGGCAA 


TTCAATTATA 


GACACATTCG 


780 


TTCCTTTAAA ATGAATTCTA 


TGTTTTCTAT 


TGCTTGGAAC 


GATACTAGAA 


TCTCCTTGTA 


840 


ATGCTAACTC TACCATTCCC 


ATTTCCCAAT 


CGATTGATAA 


TCTTGTTTTA 


TATCTTTGAC 


900 


CATTTTGATC TTCAAGCATT 


TCAAAAGAAT 


GTTGTTTTCC 


TGGGAATACA 


TACCAATCTA 


960 


CAACTTCAGG TAAATCAACA 


CCCATACCTA 


TCTCAGAACC 


AACCAAGGGA 


ATGATTGCAC 


1020 


CACTTTTTGC AAACACAGGC 


GTAGTCGAGA 


TGTCCCTATA 


AACACTTAAC 


TTCACACCAC 


1080 


CTGTGTATTT TTTCTCTGAA AAGAAGTCAT ACCATTCACC 


TTCAGGGAAC 


CATACATCTA 


1140 


CTTTTGCAGA TTGGAATGTC 


AAATCCATCT 


TTTCTACAAT 


GGGAGCCACC 


ATCAGTTCTG 


1200 


TTCCAAAAAA GTATTGGTTT 


GGAACATTAT 


AGCTCTCATC 


ATTCTCTGGA 


TAGAAATAAT 


1260 


AG ATT GG ACT GATTAATGGG 


GCACCTTCCT 


CATGTGTCTG 


TACATTCATG 


GTATATAGAT 


1320 


AGGGAATCAT CTGATGTCTC 


AAACGAAGGT 


ATTTCTTCAT 


AATCTTAGAT 


GTTGTTTCTG 


1380 


AAAAAAACCA AGGTTCTTTA 


CTATTAAAAG 


GACTTCTAGA 


ACTATGTAAT 


CGAGTAATCG 


1440 


GACTAAAAAC ACCAAACTGT 


AGCCATCTAG 


TTTGTAGCTC 


TTCGTCATAA 


TCCCCCAACA 


1500 


TATGTCCACC GATATCATGA 


CTCCACCAAC 


TATAACCGAT 


ATTAGATGCT 


GTCGCTGTAA 


1560 


AATAGGGTTG AAATCTTAAG 


GAATTCCAAC 


TAATAATAGT 


ATCCCCTGAA 


AAACCAACAG 


1620 


GGTAGCGGTG ACTACCAGGA 


CCTGCATATC 


TTGATAAAAT 


CAAACCACCT 


TCTGCATTTT 


1680 


TACAACTATC CTGATAGTGA 


TAATGGTTTA 


AAAGCCAAAG 


TGGATCTAGC 


ATACCTTGTG 


1740 


TCCCTTGTTG CCAGTCAATC 


CACCAAAAAT 


CTACTCCCTG 


CTTTTCTAGT 


TCATAATGAA 


1800 


CATCTTTAAA GTAGGCTTCC 


CTAAAAGAGG 


GATTAAAAAA 


ATCAAAAATA 


GCAGGTTCTT 


1860 


CTAGTTCTAC ATTTAACCCC 


AACCGTTTTG 


CGATTTGAGG 


ATAAGCTTCT 


TCATAAGCCC 


1920 


GTATCCCATC AGCAGGATGG 


ACATTTAAGG 


AGAGTTTTAG 


CTTTCTATCA 


TGAAGTTGTT 


1980 


GCAATAACTG TTCTGGATTT 


GGTATTAAGT 


TTCTATTCCA 


ACTATATCCT 


GTCCAGCCAC 


2040 


TTCCAAAGCG AGCTGGAATG 


TCAGTTATAT 


GCCAATCCAT 


ATCTAACACA 


CCGATAGATA 


2100 


ATGGAATTTT CTCTGTTTCA 


AATCTGTCTA 


TTAAATCCAA 


GTATTCATCC 


GACGTATAAG 


2160 


GCCAATATCT ACTCCACCAA 


TTGCCTAAAG 


CATATCTTGG 


CAACAAGGGT 


GTTGAACCAG 


2220 


TCAAATGGTA AAAATCTCTG 


ATTGCTCCTC 


TATAATCATG 


CCCATAGGCA 


AAGAAATACA 


2280 
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GGTCAATTTG ATTTTCTCTC TCAATATAAC CAGATTGTTC ATCCCAAATA AATCCTTGAG 2340 

AATCATCCAA TAAGGCTATA CCATTTCGGC TAATAATTCC ATCTTCTAAC GAGATTGCTC 2400 

CATCTGCCTT ATCCAGAGTC CGAGCTGTTC CTTTTAACGT TTCAATAGAT TCACCAAAAT 2460 

ACCAGCGACT ACCATATACG GCAAAATTTC CTTTTAATTC TATAAATAAA TTTTCGGCGT 2520 

TAAATTCTCC TTTATTAAAG TGCAGATGAA AATAGTCCGT CATAATATCT AGTACGTTTG 2580 

ATGTCTCGAT ATAATCTAAC GAAATTTGGC CAAAATCTCT ATTATAGATA AGTTGTGTCG 2640 

TTCTATCCTC AAAACTTCCA GTTTGAGAGT ATTCTAACCT TACTAGCTTG TCTGTTAATA 2700 

CAGAGATTCG ATAAAACTCT CCCTTAAAAA TTTTCAATTT GTTTTCCTCC TTTTATGGTA 2760 

GCATAAAAAC AGAACGCACC ATTTTTGATG CGTTTTTCAT TATTCTGAAT GCAATGTTCT 2820 

ATCTGTTATA TCTATGACAA ATAATAGTCA ATTGAAAAAA TGCAGTGGAC AAAATATCTT 2880 

TTAAGAAACC AAGAGTTTAT TAAAGAGTTA TCACTTTTCA ACTTTTCTAA GCTTATGCAG 2940 

TTGTGAAACA AACTACTTTT AAACTATTAA CTAAGATAGG ATTGATAAAT AATTTCAAAC 3000 

TCTTACTAGC AATCATACGA TATTCAAGCT CACGTGCTTT TTTCCTTCCT GCTTATTTCT 3060 

TAGAACTGAA GAACCCGGAT CGGTATATAA ATTATCCGGA TCAACATAGT CATAAGATTC 3120 

ATAACAGTTG CGCTTCATTA AGTCATCCCC AGAGCAAGAG CTTCATCTCG TAATTTTTCA 3180 

ACATCACTAA CCGTAGGTCG CCATCCTTCA ATCATATTTG TACTTAAAGC ATACCAAACA 3240 

CTCTTAAAAA CGGATCGGTT TTCAAAAGCT ATTCCCATGA TTGTCATCTT TTCTTTATCT 3300 

ATATCTAAGG ACATATGCTA CCTCCTTTAG ATACATTATA CCATGTTTCT CTGTAGCTTT 3360 

TAAAAATTTT ATTTTGTTTG TCATATCTAA GTTTTCAGCA CGCTTATCCT ATTTTATAAG 3420 

CCTCAAACCC AAATATAAAA CGCATTCTTT TTGCTTTTTT ACTATTGTAT CGTATTCTAC 3480 

GATAACATAC TTTACTTTAT TGTTTTTTTA AATAACAGCA GTTCCCTGTT TATCAACTAT 3540 

TCGAACTACT TTCTATTTTG CTTCATACCC TACATAGCGA AAAAATATGA AAAAGCAGAG 3600 

AAGAATATCT TAAAAAGACC TCTTCACTGC TAATATTAAC ACTCATTATT TAAACTATAT 3660 

GGATTCTATC ATCGAGTATA CTTTTTTACT TATTAGATAC CTTGCTCTTC TTTCACCAAT 3720 

TTTTGATCAT ATACACGGAT GAATGGAAGA TAGACTAGGA ATGCTGCAAA TGCACATACT 3780 

AGAGCAACTA ATACAGCTCG AAGATCTGCT GTCCCTAAGA AAGCTCCAAT CCCTACTGGA 3840 

GTTGGCCATG GAACCTGTGC GATAATTGGC TTAATAAAGT TTAGAGAATT CGCTACGTAA 3900 

TAAATAGTAG CAGTAACCAT TGGTGCTAAA ATAAATGGTA TAGCCAAGGC TGGATTATAG 3960 

ATAATAGGTA ATCCAAAAAT TAATGGTTCA TTAATATTAA ATAAGGCTGG AACTACAGAT 4020 
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GCTCGTCCTA 


TTGCTTTAAG 


CTGTTCAGAT 


TTAGAGGCAA 


AAGCAATATA 


TAAACATAGT 


4080 


CCTAAAGTTG 


CACCAGAACC 


ACCTGCAATT 


ACAAACATAT 


TAGAAAATTC 


ACCTGCAACA 


4140 


GCGAAGTGCC 


CGCCAGCAGC 


ATTTTCAGCC 


ATGTTAGCAA 


GAGCAATTGG 


ACTAACAAAT 


4200 


GCAAAAACAA 


TGTTCGCACC 


GTGGATACCT 


ACAATCCAAA 


GTAGTTGAGT 


CAATAGATAA 


4260 


ATAATCATTA 


AACCAATCCA 


CGAATTAGTC 


AGATTGGATA 


CAAAACCAAA 


TGGAATTGCA 


4320 


ATGACTTTAA 


AAATATCTGT 


TCCCATTGCT 


ACAAGAAGAC 


CGTTGATAAA 


GATAACAACA 


4380 


AATGCAACAA 


CAAATCCCGG 


AACCAAAGCG 


GTAAATCCAC 


GAGAAACTCC 


TTCTGGAACA 


4440 


GCTTCAGGCA 


TTTTAATAAC 


CCAATTATGT 


TTAACACACA 


TACGATAAAT 


AAGAACAGTC 


4500 


ACAATTGCCA 


TAATGATTGC 


GGTAAAAATC 


CCTGTTGTCC 


CAAAACGTGC 


GACTACATTT 


4560 


CCCATTGCCC 


ATCCATCTGC 


AATTACTGCA 


CCTTCTTTTA 


GACTTGTCAC 


AGTCTTCATC 


4620 


ATTCCACCAT 


CAAAAATGAT 


TTGCGGTACT 


GTCATGACAA 


AAGCCATCAA 


GGCAAGCAAG 


4680 


GCACCATTAA 


GAGGATTCAT 


ATTGAGTTCT 


TCTTCCTCTG 


CATAAATTTT 


TGTCAATTCA 


4740 


TATGCAAGTG 


ATAGAACGAA 


ATAAAGAGAT 


AGAGAACCCA 


TAGTCGCATA 


GTTTGCAACC 


4800 


ATGTAAAGTG 


ATGTGAATTT 


ATCAAATGAA 


GCAGAGAAAA 


TATCTGCCAC 


AATTGGCCAA 


4860 


AATGAGAAAG 


CTTGTGGCAA 


AATACTGAAT 


ACCAAAAACA 


TTGATCCTAC 


AATAGTAAAT 


4920 


GGTACAGCAG 


CCATACCTGC 


AGCCGTGATA 


GCACGTACTA 


CTTTAAACTG 


AGCAAGTTTG 


4980 


CCCATTGGTC 


CCATAACATG 


GTTTTCAAGA 


AAACCAAACA 


ACCCGTTTTG 


TTGATCCATA 


5040 


AATAGACCTC 


CTTAATAAAA 


CATAATAATT 


TTTACTTTCT 


AAAGACTAGT 


TTCAAATACA 


5100 


AATTATACTA 


GATCAGGATT 


ATAAACTAAG 


TGAGTTCTTT 


TCCAATTGGA 


CAAATTGTTG 


5160 


ATAAGCCTTA 


TCTGTTCGTT 


TATAAATTTT 


TTTAATTCTT 


CTAATGTCTA 


ACAAACTCAG 


5220 


AACTAAACCT 


AATAGAAGAA 


CTACAAAAAC 


AAATAAACGT 


GCTACTTGGT 


TATTTTCAAA 


5280 


AATCGGAAAA AGATTCTTAA 


ACCAACTTGT 


CCAAGTTAAA 


ACAAGTAATC 


CTATTGAAAT 


5340 


AAGCATTTGT 


ATTCTAACAA 


ACATTAGTGT 


TATTCCCAAC 


TTTTCTTTCC 


TATTTCCATA 


5400 


AAGTTTAAAT 


TGTTCAACAG 


TTGCTAAAAT 


AGAAAATACT 


ATGAGCATAA 


TGGGGAAAAT 


5460 


AATAATAGGC 


GAGGGACTAA 


TAAACTGACT 


CAAAAGCCAA 


TAAATATTCC 


CAAAAAAGAA 


5520 


GAGTGCTATT 


GAATAACGTA 


GAAGAAGATA 


TCGATTGAAA 


AAAGTATTAG 


TTAGAGCCAT 


5580 


CTCTCGACGT 


TGTTGTTCAA 


TCTTTTGTCG 


TTCTTTTTTA 


TCCATATCAT 


TTCCTCCTTA 


5640 


TATAACAACA 


CATATTTAGT 


TAACTTTCTT 


ATAAAGAGCT 


AACATTTCCT 


TTGCTACTTC 


5700 


TAATAATGTC 


ATAGTGGTCA 


TTAAATGATC 


TTGAGCATGT 


ACCATGATAA 


TTTCAATTTT 


5760 


AATTTCCACT 


CCACTTGCGT 


ATTCTTGCAA 


GAGTTTGGTT 


TGTGCATGAT 


GCGCTTCAAG 


5820 
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AATTATCTCA TTTGATTGAT TTAATTTACT TTCTGCATCA TCAAAACTAC CTTCTCTCAT 5880 

TTTTGCAAAT GCTTCATGTA TTTCTGACCT TGCATTTCCC GAATGCAGGA TAATTTCAAA 5940 

TGCTGCAACC TGCAGTTCCT CTTGATTCAT ATAAACCTCC TATTTTATCT TCTCAAATAT 6000 

GTTAATAAAA TCTTCAAAGT TATTGCAAGA TATTAGCTGA TTTTGCAATT CATCATTCTC 6060 

TGTCAGAGAG ACTATCTTTT TAGTCACAGT TGCCAAACCT TCGTTCCCAT ATATTGATGG 6120 

AGATAGAAGA AATACTAGCT GGACATGTGA ACTTTGATTA TCCCAGAGTA ACGAATCTTT 6180 

ACAAATTGCA ACCGAAACCT TTCCCTCTGT ACCAAAGGGC TGAATAGGAT GCGGAACTGC 6240 

AATTTTTTCA GAAAAAACAA CTGAACTTAA TTCTTCGCGC TGTTTAATTC CATAAAGTAA 6300 

AGATTGTTCA AACTCATTTG ATTCACCAAC AGATAAACTC TCAACCATCT TTTCAAGTAA 6360 

ATTTACCTTG TCTGATTCAG TACATATTAA AAAGTTTTCT TTACTAAAAT ACTGTCTAAA 6420 

GCCGTTGTTT TCAAATTTGT TAATCTTTGA TGATTGTACA TAACTAGAAA CTTGCATCTA 6480 

ATCCATAGCT TTTCTAATCA TTTCCATCTC ATCACTCTTA AGAAACACAC TAACTTTAAA 6540 

AACTGGGATT TGAAAATATA GATTTGATAA ATCAATAGCT GACACTATAA AATCTATTCC 6600 

TTTAAGTTTT TCTTGATTCA ATTCATAGTA GCCTATTACA TCAACAACTT CTACTCGCTT 6660 

CCCAAACTCC GTTTCCAAAC GATTTCTTAA CATTTGGGCT GCACCAAATC CTGTTGCACA 6720 

AATAGCAAGA ATATTAAACT TAGTACTCTC TTTGCTACGT TCCATAGCAG CTAAAAAGTG 6780 

AAGACTTACA TATGCTACTT CATCATCTGA TATTGTCCAC TCCAAGAACT TGTCCATATT 6840 

TGCAAGAATT TCTCTAGTCA TAAAGAATAT ATCACTATAA TTCTGTTTAA TTTCATCTAC 6900 

CAAAGGGTTA TTTAAGGTAA TCCGGCTTTC TAAACGTACT TGTAGTGTCA TTAGATGAGT 6960 

TATCAATCCT TCAATTAGTT GGAAATCTGA AGAAAAGTTA TACATATCAT CTAATCCTAA 7020 

ATTCTGAAAT GTTTTAAATA AAGATTTTTT TAAAACTTCT TCAGAAATAT TCTTCTGATT 7080 

TTTTTGACAT TGTTGACTCT TAGCTAACAA ATGCAAAGTA ATGTAGTCTA TTTCCTGAAC 7140 

TGGAAATTCC TGATTTGTTA CTTCTCTTAC TTTAGAAAGA ATTCTTTGGG CAACCTTTCT 7200 

CTCTATTGCA TCATCAGTCA TCTGACAGTC TATATTTTTT ATTTCAAATC CGGATTTTAA 7260 

ACGAATCACA GACAATGCTA TGTGAACTAC TAAATTCTGT AGTACAAAAT CAGATAGTTT 7320 

TAGGTTGGCC TCTTGGCATT CATCCAAAAC AATTCTAGCA AATTCTTCTA ATGGAACAGT 7380 

TTGATCAAAA AAGTTAAATT TTACATAGCA ATGTATTGTT TTAAAAAATT GATTCTCTAG 7440 

GAAATAATTT ATGATAAAAC GTCGTTTATC ACGTTCCTCG CCTGAGACAT AAACTCCTTT 7500 

ATTCGCCCTA CTCTCAATGG ACAAATTATA CTCTGATAAC ATCACTCGTA TCTTTCTGAA 7560 
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ATCATGAGAT AATGTTGAAC GACTAACGTA AAGTTCATCA GCTAAATCAT CAAAAAGAAC 7620 

TGGAACTTGC TCAAATAATA ATTTATTTAA GATAAATACT AAACGATCAT CACCTTTTGA 7680 

AACCGCAGTT TTCGTATAGT CTTCTTCCAG TTCATAAGTT TGTCTAAACT CCTGGTAAGC 7740 

GCCTTGATTC TCAAAAAATA TTTGATACCC TTGACCTTGT TTTGAAATCA ACCGGACTCC 7800 

TTGAATAATC ATTGTCTTCT CAATTAATTT CAGTACATTA CGGACAGTTC TATCTGAACA 7860 

GGATAAATAT TCTGCCAGTT CTTTGCTTGT AACAAAACGT TCCTTATTTT TTATTAAAAA 7920 

TTGAAGGATA TCTTTCTCTT TAATGTTTAA CACATTCATT CCCTCCTAAA ACGTATGTTT 7980 

TCATATATTG AAGCATATTA T AC ACT T AAA TCAGTTTATA TCAAACTCAA AACAATTTAT 8040 

CTTAACCTAA ATATTTATTG ACATTTCATG TGTTCATCAA ATATTCTCAA GAATCAAATT 8100 

AGCCATTTTT TCAATTCCCA TTGGAATAGG AATATAGGCT TGAGGAGGTA TTTGTACAAC 8160 

TGGTTTTCCT GCTTTAGAAC CAGCCTCTTC AAATTGCTTA AAGTACATTT TTGTTTGAGG 8220 

ACTGACAAGA TACAAATCAA AAGCTGCTGC TGCGATAGCT TTCCCTCCTT CAGTAGCACT 8280 

AATAGCATCA ACTACAATAT CTTTCCCTTT TCCTTTTAGA AACTCTGTTG TTTTCTGTGC 8340 

CATAAGTGAT GAAGACATTC CTGCTGCACA AATAATTAAA GCTTTTGCCA TAATATTTTC 8400 

TCCTTTTCTT AAATCCAATC AAAGCTGTGC TAAGTTGGCT TATTTGTTAT CTATTTTTAT 8460 

TATAAAATAA AGCGTTTCCA ATGACAATTC CCTCATTTTC CTAAATGATA TGGAAAAAAA 8520 

TTATTTATAC TTCAATTTAT AAAATAAAAT TATTCCTGAG AGTAGAAATG AAACACTATT 8580 

TGCTAAAATC AAAGGCAAGT CTCCTATACG AATACCATGA GCAAGCCACA ATGCAATACC 8640 

AATAACTTGC ATAACATACA TACCTAGAGC AATAGATCCT GTGTCCTTTG TCTTAACTAC 8700 

ACGAAAAACT TGTGGTAAAA ATGCAAATGT TGTTAAAATT GCTGCAATAC TTCCAATCAT 8760 

ATGTCACCTC AATATGCTAA ACAAACTGAG AATAATCTCA GTTTGTTTAT ACTATTCTAC 8820 

TGATTCACCG TTAGATGAAA TAACTTCCTT ATACCAGCCA AAAGATTTTT TCGGGGAACG 8880 

ATTATAACTT CCCTTCCCAT TATCATCTTT ATCTACATAA ATAAAGCCAT AACGTTTCCG 8940 

CATTTCACCG GTACCAGCTG AAACCAAATC AATACATCCC CATGGAGTAT AACCCATTAA 9000 

ATCAACACCA TCTTCAACTA CAGCCTTTTT CATTTCACGA ATATGGGCAC CTAGATATTC 9060 

AATTCTATAA TCATCATGTA CCATACCATC TGCTGCAACT TGATCTATAG CTCCAAAACC 9120 

ATTTTCAACA ATAAAGAGTG GTAAGTGATA GTGGTCTGTA AACCAATTTA ACGCATAACG 9180 

CAAACCTTCT GGATCAATTT GCCACTCCCA TTCAGAAGCC TTAACATAAT TATTTTTCAC 9240 

TAAATCTTCT GTTTCAAGAT AATCAAAATA AGGATTATTT TCACGATGAG AGTCGATAGC 9300 

AAAGGACATA TAGTAACTGA AACCAATGTA ATCTACAGTC CCACCAAGTA AATCTTCTTT 9360 
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ATCCTGGGCA GTAAAATCAA CTGAAATACC TTTTCGTTCC CAATACTTGA AAATATGCTC 9420 

AGGATATTTA CCTAAAACAT GCACATCAGC AAAATAATAA CGCTTCTGCA TAGCTTTCAT 9480 

TGCCATTAAG ATATCCTTAG GATTGCAAGT AACTGGATAA ATTGGACACA TCGCAATCAT 9540 

ACAACCTATT TGAAAATCTG GATTAATCTC ATGACCAATT TTTACAGCTC GTGCAGAAGC 9600 

AACTAATTCG TAATGTGCTG CTTGATACAT AATTGCTTCT CTATTATCAC CTTCCTCATA 9660 

TACAATACCT GAGTTAGTAA ATGGTGCAAA ATCTTCCTGA TAATTCGCTT GATTATTGAT 9720 

TTCATTGAAA GTCATCCAAT ATTTAACCTT ATCTTTGTAA CGTTTAAATA CGACTTCTGC 9780 

AAAACGAGCA AAGAAATCAA TCAATTTCCT ATTTTTCCAA CCACCATATT CGGTCACTAA 9840 

GTGATAAGGC ATTTCAAAAT GAGATAGAGT GATGACAGGT TCAATACCAT TCTTTAAGCA 9900 

TTCATCAAAA AGATTATCAT AAAACTGTAA TCCTTCTTCA TTCGGCTCTA ACTCATCACC 9960 

TTTTGGAAAG ATACGTGTCC ATGCAATAGA GGTACGGAAG CACTTGAATC CCATTTCAGC 10020 

AAAAAGTGCT ATATCTTCTT TATAACGGTG ATAAAAATCT ATCGCCTCAT GATTTGGATA 10080 

ATATTTACCC TCTAAAACTC CCAAAGTAAT TTCACGAGCT ACTCCATGAC GACCAGCAGT 10140 

CATAACATCA GCAACACTAA TTCCCTTGCC ACCTTCTTGC CATCCACCTT CAAGTTGATG 10200 

AGCAGCAACA GCACCACCCC ATAAAAATCC ATCTTTAAAA GTAGTCATCT TTTTTCCTCC 10260 

TGACTTTGAT ACTCTTATTA TAAACCTTAA ACCAAAAGAT GAAAACGCAT TCTTTTTCCT 10320 

TATTGTTAAG GAAAGAAGTA ATTTTTAATG GAAATAGAAC AATATCTTCT TGTATTCTCG 10380 

TAATGATATC TTTACGATTT TCAATACTTT CAAACTACAA AAACTCTCAC AATAATTCTA 10440 

ATTCCCTGTG TCTATAAACG ACTTATCGCT TTCTGGCATC CCAGAATCAT CTTCTATATA 10500 

ACGTTCAACT TGCATCTGCA AGTGATATTT TTTTCTTAAA TCTAAGATTT TCTGCATTGT 10560 

CTTTGATTGA TAATGTTTAT CTAAAGTTTC TTGATTTATC CACTGATCAA TAAGGAGAAT 10620 

AGTTCCCTCT TTTTCAATTG GTAAAAAATA TTCGTATTTC AAGTTACCTT TTTGATTTCT 10680 

AATTTCTTTA ACAAGGCCAC TATCAAGCAT TTCTCTTGCA AACTTTATTG CACTATCTCC 10740 

ATCACCTTTA TAATATACAT GAATAGTCAA TGTCATCTTA TATCCTCCAA AATCATCCTT 10800 

CAATTTTAAA AAAACAAGTT TAGATGAGGA TCTAAACTTG TTTTTTATGA ACTAATTATC 10860 

TAACGTTTCG CCATTACTTT CAATCACTTC TTTATACCAA TAAAATGATT TTTTCTTATA 10920 

GCGATTTATA GTCAATTGAA ACAAGAGCAG GACAAAAGAG CCTCATAAAA GGTATTGCAA 10980 

CTTGGTAATA CCTTTTTGAG GTGCTTTTTG ATATGAGCCC ATGTTTTCTC AATAGGATTG 11040 

TACTCAGGTG AGTAGGGAGG AAGAGGTAAA AGTTTATACC CAAACTCTTC ACACAAGAGT 11100 
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TCTAGCTTCC CCATTCTATG GAATCTTGCA TTATCCATAA TAATAACCGA TGGTGTGGTT 11160 

AATGTTGGTA AGAGAAACTT CTGAAACCAA GCTTCAAAAA AGTCGCTCGT CATCGTCTCT 11220 

TCGTAAGTCA TTGGAGCGAT TAACTCACCA TTTGTTAGAC CTGCAACCAA AGAAATCCTC 11280 

TGATATCTTC TTCCAGATAC TTT 11303 



(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3112 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 

CCTTAGATTT CCACTTGCCA GAGGAATTGA TTGCCCAAAC GCCCCTTGAA AAACGTGATG 60 

CCTCCAAACT CCTCATCGTC AACCGTGAGA CAGGAGAAAT GCAAGATAAA CATTTCCACT 120 

CTATTATTGA TATGCTGGAA CCTGGTGATG CCCTTGTCAT GAACGACACC CGAGTTCTCC 180 

CTGCCCGCCT CTATGGTCAA AAAGTGGAGA CAGGAGGTCA TGTGGAACTT CTCCTCCTTA 240 

AGAACACTAG TGGAGACGAG TGGGAAGTTC TGGCTAAACC TGCCAAACGC CTCAAGGTCG 300 

GTACTCGTAT CAGCTTTGGT GATGGCCGCC TCAGCGCTGT CGTTACAGAA GAATTGACCC 360 

ACGGGGGACG CATTGTCCGC TTTGAATACC AAGGAATTTT CCTAGAAGTC TTGGAAAGTC 420 

TGGGAGAAAT GCCTCTGCCA CCTTATATCC ACGAAAAATT AGATGACCGT GAACGTTATC 480 

AAACCGTCTA CGCCAAGGAA AGTGGCTCTG CTGCAGCACC GACTGCTGGT CTTCACTTCA 540 

CCAAAGAACT GCTGGCAGAA ATCCAAGCTA AGGGTGTTCA TCTAGTCTAT CTGACTCTCC 600 

ATGTCGGACT CGGAACCTTT AGACCTGTTT CTGTGGATAA TCTGGACGAA CACGAAATGC 660 

ACTCAGAGTT CTATCAACTT TCTGAGGAAG CTGCTGCCAC CCTTCGCTCT GTCAAAAAAA 720 

ATGGTGGTCG TGTCATCGCT GTCGGAACCA CTTCTATCCG CACCTTGGAA ACTATTGGTT 780 

CCAAGTTTGA TGGGCAAATC CAAGCAGATT CTGGTTGGAC CAATATCTTT ATCAAACCTG 840 

GGTATGAGTG GAAGGTCGTG GATGCCTTCT CAACCAACTT CCACCTGCCA AAATCAACTC 900 

TGGTCATGTT GGTTTCTGCC TTTGCAGGCC GTGAATTAGT CTTAGATGCC TACCACCATT 960 

CCATCCAAGA ACACTACCGC TTCTTCAGTT TTGGTGACGC CATGTTTATT TATTGAGAAA 1020 

GAATTTCTCT AAATCTTCTA ATACCAATAA ATCGCTAAGA TATTATTTCA AAGAACATCT 1080 

ACAATTGAAA CTCTAGCTAG CTGTAGAAGA GGCCTAGTAC ATTGAAATTA AAATGCTTCC 1140 

CCCTAGCTTC GAAAATATTG CCATAGATTG CGTTGACTCT CCAAATTGAT TCATCTATAT 1200 
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TTTATTTCAG 


CTTCCTATAC 


TTTCTTCGCT 


GTTTGTAAAT 


CAAAATGCAA 


GACACATGAG 


1260 


TAGCACCATA TTTGTTACTC 


TTATCTGTCC 


TCTCAAGAGA 


CTATTATGAG 


TTATTTCAGA 


1320 


ATCATTCACT 


ACTTTGACCC 


TGACTCTCCT 


TAGTCTCAAA 


ATCAAAGACT 


TATACTCTTC 


1380 


AAAAATCTCT 


TCAAACCGCG 


TCAACGTCAC 


CTTGGATTAT 


ATATGTGatC 


TGaCTTCGTC 


1440 


AGTTCTATCT 


ACAACCTCAA 


AGCAGTACTT 


TGAGCAACCT 


GCGACTAGTT 


TTCTAGTTTG 


1500 


CTCTTTGATT 


TTCATTGAGT 


ATTAAACAAA 


AAGTGAACAA 


ATCTGAATTC 


TAATGTACAG 


1560 


AAGACTAGGC 


TTGTTCACTT 


TTTTATAGTC 


GCTATAAGAT 


GACCTTATCT 


ATAGCTTTTT 


1620 


ATATATAATT 


ATATATTCAG 


ACATACTATT 


ATCAATTTTG 


TCGCAGGGAG 


GAATCTGTTA 


1680 


ACGCACCCAT TCACCATTAT 


CATTGACTCT 


ATAGCCATCT 


ATACTTGTAT 


TGACCGCTAA 


1740 


CTCACCCGAT 


GTATTTACAT 


AATACCATTT 


ACCACCAACT 


TGGAACCATT 


GATTGACTTT 


1800 


CATAGAACCG 


TTGCTGTTGA 


GGTAGTACCA 


TGAACTATTA 


ACTTGTACCC 


AACCTGTTGC 


1860 


CATGGAACCA 


TCAGTATTAT 


AAAAATACCA 


CATACCATTT 


TCTTGTTTCC 


AGTCTGTTGT 


1920 


TGGAGCAACT 


GCTTTAGCTG 


GTTCTACTGC 


TACATCTGTT 


CCTTGGTTAG 


ATGTAACAGA 


1980 


TACAGGATAC 


GAAGGAATAG 


ATGATTGCTC 


AGGAACAACA 


ACTTTTTCAG 


GTTCTCTCGT 


2040 


CCCTCTCCTT 


ATACGTCTTT 


TTACCATCTC 


TTTAGTAATT 


TGACGAGAAG 


TAGTTTCTTC 


2100 


AATTGTTCCA 


TCACGTTCAT 


CTACAGTATA 


GATTGTAGTA 


AGAGTAATTT 


ACCAATTTCT 


2160 


CCTACTTCTT 


CTACTTCTTG 


ACTTTTATCA 


AGAGTTGGGC 


CATCGAGATA 


TTCTGTTTCG 


2220 


ATTGGAATTT 


CTTGGACAAG 


AACTTGGGGC 


TTGGTTCTTT 


TTTTAACAAC 


TCTTGTTTGA 


2280 


GAGTCTTTTT 


TTTGACTTAA 


AGTACTCTCA 


GTTACTTGTC 


CACTCTTTCC 


ATCTACATTA 


2340 


TAAGTTATCG 


TTGTAACTGT 


TTTCCCATTC 


TTTCCTAGAG 


TAATCTCTTG 


CTCCTGTCCT 


2400 


GCAGAAAGGT 


CATTGTCTGC 


TTCATATTTA 


GTAGCAAATG 


GAACAAGAAC 


TTCTTCAACC 


2460 


TTGCTTTTAG 


CTGGAACTTT 


GATAACTGTA 


TCCGTGGCTT 


CTTTTCTATC 


AACAGTAACC 


2520 


TGTTCGGTAA 


CATAACCAGT 


CTCTGGATTA 


ACATCGTAGG 


TCCTTGTCGT 


AGTTACATAG 


2580 


CCATCCTCTC CATCAATTGT 


AACAGGATTT 


TCACTACGGT 


CTTTTGTTTC 


ATCTTTTTCA 


2640 


TAACGAATTC 


GCGTACTTGA 


AATTTTCTTG 


GTTACTACCT 


TAGGTTTAGT 


CGCTACTTTT 


2700 


ACAATAATAT 


CCCCATTGTC 


AGCGTCATCA 


TACTCTATTC 


CCTCTTCTTT 


ATCTCTAGTA 


2760 


TCATCTCTGA CATATTGAAT 


CCCATCAGCA 


GCATGAACAA 


AACTTGTATT 


CAGATTCCTC 


2820 


CTAAAAATAA AGTTAGCCCG 


ATTACCGCAG 


AACCAAAAAT 


CTTTCCGAGT 


TTACGTATTG 


2880 


CATAGCGCTT ATTAGTATTA GATTTTGCCA TTACATCCTA CTTCTAGTAT AGCATCTTTT 


2940 



i 
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CTATCAAACG TTAAACAATA TACGTTATAT ATAAAATAGA CTTAGAATGA TATATTGATT 3000 

ATTGAACTAA CACTTTAACT ATATCGTAAT CAATCTCATA TATAAAGGAT TGCAGACATC 3060 

TTATCTAAAT ACATGCGAAT ATATTTAGAT ACAAACATTC CAACTTGATA AT 3112 



(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4327 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 117: 

CCCAAAAATC TCTTCAAACC ACGTCAGCTT CGCCTTGCCG TAGTATGGTT ACTGACTTCG 60 

TCAGTTCTAT CCACAACCTC AAAACAGTGT TTTGAGCATC ATGCgGCTAG CTTCTTAGTT 120 

TGCTCTTTGA TTTTCATTGA GTATAAAAAC AGATGAGTTT CTGTTTTCTT TTTATGGACT 180 

ATAAATGTTC AGCTGAAACT ACTTTCAAGG ACATTATTAT ATAAAAGAAT TTTTTGAAAC 240 

TAAAATCTAC TATATTACAC TATATTGAAA GCGTTTTAAA AATGAGGTAT AATAAATTTA ' 300 

CTAACGCTTA TAAAAAGTGA TAGAATCTAT TTTTATGTAT ATTTAAAGAT AGATTGCTGT 360 

AAAAATAGTA GTAGCTATGC GAAATAACAG ATAGAGAGAA GGGATTGAAG CTTAGAAAAG 420 

GGGAATAATA TGATATTTAA GGCATTCAAG ACAAAAAAGC AGAGAAAAAG ACAAGTTGAA 480 

CTACTTTTGA CAGTTTTTTT CGACAGTTTT CTGATTGATT TATTTCTTCA CTTATTTGGG 540 

ATTGTCCCCT TTAAGCTGGA TAAGATTCTG ATTGTGAGCT TGATTATATT TCCCATTATT 600 

TCTACAAGTA TTTATGCTTA TGAAAAGCTA TTTGAAAAAG TGTTCGATAA GGATTGAGCA 660 

GGAAGTATGG TGTAAATAGC ATAGGCTGAT GTC CATC ATT TGCTTATAAA GAGATATTTT 720 

AGTTTAATTG CAGCGGTGTC CTGGTAGATA AACTAGATTG GCAGGAGTCT GATTGGAGAA 780 

AGGAGAGGGG AAAATTGGCA CCAATTTGAG ATAGTTTGTT TAGTTCATTT TTGTCATTTA 840 

AATGAACTGT AGTAAAAGAA AGTTAATAAA AGACAAACTA AGTGCATTTT CTGGAGTAAA 900 

TGTCTTATTT CAGAAATCGG GATATAGATA TAGAGAGGAT CAGTATGAAT CGGAGTGTTC 960 

AAGAACGTAA GTGTCGTTAT AGCATTAGGA AACTATCGGT AGGAGCGGTT TCTATGATTG 1020 

TAGGAGCAGT GGTATTTGGA ACGTCTCCTG TTTTAGCTCA AGAAGGGGCA AGTGAGCAAC 1080 

CTCTGGCAAA TGAAACTCAA CTTTCGGGGG AGAGCTCAAC CCTAACTGAT ACAGAAAAGA 1140 

GCCAGCCTTC TTCAGAGACT GAACTTTCTG GCAATAAGCA AGAACAAGAA AGGAAAGATA 1200 

AGCAAGAAGA AAAAATTCCA AGAGATTACT ATGCACGAGA TTTGGAAAAT GTCGAAACAG 1260 
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TGATAGAAAA 


AGAAGATGTT 


GAAACCAATG 


CTTCAAATGG 


TCAGAGAGTT 


GATTTATCAA 


1320 


GTGAACTAGA 


TAAACTAAAG 


AAACTTGAAA 


ACGCAACAGT 


TCACATGGAG 


TTTAAGCCAG 


1380 


ATGCCAAGGC 


CCCAGCATTC 


TATAATCTCT 


TTTCTGTGTC 


AAGTGCTACT 


AAAAAAGATC 


1440 


AGTACTTCAC 


TATGGCAGTT 


TACAATAATA 


CTGCTACTCT 


AGAGGGGCGT 


GGTTCGGATG 


1500 


GGAAACAGTT 


TTACAATAAT 


TACAACGATG 


CACCCTTAAA 


AGTTAAACCA 


GGTCAGTGGA 


1560 


ATTCTGTGAC 


TTTCACAGTT 


GAAAAACCGA 


CAGCAGAACT 


ACCTAAAGGC 


CGAGTGCGCC 


1620 


TCTACGTAAA 


CGGGGTATTA 


TCTCGAACAA 


GTCTGAGATC 


TGGCAATTTC 


ATTAAAGATA 


1680 


TGCCAGATGT 


AACGCATGTG 


CAAATCGGAG 


CAACCAAGCG 


TGCCAACAAT 


ACGGTTTGGG 


1740 


GGTCAAATCT 


ACAGATTCGG 


AATCTCACTG 


TGTATAATCG 


TGCTTTAACA 


CCAGAAGAGG 


1800 


TACAAAAACG 


TAGTCAACTT 


TTTAAACGCT 


CAGATTTAGA 


AAAAAAACTA 


CCTGAAGGAG 


1860 


CGGCTTTAAC 


AGAGAAAACG 


GACATATTCG 


AAAGCGGGCG 


TAACGGTAAC 


CCAAATAAAG 


1920 


ATGGAATCAA 


GAGTTATCGT 


ATTCCAGCAC 


TTCTCAAGAC 


AGATAAAGGA 


ACTTTGATCG 


1980 


CAGGTGCAGA 


TGAACGCCGT 


CTCCATTCGA 


GTGACTGGGG 


TGATATCGGT 


ATGGTCATCA 


2040 


GACGTAGTGA 


AGATAATGGT 


AAAACTTGGG 


GTGACCGAGT 


AACCATTACC 


AACTTACGTG 


2100 


ACAAT C C AAA 


AGCTTCTGAC 


CCATCGATCG 


GTTCACCAGT 


GAATATCGAT 


ATGGTGTTGG 


2160 


TTCAAGATCC 


TGAAACCAAA 


CGAATCTTTT 


CTATCTATGA 


CATGTTCCCA 


GAAGGGAAGG 


2220 


GAATCTTTGG 


AATGTCTTCA 


CAAAAAGAAG 


AAGCCTACAA 


AAAAATCGAT 


GGAAAAACCT 


2280 


ATCAAATCCT 


CTACCGTGAA 


GGAGAAAAGG 


GAGCTTATAC 


CATTCGAGAA 


AATGGTACTG 


2340 


TCTATACACC 


AGATGGTAAG 


GCGACAGACT 


ATCGCGTTGT 


TGTAGATCCT 


GTTAAACCAG 


2400 


CCTATAGCGA 


CAAGGGTGAT 


CTATACAAGG 


GTGACCAATT 


ACTAGGAAAT 


ATCTACTTCA 


2460 


CAACAAACAA 


AACTTCTCCA 


TTTAGAATTG 


CCAAGGATAG 


CTATCTATGG 


ATGTCCTACA 


2520 


GTGATGACGA 


CGGGAAGACA 


TGGTCAGCTC 


CTCAAGATAT 


TACTCCGATG 


GTCAAAGCCG 


2580 


ATTGGATGAA 


ATTCTTGGGT 


GTAGGTCCTG 


GAACAGGAAT 


TGTACTTCGG 


AATGGGCCTC 


2640 


ACAAGGGACG 


GATTTTGATA 


CCGGTTTATA 


CGACTAATAA 


TGTATCTCAC 


TTAGATGGCT 


2700 


CGCAATCTTC 


TCGTGTCATC 


TATTCAGATG 


ATCATGGAAA 


AACTTGGCAT 


GCTGGAGAAG 


2760 


CGGTCAACGA 


TAACCGTCAG 


GTAGACGGTC AAAAGATCCA CTCTTCTACG 


ATGAACAATA 


2820 


GACGTGCGCA 


AAATACAGAA 


TCAACGGTGG 


TACAACTAAA 


CAATGGAGAT 


GTTAAACTCT 


2880 


TTATGCGTGG 


TTTGACTGGA 


GATCTTCAGG 


TTGCTACAAG 


TAAAGACGGA 


GGAGTGACTT 


2940 


GGGAGAAGGA 


TATCAAACGT 


TATCCACAGG 


TTAAAGATGT 


CTATGTTCAA 


ATGTCTGCTA 


3000 
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TCCATACGAT GCACGAAGGA AAAGAATACA TCATCCTCAG TAATGCAGGT GGACCGAAAC 3060 

GTGAAAATGG GATGGTCCAC TTGGCACGTG TCGAAGAAAA TGGTGAGTTG ACTTGGCTCA 3120 

AACACAATCC AATTCAAAAA GGAGAGTTTG CCTATAATTC GCTCCAAGAA TTAGGAAATG 3180 

GGGAGTATGG CATCTTGTAT GAACATACTG AAAAAGGACA AAATGCCTAT ACCCTATCAT 3240 

TTAGAAAATT TAATTGGGAA TTTTTGAGCA AAAATCTGAT TTCTCCTACC GAAGCGAACT 3300 

AGAGAGATGG GCAAAGGAGA GATGGGCAAA GGAGTTATTG GCTTGGAGTT CGACTCAGAA 33 60 

GTATTGGTCA ACAAGGCTCC AACCCTTCAA TTGGCAAATG GTAAAACAGC GACTTTCCTA 3420 

ACCCAGTATG ATAGCAAGAC CTTGTTGTTT GCAGTAGATA AGGAAGATAT CGGACAGGAA 3480 

ATTATTGGTA TAGCTAAAGG AAGCATCGAA AGTATGCATA ATCTTCCTGT AAATCTAGCA 3540 

GGTGCCAGAG TTCCTGGCGG AGTAAATGGT AGCAAAGCAG CGGTGCATGA AGTTCCAGAA 3600 

TTTACAGGGG GAGTTAATGG TACAGAGCCA GCTGTTCATG AAATCGCAGA GTATAAGGGA 3660 

TCTGATTCGC TTGTAACTCT TACTACAAAA AAAGATTATA CTTACAAAGC TCCTCTTGCT 3720 

CAGCAGGCAC TTCCTGAAAC AGGAAACAAG GAGAGTGACC TCCTAGCTTC ACTAGGACTA 3780 

ACAGCTTTCT TCCTTGGTCT GTTTACGCTA GGGAAAAAGA GAGAACAATA AGAGAAGAAT 3840 

TCTAAACATT TGATTTTGTA AAAATGGCTC TTTGTCAACT GTAGTGGGTT GAAGTCAGCT 3900 

AAGCTCGAGA AAGGACAAAT TTTGTCCTTT CTTTTTTGAT ATTCAGAGCG ATAAAAATCC 3960 

GTTTTTTGAA GTTTTCAAAG TTCCGAAAAC CAAAGGCATT GCGCTTGATA AGTTTGATGA 4020 

GATTATTGGT CGCTTCCAAT TTGGCGTTAG AATAGTGTAG TTGAAGGGCG TTGACGATTT 4080 

TCTCTTTGTC CTTTAGAAAG GTTTTAAAGA CAGTCTGAAA AAGAGGATGA ACCTGCTTTA 4140 

GATTGTCCTC AATGAGTCCG AAAAATTTCT CCGGTTCCTT ATTCTGAAAG TGAAACAGCA 4200 

AGAGTTGATA GAGCTGATAG TGATGTTTCA AGTCTTGTGA ATAGCTCAAA AGCTTGTTTA 4260 

AAATCTCTTT ATTGGTTAAA TGCATACGAA AAGTAGGGCG ATAAAAATGT TTATCGCTGA 4320 

GTTTACG 4327 
(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3521 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 

CTCTGGCCCT GCCACTCCAA CGTTTTGTCA GGGTGCTTTT TTCATAAAGG AGTTCTTATG 60 
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TTAGATATCA 


AACGTATTCG 


TACAGATTTT 


GAAGCTGTCG 


CAGAAAAATT 


AGCTACACGT 


120 


GGTGTAGATG 


CTGCTGTCTT 


GAATGAAATG 


AAAGAAATCG 


ATGCTAAACG 


TCGTAACATC 


180 


TTGGTCAAGG 


TTGAAACTCT 


CAAAGCAGAA 


CGTAACACAG 


TTTCTGCTGA 


GATTGCCCAA 


240 


GCTAAGCGCA 


ACAAGGAAAA 


TACAGATGAC 


AAGATTGCTG 


CCATGCAAAA 


TCTATCTGCT 


300 


GAGGTTAAAG 


CCTTGGATGC 


TGAATTGGCA 


GAAATCGATG 


CTAAATTGAC 


AGAATTTACA 


360 


' ACGACTCTTC 


CAAATATCCC 


AGCTGACAGC 


GTTCCTGTTG 


GGGCTGACGA 


AGACGACAAT 


420 


GTGGAAGTTC 


GCCGTTGGGG 


TACTCCACGC 


GAGTTTGACT 


TCGAACCTAA 


AGCTCACTGG 


480 


GATCTCGGTG 


AAGACCTTGG 


TATCCTTGAC 


TGGGAACGCG 


GTGGTAAGGT 


AACAGGCGCT 


540 


CGCTTCCTCT 


TCTATAAAGG 


CCTCGGTGCT 


CGTTTGGAAC 


GTGCTATCTA 


CAACTTTATG 


600 


TTGGATGAAC 


ATGGAAAAGA 


AGGCTATACT 


GAAGTCATCA 


CACCTTACAT 


AGTCAACCAT 


660 


GATTCTATGT 


TTGGTACTGG 


TCAGTATCCA 


AAATTTAAGG 


AAGATACTTT 


TGAACTCAGC 


720 


GATACCAACT 


TTGTCTTGAT 


TCCAACTGCT 


GAAGTTCCTC 


TGACAAACTA 


CTACCGTGAT 


780 


GAAATCTTAG 


ACGGCAAAGA 


TCTTCCAATC 


TACTTCACTG 


CCATGAGTCC 


GTCATTCCGT 


840 


TCTGAGGCTG 


GTTCTGCCGG 


TCGTGATACG 


CGTGGCTTGA 


TCCGTTTGCA 


CCAATTCCAC 


900 


AAGGTTGAAA 


TGGTCAAATT 


TGCCAAACCA 


GAAGAATCTT 


ACGAAGAATT 


GGAAAAAATG 


960 


ACAGCCAACG 


CTGAAAACAT 


TCTTCAAAAA 


CTCAACCTTC 


CATACCGTGT 


CGTTGCTCTC 


1020 


TCTACTGGAG 


ATATGGGCTT 


CTCAGCTGCG 


AAGACTTACG 


ACTTGGAAGT 


GTGGATTCCA 


1080 


GCACAAAACA 


ATTACCGTGA 


AATCTCAAGC 


TGTTCAAACA 


CAGAAGATTT 


CCAAGCCCGT 


1140 


CGTGCCCAAA 


TCCGTTACCG 


TGATGAAGCA 


GATGGCAAGG 


TGAAACTCCT 


TCATACCTTG 


1200 


AACGGTTCTG 


GACTTGCAGT 


TGGACGTACA 


GTGGCTGCAA 


TTCTTGAAAA 


TTACCAAAAT 


1260 


GAAGATGGTT 


CTGTGACCAT 


CCCAGAAGCA 


CTTCGTCCAT 


ACATGGGTGG 


AGCTGAAGTC 


1320 


ATCAAACCAT 


AAAAAATAAG 


GTTTAGCTAT 


TTCTAGCTAG 


ACCTTTTTTC 


GTAACCAAAT 


1380 


CAGATAAGCA CCTAGTACAA AGAATAAAAT AGTTAGGCAT ATAATGGTTT 


CAGCCAATAC 


1440 


CAGGTAATCC 


AGAAATGGAA 


GTTTCAAAAT 


TCCCTGAGCC 


ATCTTGAGCG 


AGGTCGCTGT 


1500 


GATAATGGTT 


GGGAAGGTGA 


GGGCTGAGAA 


GGCTGGTTGA 


AAACCTTGTT 


TTAAAATGTT 


1560 


GGGCAGACGA 


GTTAAAACAA 


AGAAAAAGAA 


GGATTGAGAA 


GCCAAAATCA 


TGACAATCAA 


1620 


GACCCAAGTC 


GGCAGGCTGG 


TTCCTCCTAC 


TCGAACTAGA 


GAAGCCAAGA 


GTAGAGAGAA 


1680 


AGGAGCACAG 


TAGATTCCTT 


CTTGTCCAAG 


CAAGGCTAGT 


GGGAGTGGAT 


GTTTCTTTAA 


1740 


ATCGCTATAA 


ATAAGGGGAT 


AGAGATAGAA 


GGTCAAGAGA 


AAACCAAAAC 


TCAAGGTCGC 


1800 
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ATAGGCAATT TCGATAATAC CTACCAGAGG ATAGGTCAAG GCAGCCACTG CTATCCCCAC 1860 

ATAGAGAACC GTCCAGCTTG GAGTGGCATG AACCCTCCGC CCTGGACAAG CAAACTTGAT 1920 

GGTAAAACCA GCAATCAAGG TCAAATCCAA GAGAAATGAA AACCACCAAA TCCCTTGTGC 1980 

TACCAAAGGA AGATAAGAGA ATACGCGAAA GACATAGGTC GATAAAATCA TCCCAGCCAT 2040 

AGGAAAGGTT GCCATTCCTG ACAAAAGAGG GGGCTTGGTC AATTCTTGCT TGGTTTCTTT 2100 

CCAATTAAAG AGATGCAGAA TTAGAAAGTA AATCCATAAA ACCAAACCAA TCAGACTAAA 2160 

AAGATGGGAT AGAACCGGCA ACGTATCTAA AATAAGATTT CCAGCTCCTG CCAAACCTAG 2220 

CAAACAACCT GAAAATACTA AGGGGAGTTT TTTCATCCTA ACCTCCAATA ATCATGTTAG 2280 

TTTCAGTATA ACATAAAAGC GCTTAAATGA GGATTTAAAA AAACGAGTCC GCTTATTTCA 2340 

GACTTCATTT TACTCAGATA TGAATTAGGC ATAAGGTTGC AATTCTGGAT TAATTGGTGT 2400 

ATTAGCTAAG TTGTTGGCAT AGTTACAGAG GATTGCTAGG CTGACACCAA AAACCACATC 24 60 

CAAGGCATTT TGTTGAGTGT AGCCAGCTTC TAAAAACTCA GACAAGGCTT CATCTCCTAC 2520 

ACGACCCTTG GTATTGATAA CTGCCAAGGT AAACTTAGCT AGGGTATCCA ATTTAGGATC 2580 

TGTTTCAATT GGAGTACGAT TGCGAAGAGC TTGAATCAAG TCATCATTCA TCTGGATTTG 2640 

TTTGATGGAA AAGGCTGTGT GACCTGCGAC ACAGAAGGCA CAACCATTGG TCACGGCTGC 2700 

CGTGATTTGC ACCACTTCAC GCTCAACGGG TGTCAGGCTG TTGCGACGGT GGATAGATGA 2760 

GACAATTTGG TAGGCTTCTA AAACAGTCGG GGCATTGGCC AAGAGACCGA TTAGGTTGGG 2820 

AATATAGCCA TTGTTGTCTT TTTCTACTGT TTCAAGAATT TCTTTCACTT CTGCTGGTGC 2880 

TGACTCTACT GTATGGATAG TAAATGTTGT CATAAGATAC CTCTTTTCTT ATTATTGACA 2940 

CTAATATTAT TGGAAAATCT TATAAAATCC TGATTCCTAA GTTTATCTAA GATAAAGCTT 3000 

TATTCTCTCA TAAGATTTTC GTTGTTATAT TAGTTTATCA CACTTCCAAT CACTTGTATA 3060 

ATATATATTA TATATCAGGC TGATAAAAAT TATTTATAGG CAAAAAAATC ACACGAGCTG 3120 

TGTGATTCCA TTATTTGTCA AAATACTTTT TAGTTTCAGC AATAACGACT GGCGACAAGA 3180 

CCAAGAGGGC AATCAAGTTT GGCAGAGCCA TCAAGGCGTT AACGATATCT GCGATAATCC 3240 

AGACCATATC CAACTCGATA AATCCTCCTA ACAAGACCAT GAGCACAAAA ACCACACGGT 3300 

AGAGCCAGAT AAAGCGAACC CCAAAGAGGA ACTCAAAACA GCGTTCTCCG TAATAGTTCC 3360 

AACCTAGAAT CGTTGTAAAG GCAAAAAGTA CAAGGAAGAT GGTCAAGAGA GCAGGCCCAA 3420 

AGTGTGAAAA GTTTGTTGAG AAAGCTGACT GAGTCAAGGC AACCCCATTC AAGTCACCGC 3480 

TCCAAACTCC AGTTACCAAG ATGGTCAAAC CAGTTAGAGT A 3521 
(2) INFORMATION FOR SEQ ID NO: 119: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1968 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 

* AACCTGGGCA AGCAAGCTAA AAGCAATGGG ACCTGGAATC CTAATGGCAA CTGCCGCTGT 60 

TGGAGGTTCC CACATTGTAT CCTCAACTCA AGCTGGCGGT TCTTACGGTT GGTCTCTACT 120 

TCTCTTGGTC ATCTTAGCCA ATGTCTTTAA ATATCCATTT TTCCGTTTTG GTGCTGAATA 180 

CACAGCTGAT ACTGGAAAGA CTTTGGTTGA AGGTTATGCC GAAAAAGGAA AACTCTATCT 240 

CTGGATTTTC TTTATCCTCA ATGTCTTTTC GGCTATGGTC AACACGGCTG GTGTTGCCAT 300 

TCTGTGCTCA GCTATCATCG CCAGTGCCTT CCCAATGATT GGACTTAGCA TTACTCAGTG 360 

GTCCCTCATT CTCGTTGCAA TCATTTGGGC TATGCTACTC TTTGGAGGCT ACAAACTTTT 420 

AGACGGCATG GTCAAATGGA TTATGTCTGC CTTAACCATT GCGACTGTTC TTGCAGTTAT 480 

CATTGCGGCG GTCAAGCATC CAGAATACAG TTCTGATTTT GTCGAGAAGA CACCTTGGCA 540 

AATGGCAGCT CTGCCCTTCA TCGTCTCCCT CCTAGGATGG ATGCCGGCTC CTATTGAAAT 600 

TTCAGCCATC AATTCACTTT GGTCAGCTGA AAAGAGAAAG ACCGTCAACT TTAACACAGA 660 

AGACGCTCTG TTTGACTTTA ACACTGGTTA TATTGGAACA GCTATCCTAG CCGTCTTCTT 720 

TGTGGCACTG GGAGCACTGA TTCAGTATCC TACAGGGCAG GCGGTTGAAG CTGCTTCAGC 780 

CAAATACATC TCTCAATTCG TGGGCATGTA TGCCTCTGTT CTTGGCGAAT GGTCCCGTTA 840 

CTTGATTACC TTTATTGCCT TCCTCTGTAT CTTTGGAACA GTTATAACTG TTATCGATGG 900 

CTATTCTCGC GTTAATCAGG AATCTCTCCG ACTGCTAATC AGTCAAAAAG AGGACAATCG 960 

TAAATCTTTG AACATCTGGA TGACCATCAC TGCTATCATC GGTATCGTCA TTATCAAGTT 1020 

CTTCGCTGGT CAGGTTTCAA CCATGCTCCG CTTTGCCATG ATTGGCTCTT TCCTGACAAC 1080 

ACCTTTCTTT GCTCTTTTGA ATTACGCCTT GGTAACGCGT GAAAACAAAA ATCTTCCTTC 1140 

TTGGCTCAAA CACCTTGCCA TTGCGGGATT GATTTTCCTC TTTGCTTCGC CATCTTCTTT 1200 

ATCTACGCAC TCGCAATCGG AAAAGCAGGG TAAGGGACAA GCGCGAGATG AAGATAAGGT 1260 

TTCATTTCAA GAGAAAATTC AGCAAATATT TCTATGATAA AAAGCATAAG AACAAGGTTT 1320 

TGAAGACCTG AACTTATGCT TTTTTACGTT CTTAAAGACT GTTTATACTC AAAAAACAGT 1380 

TGAACAACTT CAACCACCTC TTATAAGAAC TTTATACTAT TCGAGAATCT CTTCAAACCA 1440 
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CGTCAGCTCT ATCTGCAACC TCAAAGCTGT GCTTTGAGCA ACCTGCGACT AGCTTCCTAG 1500 

TTTGCTCTTT GATTTTCATT GAGTATTAAT TCTCCTTTTC CAACTCATAC AAATCTGCGA 1560 

TAATAGCTGC GACATGTTTG ATATCTTCCA GCATGCCTCG CATTTCAAAG TCAGCCAATA 1620 

CAGGGAAGCC AAAGCGTTGA CTGTATTGCT TGGCTGTTAG GCAGTATTGG TTATTAAAGT 1680 

TACGATTTCC TGACCCAACC ACACCAAAAC ACTTACTAGC ATTGTTACCA TAGGCAATAA 1740 

AATCTCCCAC CGGTGTCGTC AAAATCTCAA CATCTCCGTT ATCCACGCCA TTCCCACCTT 1800 

CGAGATAGGT CGGCAAAAAA GCGACATAGG GATGGTCCAT TTCATAGAAA TTTTTGCCTT 1860 

CCTTGACCAA ATCCTTGATA TGAATCTTTT GAACCTCAAT CCCTTTGTAC TGGGACAAGA 1920 

GATAGTCTTT CAAGCGCGTC ACAAAACTTT CAGTGTTGCC ACTCAAGG 1968 



(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7172 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 

CCGCATTTTT TATCACTAGA CTCGAGACAT CTTTTGAGTG GCTCTTGCTC TCTGGTTTAA 60 

TTTTCTTCCT TGCTCAAGGA CTCCTGCTAT TTCTCTTGGT CGTCCGACTC AAACATCAAT 120 

TCGCTGAGAT TTATCCTCAA ATCAATAAAA AGATTCGCTT CTACTATTTA GGGGTTCTCA 180 

CCATTGATTT TCTATTTTTT GTTCTCTTAG CCTTCATTAG TTCTCAGCGT TTTTCATCTC 240 

TTATGCCAAT CATCACTGCT TGCCATTCTA CTTTTTATTA TATGACAGCT GACTACCTAA 300 

GAGAAAACTA TCCAGACTTT TACGACAAAC ACATCTCTTT ATGGGAGTGT CTCTAAAGAA 360 

AAGGAGGTTT TAGCATGAAA AAAATCATCT TCATCAAAAC CATTCAACTC CTTGTCATTG 420 

ATGGAATCAT GCTGGCATTT TTGACATTTA AAAGGGGGCT TACTTGGGAC TGGATTTTGA 480 

TTTATAGCGG TTGGCTCATT TTCTTTCATC CTGTGCTATT GACCTATCTT TCAAACCAAC 54 0 

TTTGTGACCA CTTTAGTTAA CTCTATTCCC AGATTAGACC GAGATTCTGG CGTTTTGCTT 600 

TACAAATTCT CCTATGGGAT AGCCTGATGA TTCTCTCCTT GGTGTCTTTA AGTGATATTC 660 

CACTTTTCCT TCAGGGAACT CTCCTCATCC TAGGACATCT CATCCCTTCC TATCGCATCT 720 

GCCAAAGCCT GAAAAGAGAC TTCCCCCAAG CATATCAAGA ACCGATTTCT TTTTGGAGTA 780 

TTTTATGATA GATGAGAAAG ACCAAGCCGA CTGGGCTTGG TCTTTCTTAT CTCTTTTTAG 840 

TATCTAGGAT AATGGTAACA GGTCCATTAT TAACCAGCTC AACCTGCATA TCTGCTCCAA 900 
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AGATGCCTGT 


CTGAACGGGC 


ACTTCTTGCG 


CTAATTTTTG 


ATTGAAAGCA 


TCATAGAAGT 


960 


CTGATGCCAT 


ATCAGGTTTA 


GCTGCCCCTG 


TAAAGgCTGG 


ACGATTGCCT 


CTCTTAGTAT 


1020 


CCGCAAAGAG 


GGTAAACTGA GAAATAGAGA 


GGATTTCTCC 


TTCAATATCT 


TTGACAGACA 


1080 


GGTTCATCTT 


GCCTTCTGCG 


TCTGAAAAAA 


TCCGCATATT 


GACCAGTTTT 


CTCACAGCAT 


1140 


AGTCCAAATC 


TTCCTCTTGG 


TCCTCTGGTC 


CAACACCAAC 


CAGCAATAAA 


AGTCCCTGAT 


1200 


TGATTTTTCC 


CTGAATCTGG 


CCTTCTATAC 


TCACTTGGGC 


TTTTTTAACC 


CGTTGGATAA 


1260 


TGATTTTCAT 


AATAGCCTTT 


CTAGTAAGAG 


CTAGGACAAC 


TAGCCGTTGG 


TCCGTTTGAC 


1320 


AGAGTAAACT 


TCTGGCACAC 


TCTTAATTTT 


ATCGACAACC 


GTGGTCAGTG 


TAGAGAGGTT 


1380 


GGCAATACCG AAGgACACAT GGATATTAGC 


AAACTTCATA 


TCCTTGGTTG 


GTTGGGCATT 


1440 


GACCGTTGAA 


ATATTCTTGG 


TTGTATTTGA 


AAGAACTTGC 


AGTACATCGT 


TCAACAGTCC 


1500 


TGTACGGTTG 


AGACCGTAGA 


TATCGATATG 


GGCCATATAC 


TCCTTATTTG 


AGCTAGGGTA 


1560 


CTGGTCTTCC 


CATTCCACAT 


CAAGGAGACG 


TTGCTCGTAG 


TTTTCTTGGG 


CACGCAGGTT 


1620 


CATACAGTCC 


ACACGGTGAA 


TAGCCACACC 


ACGACCCTTG 


GTAATGTAGC 


CAACAATATC 


1680 


GTCACCAGGC 


ACGGGGTTAC 


AACACTTAGC 


AATCCGCACT 


AGGAGACCAG 


AAGCACCTTC 


1740 


AATAACCACT 


CCCCCCTCAT 


GCTTGACCTT 


GAGGGTTTCT 


TTATTTTCAA 


CCTTGACCTC 


1800 


GCCACCTTTG 


ACAAGCTCCT 


CTGCCTCAGC 


TTTGGCCTTG 


GCACGCTCTT 


CCTCACGGCG 


1860 


TTCCTTTTCA 


GTCAGACGGT 


TAAAGACGGT 


AATCGCACCG 


ATTTCCCCAA 


AACCAATGGC 


1920 


CGCAAAGAGG 


GAGTCTTCTG 


TCTTGTAACT 


GGTCTTTTGC 


AGAACTTGAT 


CCATGTGGCG 


1980 


CTTGTCCATA 


AATTTATTTG 


CCACATAGCC 


ATTTTCTTGG 


AACTGAGCCA 


TCAGCATCTC 


2040 


ACGACCCTTG 


TTGACAGACA 


ATTCCTTATC 


TTGGTTTTTA 


AAGAACTGGC 


GAATCTTATT 


2100 


GCGCGCCTTG 


CTAGTCTTGA 


CCATATTGAG 


CCAGTCACGG 


CTAGGTCCAA 


AGGAGTTCGG 


2160 


GTTGGCGATA ATTTCAACCT 


GATCCCCTGT 


CTTTAACTTG 


GTTGTCAGTG 


GAACCATGCG 


2220 


GCCATTGACC 


TTGGCACCAG 


TTGCTTTTTC 


ACCGACCTTG 


GTATGGATTT 


CGTAGGCAAA 


2280 


ATCAATCGGT 


CCTGAATCTT 


TGGGAAGGGA 


ACGGACAGCT 


CCATCTGGGG 


TAAAAACGTA 


2340 


AATCTCCTCA 


GCCAAATAGT 


TTTCCTTAAC 


AGAGTCCACA 


AATTCCTTAG 


CATCATCAGC 


2400 


CTGGTCTTGG 


AGCTCCATCA 


TCTCCTTGAT 


CCAGTTCATT 


CCAATAGCTG 


ATTCCTTGCT 


2460 


GTTAACTTGC 


CCCTTTATAC 


CTTTCTTATA 


AGCCCAGTGA 


GCCGCAACCC 


CGTACTCAGC 


2520 


CACCTCGTGC 


ATTTCCTTGG 


TTCGAATCTG 


GAATTCAATC 


GGCCCTTTTG 


GTCCATAAAC 


2580 


AGTCGTATGG 


ATAGACTGAT 


AACCATTGGC 


CTTGCGGTTG 


GCGATATAGT 


CTTTGAAGCG 


2640 



WO 98/18931 



PCT/US97/19588 



848 

ACCTGGCATC GGTTTCCAAA ATTCATGCAC GTAACCAAGC ATGGCATAAA CATCACTTTG 2700 

GGTATCTAAA ATACAACGAA TAGCAATCAG ATCATAGATT TCCTCAAACC GTTTTCTCTT 2760 

GTCCTGCATT TTGCGGAAAA TTGAGTAAAT ATGCTTGGGA CGACCATAAA TCTTCCCTTT 2820 

CAAGTGACGT TCTGTCGTAT ACTCCTCTAA TTTTGTGACT ACCTCATCCA CCAAGGCCTC 2880 

ACGCTCCCTG CGCTTTTCCT TCATCATATG GGTAATCTTG TAAAACTCCG TTGGATTGAG 2940 

ATAACGGAAA GACAAGTCTT CTAATTCCCA TTTGACACTG GAAATCCCCA AACGATGGGC 3000 

AAGCGGGGCA TAGATTTCCA TGGTTTCTTT GGAAATACGC TCCTGCTTGT CTTTTCGAAG 3060 

ATGTTTCAGG GTCCGCATAT TGTGCAAGCG GTCAGACAGT TTGACCAAAA TAACGCGGAT 3120 

GTCCTCAGAC ATGGCCATGA GCATCTTGCG ATGATTTTCC GCTAATTGCT CCTCGATCGA 3180 

TTTGTACTCG ACCTTGCCAA GCTTGGTAAC TCCGTCAACA ATCATCCGCA CATCAGGACC 3240 

AAACTCTCTT TCCAAATCGT CCAAAGTCGC ATCTGTATCT TCCACCACAT CATGCAAGAA 3300 

TCCACAAGCT ACTGTTACAG CATCCAGCTT TAGCTTAGCT AAAATACCTG CCACTTGGAT 3360 

AGGGTGAATG ATATAAGGCT CGCCTGATTT GCGATATTGA CCACTGTGGC ATTCAACAGC 3420 

ATAGACCAAG GCCTTATGGA CAAAATGAAC ATCCTCTTCC GTTAAATATT CTTTGGTTAA 3480 

AGCGACAACT TCTTCGCCTG TTAAATTCAC TTCTTTCGGC ATCTCTACTC TCCAATTCTT 3540 

CCTACCATTT TATCACTTTT TTAAGAATAT GAAAACTAGA TTGGAACAGA ATAAGAAAAA 3600 

AATAATTCAA AATTGCTTGA TAATTCTGAA TTATTGGTCC GTAATATACT ACGAAGTTAG 3660 

ATTTTAAACT TAGGTGATAG AAGGAGAGAT AGAAGAACGG AAACCATATT GTAACCCAAA 3720 

GACTTTCTGA CTTCCCCAAT TCCATTGAAG ATACGAAAGA TAAACGGTGG AACTCGTATC 3780 

ACATACACTG GTACCTTGAC TGGATTTTGG AATTAATACT AAATGAAAAT CAAAGAGCAA 384 0 

ACTAGGAAAC TAGCCGCAGG TTACTCAAAG CACCGCTTTG AGGTTGCAGA TAAAGTTGAC 3900 

GCGGTTTGAA GAGATTTTTG AAGAGTATAA AAATCCTCAA GATACTTTCT TCTATCCTTT 3960 

AGTTTATAAG GAGAATACCT ATGAAAAAAA CTGCTATTTC TATCTTTGCT CTCCTAATGT 4020 

TAGGAGTTTG CTGCCTGTTC CTATTCAGCC AGCAAAGCTA TAAAAAACAG TCGTTCAATA 4080 

CTATGCTAAC GACCAGAACC TGCCCAGTAG GATAACTTAT AGTGAATATA GCGACAAATG 4140 

AGAAGCCAAC TACGGTAGCA CTCTAAACAT CACGTCTATC AAACAAGCTA ATGACGGAGT 4200 

TTATGCAACC TATGAAGGGC AATTGACACC TTTCCAATAT TGATAAATTG ATAACCAGCC 4260 

TGTCTTCATC TAGTCATGCT GGTTTTTAAG TTCATTTTAA ATCCTTACCT ATTCTCCCTA 4320 

ACTGTGCTAT ACTTAATTTA TACTCAATGA AAATCAAAGA GCAAACTAGA AAGCTAGCCG 4380 

CAGGCTGTTC AAAGCACTGC TTTGAGGTTG CAGATAAAGT TGACGCGGTT TGAAGAGATT 4440 
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TTCGAAGAGT ATTAGTACAT TCTTTGAGAT TGGAGCTAGT ATGAAAATCC ATAAAACCGT 4500 

GAATCCTGTT GCCTATGAAA ATACCTATTA TCTAGAAGGC GAAAAGCACC TCATCGTCGT 4560 

CGATCCTGGT AGTCATTGGG AAGCCATTCG TCAGACAATC GAGAAGATCA ACAAACCGAT 4620 

CTGTGCTATT CTCTTGACCC ACGCCCATTA TGACCATATC ATGAGTCTGG ACTTGGTTCG 4680 

CGAGACGTTT GGCAATCCTC CTGTCTATAT CGCAGAGAGC GAAGCCAGCT GGCTCTACAC 4740 

TCCTGTCGAT AATCTCTCCG GTCTCCCTCG CCACGATGAT ATGGCAGATG TGGTCACAAA 4800 

ACCTGCAGAA CACACCTTTG TCTTTCACGA AGAATACCAA CTAGAGGAAT TTCGTTTTAA 4860 

GGTTCTACCG ACCCCAGGGC ACTCTATCGG TGGTGTTTCC CTAGTCTTTC CTGATGCTCA 4920 

TCTAGTCTTG ACGGGAGATG CTCTATTCCG CGAAACTATC GGACGGACCG ACCTTCCGAC 4980 

TGGTAGCATG GAGCAACTCC TTCATAGTAT CCAGACCCAA CTCTTCACCC TACCAAACTA 5040 

CGATGTCTAT CCAGGACATG GTCCAGCTAC TACTATCGCT CACGAAAAGG CCTTCAATCC 5100 

CTTTTTCTAG CAAGATGATG ACAATCGAAA TTTAAGTAAA CTATCCAGCA AATCTTTCTA 5160 

TTACAAAAGG CATCCTATCA AGGTTTTCAC ACATGATTGG ATGCCTTTTT TCTGATGACT 5220 

AGATTTTTTG CATTACCAAA TAATCACGCG CTCCTCTGGT GAACGCCACA TTCCGTCTCC 5280 

TTCTTTGACA TCATAGGTTG TAAAGAAATC GTCGAAGTTT GGTACTTGCA CATTGACACG 5340 

GAGTTTGGCT GGTGCGTGCA CATCGACGCT AGCCAAAAGT TTCATAAATT CTGGTCGACC 5400 

TTTCATGCGC CAGATGCGAC CGAAGTTGTA GAAGAACTCT TCTGCTGAGA AGTCTGCTTC 5460 

TCTCTTAGCT GCTTCAAGCG CTGCTGCGAT TCCTCCCAAG TCAGCCACGT TTTCTGATAC 5520 

AGTCAATTTA CCGTTAATGG TTGCTCCATA AGAATCCTGT CCATCAAATT GGTCAATGAC 5580 

TTTTTGTGTT TTCTCCTTGA AGGCAGCATA GTCGCTCTCT GTCCACCAAT CCTTGAGGCT 5640 

ACCATTTTCG TCAAAGGAAG CCCCGTTAGT ATCAAAGGCG TGGGAAATTT CATGGGCAAT 5700 

CACTGCCCCA ATACCACCGT AGTTAGCAGA AGATGACTGA TGCAAGTCAT AGAAAGGCGC 5760 

CTGTAAAATG GCCGCTGGAA AGACAATCAG GTTCTTCTGA GGATTGTAGT AGGCATTGAC 5820 

CATATGAGCA GGCATGCCCC ATTCCTTATA ATCTACAGGC TGGTTCCACT TACTCCAACT 5880 

GTGCTTGATT TCCACACGCG CAAAGGCTAG AGCATTCTCA AAAAGACTGG CAGTTTCATT 5940 

CACTACCTTA TCCTTGTAAC GTGCAGGCAA TTCTTCTGGA TAGCCAATAT AAGGTTTGAT 6000 

CACATTGAGC TTCACGATAG CCTGTTTACA GGTTTCTGGA GTGAGCCAGT CATTCTTAAG 6060 

CAGACGCTCC TTATAAACAT CAATCATGGT TGCCACTTTT TTCTCCACAT CCGCCTTGGC 6120 

TTCTGGAGAG AACTTCTCAC GGGCGTACCA AAGACCCAGG GCTTGCTTGA AAGGTTCTTG 6180 
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TGCTAGATGA TAAGCTGCTT TGACCTTATC TTTTGCCTCT GGAACTCCAG AAAGGG C ACG 6240 

GCTGTAGGCA CCAGACAAAA CACGGATATC CTCTGTTAAA TAGCTGGTTG AAAGATTGAC 6300 

AACACTCAAA ATCAAGGTTG CTTTAAGGAG AGACCAGGCT TCCTCACTGT AGAATTGCTC 6360 

TGCTGCTTGC CAGAAACGTT CCTCGTCTAC AATAACCTTG TCTGGTAATT GCCCAATAAC 6420 

TGCTTTGAAG AAGTCATCCA AAGGTAGGGC AGGCGCGAAT TTCTTGAAAT CTTCGTAAGA 6480 

ATATGGATGA TAGAGTTTAG CATATTCTGA ACTTTCTTCA TTAGAGAGCA CCACTGCCGC 6540 

AACTCGGCGG TCCAATTCAA GTCTTTTTTC TAGCAAGTCT TCAATTTCTT CATCAGAGAA 6600 

ATCATAAGCC TTGAGGAGAT TTGCGCTGCT TTCTTTCCAA AGAGTCAAGA GCTCTTCGCG 6660 

CTGAGGATGT TCTTCTGCAT AGTAGGTCGT ATCTGGCAAG ATTGTGCTTG GAGCGCTAGC 6720 

CCATAGAACA TTGATTCTAG CATCCATAAA GTCTGGCGAT ACACCAAAAG GAAGGAAGTT 6780 

TGGTTTTCCT GCAAGCTCAA ACTCTGCTAG TTTAGCTGTA AAATCCGCAA AAGTCTCCAA 6840 

TTCTTGGAAT TCTTTAAGGA GTGGTAAGAC AGGTGTGATA CCGTCAGCTT CTCTCTTGTC 6900 

AAAATCACGA ACTAGGCGGT GGTATTTGAC AAAGTTTTCC AAGATAGCAT CCTCAGGCAC 6960 

TTCTTCACCT GCTAACCACT TGTCTGTTGT CGCCAGCATC AGGTCTTCAA TTTCCTGGTC 7020 

TAAATCAACA AAACCTCCTG TTTGAGACTT ATCTGCTGGG ATTTCAGCTG TCTGTTGCCA 7080 

TTCTCCATTG ATAGCATCAT AAAAATCATC TTGATAACGT GTCATCTTGT TCTCGCTTTC 7140 

ATTTGTATTT GCATTTATCT TAACAAAAAT CG 7172 



(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4518 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(DJ TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 

CGGGAAGTTA TGCGATCTAG ACTTCGTTCC TGTACAGCTA CTTTCTCAGG TGGTCTTGTT 60 

GTTTGTATGA GTTTGTTTAG AGAGGATCTT TCTATGTCTT TCTTTCTTAT TTTTGTTTTA 120 

TATGCTTTTC TGATTTCTTA TCTAATTTAT GGTTATTTCA GACTAAAAAG GAAATACCGA 180 

GTAGATGAAT AGCAAGGTTC TAGGTCTTCA GATTGATTTT TAGCACTCTT GATAAAAGAG 240 

TGCTAATTTT TTGAGTTTTT GTCTTGACAT TCTCTTCTAA GGGTGTATAA TAGAATCATG 300 

AGTTAGCACT TGGATGCATT GAGTGCTAAT TGATCAGACA GAGAGGAGTG ATGAGATGGT 360 

TACAGAGCGT CAGCAGGATA TTTTAAATCT GATTATTGAC ATCTTTACCA AAACGCACGA 420 
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ACCTGTCGGA TCAAAAGCCT TGCAAGAGTC TATTAACTCT AGCAGTGCAA CCATTCGTAA 480 

TGACATGGCG GAACTAGAAA AACAAGGGTT GCTTGAGAAG GCTCATACTT CAAGTGGTCG 540 

GATGCCAAGT GTTGCTGGTT TTCAGTACTA TGTGAAACAC TCACTGGATT TTGACCGGCT 600 

GGCTGAAAAT GAGGTATATG AGATTGTCAA AGCCTTTGAT CAGGAATTCT TCAAATTGGA 660 

GGATATTCTG CAAGAGGCTG CTAACTTACT AACAGACCTG AGTGGCTGTA CGGTAGTGGC 720 

ACTGGATGTT GAGCCGAGCA GGCAACGTTT GACAGCCTTT GATATCGTTG TTTTGGGGCA 780 

ACATACAGCC TTGGCGGTAT TTACCCTAGA CGAGTCGCGA ACGGTTACTA GTCAGTTTCT 840 

GATTCCAAGG AACTTCTTGC AGGAGGATTT GCTGAAACTG AAGAGCATCA TTCAGGAACG 900 

TTTCCTCGGT CACACCGTTT TAGATATTCA CTACAAGATT CGGACGGAGA TTCCGCAGAT 960 

TATCCAGCGT TACTTTACAA CAACGGATAA TGTCATCGAT CTCTTTGAAC ACATCTTTAA 1020 

GGAAATGTTC AACGAAAACA TTGTGATGGC GGGCAAGGTC CATCTCTTGA ATTTTGCCAA 1080 

TCTAGCAGCC TATCAGTTCT TTGACCAACC GCAAAAGGTG GCCTTGGAGA TTCGTGAGGG 1140 

GTTGCGTGAG GATCAGATGC AAAATGTTCG TGTTGCAGAC GGTCAAGAGT CCTGTTTAGC 1200 

TGACCTAGCG GTAATCAGTA GTAAGTTCCT CATTCCTTAT CGGGGAGTTG GAATTCTAGC 1260 

CATTATCGGT CCAGTTAATC TGGATTACCA ACAGCTAATC AATCAAGTCA ATGTGGTCAA 1320 

CCGTGTTTTG ACCATGAAGT TGACAGATTT TTACCGCTAC CTCAGCAGTA ATCATTACGA 1380 

AGTACATTAA GATTGAAATC ATTAAAGGAG GCGAACATGG CCCAAGATAT AAAAAATGAA 1440 

GAAGTAGAAG AAGTTCAAGA AGAGGAAGTT GTGAAAACAG CTGAAGAAAC AACTCCTGAA 1500 

AAGTCTGAGT TGGACTTGGC AAATGAACGT GCAGATGAGT TCGAAAACAA ATATCTTCGC 1560 

GCTCATGCAG AAATGCAAAA TATCCAACGC CGTGCCAATG AAGAACGTCA AAACTTGCAA 1620 

CGTTATCGTA GCCAGGACTT GGCAAAAGCA ATCTTACCAT CTCTTGACAA CCTTGAGCGT 1680 

GCACTTGCAG TTGAAGGTTT GACAGATGAT GTGAAGAAGG GCTTGGGGAT GGTGCAAGAA 1740 

AGCTTGATTC ACGCTTTGAA AGAAGAAGGA ATTGAAGAAA TCGCAGCAGA TGGCGAATTT 1800 

GACCATAACT ACCATATGGC CATCCAAACT CTCCCAGCAG ACGATGAACA CCCAGTAGAT 1860 
ACCATCGCTC AAGTCTTTCA AAAAGGCTAC AAACTCCATG ACCGCATCCT ACGCCCAGCA • 1920 

ATGGTAGTGG TGTATAACTA AGATATAAAG CCCGTAAAAA GCTCGCAGTA AAAATAGGAG 1980 

ATTGACGAAG TGTTCGATGA ACACAAGAAA ATCTATCTTT TTTACTCAGA GCTTAGGGCG 2040 

TGTTCGATTC GGCAATTCTG ACGGTAGCTA AAGCAACTCG TCAGAAAACG GCAATCGCTA 2100 

TGGCGTTTGC CTAGCTTCCT TACTAACTCG TCGTCGAAAT AAAATCGATT TCGACTCCTC 2160 
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GTGTCGCAAT TTACATAATA GAAAACTTGT CCGAAACGAC AATAAACTAT GAAGAAAGAT 2220 

AAAATATGTT TGGCTTTGTA ATAGTGAGCG AAGCGAACCA AACACGATAC TCTTCGCCGT .2280 

GGCGCTATTT GCGCAAATTT TGAGACCTTA GGCTCAAAGT TTAGTCAAAG AGATTGACGA 2340 

AGTCAAGCTC TGACGGCGTC GCCACTGTCG CCACTTAAGA AGAGTATCAA AAAGAAAAAT 2400 

AGAAAATTAA CTAACAAGGA GAAAAACACA TGTCTAAAAT TATCGGTATT GACTTAGGTA 2460 

CAACAAACTC AGCAGTTGCA GTTCTTGAAG GAACTGAAAG CAAAATCATC GCAAACCCAG 2520 

AAGGAAACCG CACAACTCCA TCTGTAGTCT CATTCAAAAA CGGAGAAATC ATCGTTGGTG 2580 

ATGCTGCAAA ACGTCAAGCA GTTACAAACC CAGATACAGT TATCTCTATC AAATCTAAGA 2640 

TGGGAACTTC TGAAAAAGTT TCTGCAAATG GAAAAGAATA CACTCCACAA GAAATCTCAG 2700 

CTATGATCCT TCAATACTTG AAAGGCTACG CTGAAGACTA CCTTGGTGAG AAAGTAACCA 2760 

AAGCTGTTAT CACAGTTCCG GCTTACTTCA ACGACGCTCA ACGTCAAGCA ACAAAAGACG 2820 

CTGGTAAAAT TGCTGGTCTT GAAGTAGAAC GTATTGTTAA CGAACCAACT GCAGCAGCTC 2880 

TTGCTTATGG TTTGGACAAG ACTGACAAAG AAGAAAAAAT CTTGGTATTT GACCTTGGTG 2940 

GTGGTACATT CGACGTCTCT ATCCTTGAAT TGGGTGACGG TGTCTTCGAC GTATTGTCAA 3000 

CTGCAGGGGA CAACAAACTT GGTGGTGACG ACTTTGACCA AAAAATCATT GACCACTTGG 3060 

TAGCAGAATT CAAGAAAGAA AACGGTATCG ACTTGTCTAC TGACAAGATG GCAATGCAAC 3120 

GTTTGAAAGA TGCGGCTGAA AAAGCGAAGA AAGACCTTTC TGGTGTAACT TCAACACAAA 3180 

TCAGCTTGCC ATTTATCACT GCAGGTGAGG CTGGACCTCT TCACTTGGAA ATGACTTTGA 3240 

CTCGTGCGAA ATTTGACGAT TTGACTCGTG ACCTTGTTGA ACGTACAAAA GTTCCAGTTC 3300 

GTCAAGCCCT TTCAGATGCA GGTTTGAGCT TGTCAGAAAT CGACGAAGTT ATCCTTGTTG 3360 

GTGGTTCAAC TCGTATCCCT GCCGTTGTTG AAGCTGTTAA AGCTGAAACT GGTAAAGAAC 3420 

CAAACAAATC AGTAAACCCT GATGAAGTAG TTGCTATGGG TGCGGCTATC CAAGGTGGTG 3480 

TGATTACTGG TGATGTCAAG GACGTTGTCC TTCTTGATGT AACGCCATTG TCACTTGGTA 3540 

TCGAAACAAT GGGTGGAGTA TTTACAAAAC TTATCGATCG CAACACTACA ATCCCAACAT 3600 

CTAAATCACA AGTCTTCTCA ACAGCAGCAG ACAACCAACC AGCCGTTGAT ATCCACGTTC 3660 

TTCAAGGTGA ACGCCCAATG GCAGCAGATA ACAAGACTCT TGGACGCTTC CAATTGACTG 3720 

ATATCCCAGC TGCACCTCGT GGAATTCCTC AAATCGAAGT AACATTTGAC ATCGACAAGA 3780 

ACGGTATCGT GTCTGTTAAG GCCAAAGACC TTGGAACTCA AAAAGAACAA ACTATTGTCA 3840 

TCCAATCGAA CTCAGGTTTG ACTGACGAAG AAATCGACCG CATGATGAAA GATGCAGAAG 3900 

CAAACGCTGA AGCCGATAAG AAACGTAAAG AAGAAGTAGA CCTTCGTAAT GAAGTAGACC 3960 
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AAGCAATCTT TGCGACTGAA AAGACAATCA AGGAAACTGA AGGTAAAGGC TTCGACGCAG 4020 

AACGTGACGC TGCCCAAGCT GCCCTTGATG ACCTTAAGAA AGCTCAAGAA GACAACAACT 4080 

TGGACGACAT GAAAACAAAA CTTGAAGCAT TGAACGAAAA AGCTCAAGGA CTTGCTGTTA 4140 

AACTCTACGA ACAAGCCGCA GCAGCGCAAC AAGCTCAAGA AGGAGCAGAA GGCGCACAAG 4200 

CAACAGGGAA CGCAGGCGAT GACGTCGTAG ACGGAGAGTT TACGGAAAAG TAAGATGAGT 4260 

GTATTGGATG AAGAGTATOT AAAAAATACA CGAAAAGTTT ATAATGATTT TTGTAATCAA 4320 

GCTGATAACT ATAGAACATC AAAAGATTTT ATTGATAATA TTCCAATAGA ATATTTAGCT 4380 

AGATATAGAG AATTATATTA GCTGAACATG ATAGTTGTAT CAAAAATGAT GAAGCGGTAA 4440 

GGAATTTTGT TACCTCAGTA TTGTTGTCTG CATTTGTATC GGCGATGGTA CCGTATCTGA 4500 

CGAACGTTCA GCTTATAT 4518 



(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8145 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION : SEQ ID NO: 122: 

TGCTATTTTC GATTCCCTTG GGCGTTTTGA TTGCCTTTGC CTTGCAAGTC CATTGGAAGC 60 

CCCTCCATTA TCTGATTAAC ATTTACATCT GGGTTATGCG AGGAACCCCC TTACTCTTGC 120 

AACTGATTTT TATCTATTAT GTGCTCCCAA GTATTGGGAT TCGTTTAGAC CGCCTTCCTG 180 

CAGCTATTAT TGCCTTTGTT CTCAACTATG CAGCTTACTT TGCAGAAATT TTCCGTGGGG 240 

GAATTGACAC TATTCCAAGA GGACAGTATG AGGCCGCCAA GGTCTTGAAG TTTAGCCCTT 300 

TTGACAGAGT GCGCTATATT ATCTTGCCCC AAGTGACCAA GATCGTTCTT CCTAGTGTCT 360 

TTAATGAAGT TATGAGTTTG GTCAAGGATA CTTCTTTGGT CTATGCTCTC GGAATTTCAG 420 

ACCTTATCTT GGCTAGTCGA ACAGCTGCTA ACCGCGATGC TAGTCTAGTT CCTATGTTCT 480 

TGGCAGGAGC CATTTATTTG ATTTTGATTG GGATTGTGAC AATTATTTCC AAAAAAGTTG 540 

AGAAGAAGTA TAGTTATTAT AGATAGGAGG CTGCCATGTT AGAATTACGA AATATCAATA 600 

AAGTCTTTGG AGACAAACAA ATCCTGTCTA ATTTCAGTCT AAGTATTCCT GAAAAGCAAA 660 

TCCTGGCTAT CGTTGGACCT TCTGGTGGAG GTAAGACAAC TCTTTTACGT ATGCTTGCAG 720 

GTCTTGAAAC CATTGATTCA GGGCAAATCT TTTATAATGG ACAACCTTTA GAGCTGGATG 780 
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AATTGCAGAA 


GCGCAATCTA 


CTGGGATTTG 


TCTTCCAAGA 


TTTTCAACTA 


TTTCCTCATC 


840 


TATCAGTTCT 


GGAAAATTTG 


ACTTTATCGC 


CTGTGAAGAC 


CATGGGAATG 


AAGCAGGAAG 


900 


AGGCTGAGAA 


GAAGGCGAGT 


GGACTCTTGG 


AACAGTTAGG 


ACTAGGAGGA 


CACGCAGAGG 


960 


CCTATCCTTT 


CTCACTATCT 


GGTGGGCAAA 


AGCAGCGGGT 


GGCTTTGGCG 


CGTGCTATGA 


1020 


TGATTGACCC 


AGAAATCATT 


GGCTACGATG 


AACCAACTTC 


TGCCCTGGAT 


CCAGAATTAC 


1080 


GTTTGGAAGT 


GGAGAAGCTA 


ATCTTGCAAA 


ATAGGGAACT 


TGGGATGACC 


CAGATTGTGG 


1140 


TTACCCATGA 


TTTGCAGTTT 


GCTGAAAATA 


TCGCAGATGT 


ATTATTGAAA 


GTAGAACCTA 


1200 


AATAGGAGGA 


AAAATGGATG 


AAAAAATGGA 


TGCTTGTATT 


AGTCAGTCTG 


ATGACTGCTT 


1260 


TGTTCTTAGT 


AGCTTGTGGG 


AAAAATTCTA 


GCGAAACTAG 


TGGAGATAAT 


TGGTCAAAGT 


1320 


ACCAGTCTAA 


CAAGTCTATT 


ACTATTGGAT 


TTGATAGTAC 


TTTTGTTCCA 


ATGGGATTTG 


1380 


CTCAGAAAGA 


TGGTTCTTAT 


GCAGGATTTG 


ATATTGATTT 


AGCTACAGCT 


GTTTTTGAAA 


1440 


AATACGGAAT 


CACGGTAAAT 


TGGCAACCGA 


TTGATTGGGA 


TTTGAAAGAA 


GCTGAATTGA 


1500 


CAAAAGGAAC 


GATTGATCTG 


ATTTGGAATG 


GCTATTCCGC 


TACAGACGAA 


CGCCGTGAAA 


1560 


AGGTGGCTTT 


CAGTAACTCA 


TATATGAAGA 


ATGAGCAGGT 


ATTGGTTACG 


AAGAAATCAT 


1620 


CTGGTATCAC 


GACTGCAAAG 


GATATGACTG 


GAAAGACATT 


AGGAGCTCAA 


GCTGGTTCAT 


1680 


CTGGTTATGC 


GGACTTTGAA 


GCAAATCCAG 


AAATTTTGAA 


GAATATTGTC 


GCTAATAAGG 


1740 


AAGCGAATCA 


ATACCAAACC 


TTTAATGAAG 


CCTTGATTGA 


TTTGAAAAAC 


GATCGAATTG 


1800 


ATGGTCTATT 


GATTGACCGT 


GTCTATGCAA 


ACTATTATTT 


AGAAGCAGAA 


GGTGTTTTAA 


1860 


ACGATTATAA 


TGTCTTTACA 


GTTGGACTAG 


AAACAGAAGC 


TTTTGCGGTT 


GGAGCCCGTA 


1920 


AGGAAGATAC 


AAACTTGGTT 


AAGAAGATAA 


ATGAAGCTTT 


TTCTAGTCTT 


TACAAGGACG 


1980 


GCAAGTTCCA 


AGAAATCAGC 


CAAAAATGGT 


TTGGAGAAGA 


TGTAGCAACC 


AAAGAAGTAA 


2040 


AAGAAGGACA 


GTAAGATAAA 


ATAGTGGCTG 


AAACTGCGTT 


TTGATTAGCA 


AAACGTAGTT 


2100 


TTTTTTGTAA 


TCTAGGAAAA 


CGATAATAGC 


GATTGAATAT 


GGATAATTGA 


ATATGGAATA 


2160 


GCCCACTGTG 


ATTTCTAAAA 


CATTGTTAAA 


AATTGATTTG 


ACTTCCAAAA 


TTAAAATGTT 


2220 


CTGTAATGAA 


ATACTGATGT 


AACTGTTTTA 


GGAACAATAA 


AACGCATAAT 


ATCAAGGTTT 


2280 


TTGCACCTTA 


CATTATGCGT 


TTTTGTGATT 


TTAAGACTTG 


TTAGCTGATT 


TTTTACAATC 


2340 


CTGCGAAATC 


TTTGATTTCT 


TGTGCTGACA 


TTGAAGAGTC 


GCAACGGACG 


TTGATTTGTC 


2400 


CATCTGTAAT 


ATGAACAAAA 


CCTGGTACAG 


TTGGGATTCC 


ATAGCGTGAG 


CGGAATGCTT 


2460 


GCAAATCATT 


GAGTTGGCTT 


GGTTCTTCAC 


TATTGATGAA 


GTAAATGTGA 


GCTTTGGTTT 


2520 


CAGCTACGAC 


ACCTGACAAT 


GTACCTGCAA 


ATTTACGGCA 


GTAAGGGCAA 


GTTTTGCGAC 


2580 
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CGATAAAGAA GGTTGCAGTT TCTTTTTTAT CAAGAGCTTC TTGCGCACGC ACAACTGTAG 2640 

TGACTTCAAG GTCTTTGATG TTATCTAAAA ATTGTTCCAT GAGATTACCT CGCTTTCATT 2700 

GATAAGTCTA GTATGCCATA AAGTTTCTAA AATTGCTTAG ATTTGATACG AAAAAAGATG 2760 

AGGTTGGTTG GTCTCATCTT TTATAGGTCT TTATTTTACA AATGCATTGA TTTCTGCTTC 2820 

GATGTTAGCA ATCTTAGCTT GTGATTCTTC GTTGGTTTCC CCTACAACTG CAATGTAGAA 2880 

CTTGATTTTT GGTTCTGTAC CTGAAGGGCG AACGGCAATC CATGAACCGT CAGCAAGTGT 2940 

GTATTTCAAC ACATCACTTG GAGGAGTTGT CAAGTTTGTA ACAGTACCGT CAGCAACAGT 3000 

AGCAGTTTGT GCCTTGAAGT CTTCTACGAC AGTGATAGCT GTTGCGTTCC ATTCTGTTGG 3060 

AGCATTGTTG CGGAATTTAG CCATAATCGC TTTGATTTGT TCAGCACCAT CGACACCTGA 3120 

AAGAGTAACA GAGATTGTTT TTTCTGCGTA GTAGCCATAT TCTTTATAGA TTTCTTCGAT 3180 

ACCGTCAGCA AGTGTCAAAC CACGAGAACG GTAGTAGGCA GCAAGTTCAG CAACTACAAG 3240 

AACGGCTTGG ATGGCATCTT TATCACGTAC AAATGGTTTA ATCAAGTAAC CGAAGCTTTC 3300 

TTCAAATCCC ATCATGTAAG TGTGGTTGTG TTTTTCTTCG AATTCTTGGA TTTTTTCAGC 3360 

GATAAATTTG AAACCTGTCA AGACGTTGAA CATAGTTGCG CCGTAGCTTT CAGCAATCTT 3420 

CGTTACCAAG TCAGTTGAAA CGATAGATTT GCAGAGAGCG GCATTTTCAG GAAGAGTTCC 3480 

AGCGTTTTTG TGAGCTTCCA AGATGTATTT AGCCATGATA GCACCGATTT GGTTACCTGA 3540 

AAGGTTGAGG TAGCTACCAT CTTTTTGAAG AACTTCAACA CCAACACGGT CAGCGTCTGG 3600 

GTCAGTTGCG ACAAGAACAT CTGCACCAAC TTGACGACCA AGTTCTTCAG CAAGGGCAAA 3660 

GGCTGCTTGG CTTTCTGGGT TTGGAGATGT TACAGTTGAA AAGTCTGGGT CAGCAGTTGC 3720 

TTGCGCTTCA ACAACTTGAA CAGAGTCAAA TCCTGCTTGG GCAAGAGCAC GACGAGCCAA 3780 

CATTTCACCA GTACCATGAA GTGGTGTGTA GACAATCTTC ATGTCTTTAC CAAATTCTTC 3840 

AATCAAGGCT GGGTTGATGT TTATGTCCTT AACCTCTTTA AGGTATTCTA TGTCAACAGC 3900 

TTCGCCGATA ACTTCAATCA AGCCAGAAGC TTTTTCAGTT TCCACATCAG CAACTTCAAC 3960 

TGCAAATGGG TTTTCGATTG CACGGATATA AGTAGTCAAA GCGTCCGCAT CGTGTGGAGG 4020 

CATTTGTCCA CCGTCTTCAC CGTAAACCTT GTAACCGTTA AATGGAGCAG GGTTGTGGCT 4080 

GGCTGTGACC ATGATACCTG CGAAACAGTT GAGATGACGA ACTGCAAATG ATAGTTCTGG 4140 

AGTCGGACGA AGGCTTTCAA ATACGTAAGA TTTGATGCCG TGTTTAGCAA GAACTGCCGC 4200 

AGATTCAAAG GCAAACTCAG GTGAGAAGTG ACGGCTATCG TAGGCAATTG CTACACCGCG 4260 

TTCTTTCTCG TTTCCACCTT TTGACTCAAT CAAACGAGCC AATCCTTCAG TAGCTTGGCG 4320 
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AACAACGTAG ATGTTGATAC GGTTTGTACC AGCACCAACC AAGCCACGCA TACCTGCAGT 4380 

ACCAAATTCA AGATTTGTAT AGAAGGCATC TTCCTTAGTT TTTTCGTCCA TATTTTCCAA 4440 

ATCTTGACGA AGGTAGTCAC GAAGCTCCAC AAAATCAACC CATTTCTGGT AATTTTCTTG 4500 

GTAAGACATT CAAATTCTCC TTTATTTTTA AAACATTTAA TCAGTTTAAT TATATCATTT 4560 

TTTTTAGTTT TAGTAAAACC TTATCTGCTT CGAACATCTC TTCAAACCAG GTCAGATTGA 4620 

ATTTTGGGGT TATATGATGT TGAGGCTAGG AAAAATTCAA TTTCAGTAAA AAAAGTAAGT 4680 

CTTCTCATAA CAAAACATTG ATATAGTTAC TTAGTTTTAA ACAAGCATAT TATAATAAAG 4740 

CTATGGCATA TAGTACTGAT TTTAAACAGC GAGCATTAGA TTACATCAAA GAGGGGCACA 4800 

GCCATGTCGA GGCAGCCAAG TTTTTTGGTG TTGGCGTCAG AACTCTCTTC ACGTGGGAAA 4860 

AGAAAGACGT GAACAAGAAC ACATAGAGAG GAAAAAGCGA GTCGTCAAAA ACCGAAAGAT 4920 

TCCTTTAGAG GAATTGAAAG CCTTTGTAGA GGCTCATCCA GATGCTTTTT TACGGGAAAT 4980 

TGCGGCACAT TTTGATTGTG CTGTTCCTTC AGTATGGGCA GCTTTAAAGC AGATTAAGGT 5040 

CACTTTAAAA AAAGATGACG AGCTTTAAGG AACAAGACCC AGAAAAGTAG CCTTATTTCT 5100 

TAAGAATTTT AATAGTTTAA AGCACCTAGC ACCTGTTTAT ATTGATGAAA CAGGAATCGA 5160 

CCGCTATCTC TATCGTCCTT ATGCAGGGGC TCCTAGAGGG GAGAAAGTCT ATGAAAAGAT 5220 

TAGCGGACGT CGTTTTGAGC GAACTTCAAT TGTTGCAGGA CAAGTAGACG GAGAGTTTAT 5280 

AGCTCCCATG ATTTACAAGA AAAGCATGAC AAGCGATTTC TTTGTGGAGT GGTTCAAAAC 5340 

GCAACTCCTA CCTGCTTTGA AGACACCTCA TGTTATTGTC ATGGGCAATG CTGGTTTTCA 5400 

TCCCAAGAAC ATTTTGGATG AACTCTGCAT CCAAGATAAA CACTTTTTCT TACCTCTACC 5460 

ACCTTATTCA CCGGATTTGA ATCCTATTGA GCAAGCTTGG GCTATCTTGA AAAAGAAAGT 5520 

GACGGATGTA TTAAGGGAAG TTCCAACTAT TTTTGAATGT TTGGAATGCT TTTTTAAAAC 5580 

TAGATGACTA TAACGGTTCT AAAGGAACCT ATCGAGTAGT CATTAAAACT AAGGATACTG 5640 

CTGGTTAAGA GAAGACGGTA TACAATCAAA CCATTCACCG TGTAGCCGAA ATCGTTCAGA 5700 

ATGAAGACTT GTATCAGAAT GAAGACTTGT ATAAGAAAGG TTTGAATGTT GAACTTGCGC 5760 

ACCAACAAAT TAAGGGATTT TTTGAAGCAG AGTTTAAAAA TCGTATTAAT GGAGTTCTTA 5820 

ATACTAAAAT AAAAAATAGT ACATTAAATC GTGTAAATAA AAAAACTATA CACCAGAGCA 5880 

ACAAAAACTC CATGATCAAT TTGAAGCAGA AGCAACGGAA GATGCTAAAA AACAAGGCGA 5940 

TATTGTGTTG AATGTTGACC AGGATTTCAT GAGCATATCT AAGTCTAATA AAAGTGGTTC 6000 

AGACTGGAAG AAAACTTTCA CAGTGAGGAT AACCAATAGG CTAGCAAATG ACTTGAATAA 6060 

TGTCTTGAAA CAGGTTGATA AAGATACTCC TAATACCCCA ACTTGGCTAA ACTCAGCTGC 6120 
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TTCTAAAGCT AAAGATGATG ACAGAGTATA TAAACTACTG AAGACTCTTA TACCAGGAGA 6180 

AAATTACCTA TCATGTTAAG GATAATCAGC TAGAAGTAGA AACAGATAAA TACACATATA 6240 

CTGCCGCTAG AAATGGTAGT AAGGAAGTTG GTATTCAAGA GTCAGATATA GCAGCAACTC 6300 

TAAGTGCCGA TGAATATAAT TCTAATCGCC AAACTTTTGA GAGAGAATAC AAATACAAAA 6360 

GCAAATGCCC TTAATAATGG TTGGGCTAGA TCTGGTTCTG AAGAGTTCAA AAAGTTCTCC 6420 

CACTTTGTAG GGGTAGACAA AGGGATTGTG CGAACGAATG TACTGACTGG TAAAAAACTA 6480 

TCTGATAAGA TTAGGAAAGA AGTGGGCTCT GGAGATAGCA AACTAGGAAA AGGCGGCTAT 6540 

TTCTCTACTG GGGATGTTCT ATTAGGAAAA GATGTTGTTT CTTATACCGT ACAAGTATTT 6600 

TCAGAGAATA ATGAAAGAGT AGGAGTAAAC ACTCAAAGTC ACCGTGTTCA GTATAATCTC 6660 

CCAATTCTAG CTGACTTTTC AGTCATCCAA GATACTGTGG AACCATCACG AACCGTTGTT 6720 

GAAAAAATCA TTCCAAAACT AAATATTCCC GAAGAAGAGA AAGGGAAAAT AACCGAAGAA 6780 

ATCAAGAAAA AGAAAAAAAC CTCAGAATTG GCAGAACTAA TCTCAGAAAA TGTGAAAGTT 6840 

CGCTATGTTG ATGAACAAGG GCGTTTGCTA TCATTGAAAA ATGATACTGG AATTGGAGAA 6900 

AAAGAAAGTG ACGGAACCTA CATTACCAAT AAAAAACAAC TGATTGGTAC CAGCTATAAT 6960 

GTCACAGATA AAAAACTCAG TAGCATGACT ACTACTGACG GAAAATATTA TACTTTTAAA 7020 

GAAGCAGATA CAAATTCTGC AAGTTTAACT GGGAATATTG TAAGCGAAGG TAGAACAGTG 7080 

ACCTTAGTTT ATAGAGAAAG CGAAGCGCCA ACCACTGCTA CAGTAACAGC CAATTACTAT 7140 

AAAGAAGGTA GGCAAGAGAA GTTGGTAGAG TCTGTTATAA AAGCTGATTT AGCGATAGGT 7200 

TCTGAGTATA CCACAGAATC AAAAACTATT GAAGGGAAAA CAACAACTGA GGACAAAGAA 7260 

GACCGAGTTA TCACAAGGAA AACAACATAC ACCTTGGTAG CAACTCCTGA AAATGCGTAC 7320 

CAGAAGACGG TGCAACAGTT GACTATTACT ACCGTGAGAA TGTTGAGGAA ACAGTGGTTC 7380 

CCAAAACAGC AACCTCTACT GAGACGAAGA CTATAACGCG TATCATTCAT TACGTTGATA 7440 

AAGTTACGAA CCAAAATGTA AAAGAAGATG TTGTTCAACC TGTAACCTTA AGCCGTACAA 7500 

AAACTGAGAA CAAGGTCACG GGAGTTGTAA CCTACGGTGA ATGGACAACA GGAAACTGGG 7560 

ACGAGGTTAT ATCTGGTAAG ATTGACAAGT ACAAAGATCC AGATATTCCA ACAGTTGAAT 7620 

CACAAGAAGT TACGTCAGAC TCTAGTGATA AAGAAATAAC GGTAAGGTAT GACCGTTTAT 7680 

CAACACCAGA AAAACCAATC CCACAACCAA ATCCAGAGCA TCCAAGTGTT CCGACACCAA 7740 

ACCCAGAACT ACCAAATCAA GAGACTCCAA CACCAGATAA ACCAACTCCA GAACCAGGTA 7800 

CTCCAAAAAC TGAAACTCCA GTGAATCCAG ACCCAGAAGT TCCGACTTAT GAGACAGGTA 7860 
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AGAGAGAGGA ATTGCCAAAC ACAGGTACAG 1 AAGCTAATGC TACCTTGGCT AGTGCTGGTA 



7920 



TCATGACCTT GTTAGCTGGT CTAGGATTAG GATTTTTCAA GAAAAAAGAA GATGAAAAAT 



7980 



AATAGATTTT AGAATCTAGG AACCAGGAAA AGCTCACAGA TGTGGGCTTT TTTCCTGGTT 



8040 



TTGAGAACGA GGTCTTTCGT AAAGAATAAA AACGCTTACA AGTCTGTTGA ACTGGGAAAC 



8100 



TATGAATCCT ATTTTTTTAA AAATATTTCC AGAAATCAGT TGCGG 



8145 



(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8697 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 

CGGTACCGGG AACGATACTT AGTCTAATTT TGCACCTTTT CCATGTATGG TAAAGGTTTT 60 

TCTTTTTTTA AAAAGGAAAA CGAGAAGAGG AGGTTCTTAT GAAAGCAAGC ATTGCCTTGC 120 

AAGTTTTACC CCTAGTACAG GGGATTGATC GGATAGCTGT TATTGATCAG GTCATTGCTT 180 

ATCTGCAwAC TCAAGAAGTG ACGATGGTAG TG AC AC C ATT TGAAACGGTC TTGGAAGGGG 240 

AGTTTGATGA GCTTATGCGC ATTCTAAAAG AAGCGCTGGA AGTGGCAGGG CAGGAGGCAG 300 

ACAATGTCTT TGCCAATGTC AAAATAAATG TAGGAGAGAT TTTAAGTATT GATGAGAAAC 360 

TTGAGAAGTA TACTGAGACG ACACATTAGT CTATTGGGCT TTCTCGGAGT ATTGTCAATC 420 

TGGCAGTTAG CAGGTTTTCT TAAACTTCTC CCCAAGTTTA TCCTGCCGAC ACCTCTTGAA 480 

ATTCTCCAGC CCTTTGTTCG TGACAGAGAA TTTCTCTGGC ACCATAGCTG GGCGACCTTG 540 

AGAGTGGCTT TACTGGGGCT GATTTTGGGA GTTTTGATTG CCTGTCTTAT GGCTGTGCTC 600 

ATGGATAGTT TGACTTGGCT CAATGACCTG ATTTACCCTA TGATGGTGGT CATTCAGACC 660 

ATTCCGACCA TTGCCATAGC TCCTATCCTG GTCTTGTGGC TAGGTTATGG GATTTTGCCC 720 

AAGATTGTCT TGATTATCTT AACGACAACC TTTCCCATCA TCGTTAGTAT TTTGGACGGT 780 

TTTAGGCATT GCGACAAGGA TATGCTGACC TTGTTTAGTC TGATGCGGGC CAAGCCTTGG 840 

CAAATCCTGT GGCATTTTAA AATCCCAGTT AGCCTGCCTT ACTTTTATGC AGGTCTGAGG 900 

GTCAGTGTCT CCTACGCCTT TATCACAACT GTGGTATCTG AGTGGTTGGG AGGTTTTGAA 960 

GGTCTTGGTG TTTATATGAT TCAGTCTAAA AAACTGTTTC AGTATGATAC CATGTTTGCC 1020 

ATTATTATTC TGGTGTCGAT TATCAGTCTT TTGGGTATGA AGCTGGTCGA TATCAGTGAA 1080 

AAATATGTGA TTAAATGGAA ACGTTCGTAG AATTAGAATG TTTCTGAAAA AGAAAAGAGG 1140 
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AAATCAAAAT GAAGAAAACA TGGAAAGTGT TTTTAACGCT TGTAACAGCT CTTGTAGCTG 1200 

TTGTGCTTGT GGCCTGTGGT CAAGGAACTG CTTCTAAAGA CAACAAAGAG GCAGAACTTA 1260 

AGAAGGTTGA CTTTATCCTA GACTGGACAC CAAATACCAA CCACACAGGG CTTTATGTTG 1320 

CCAAGGAAAA AGGTTATTTC AAAGAAGCTG GAGTGGATGT TGATTTGAAA TTGCCACCAG 1380 

AAGAAAGTTC TTCTGACTTG GTTATCAACG GAAAGGCACC ATTTGCAGTG TATTTCCAAG 1440 

ACTACATGGC TAAGAAATTG GAAAAAGGAG CAGGAATCAC TGCCGTTGCA GCTATTGTTG 1500 

AACACAATAC ATCAGGAATC ATCTCTCGTA AATCTGATAA TGTAAGCAGT CCAAAAGACT 1560 

TGGTTGGTAA GAAATATGGG ACATGGAATG ACCCAACTGA ACTTGCTATG TTGAAAACCT 1620 

TGGTAGAATC TCAAGGTGGA GACTTTGAGA AGGTTGAAAA AGTACCAAAT AACGACTCAA 1680 

ACTCAATCAC ACCGATTGCC AATGGCGTCT TTGATACTGC TTGGATTTAC TACGGTTGGG 1740 

ATGGTATCCT TGCTAAATCT CAAGGTGTAG ATGCTAACTT CATGTACTTG AAAGACTATG 1800 

TCAAGGAGTT TGACTACTAT TCACCAGTTA TCATCGCAAA CAACGACTAT CTGAAAGATA 1860 

ACAAAGAAGA AGCTCGCAAA GTCATCCAAG CCATCAAAAA AGGCTACCAA TATGCCATGG 1920 

AACATCCAGA AGAAGCTGCA GATATTCTCA TCAAGAATGC ACCTGAACTC AAGGAAAAAC 1980 

GTGACTTTGT CATCGAATCT CAAAAATACT TGTCAAAAGA ATACGCAAGC GACAAGGAAA 2040 

AATGGGGTCA ATTTGACGCA GCTCGCTGGA ATGCTTTCTA CAAATGGGAT AAAGAAAATG 2100 

GTATCCTTAA AGAAGACTTG ACAGACAAAG GCTTCACCAA CGAATTTGTG AAATAATGAC 2160 

AGAAATTAGA CTAGAGCACG TCAGTTATGC CTATGGTCAG GAGAGGATTT TAGAGGATAT 2220 

CAACCTACAG GTGACTTCAG GCGAAGTGGT TTCCATCCTA GGCCCAAGTG GTGTTGGAAA 2280 

GACCACCCTC TTTAATCTAA TCGCTGGGAT TTTAGAAGTT CAGTCAGGGA GAATTGTCCT 2340 

TGATGGTGAA GAAAATCCCA AGGGGCGCGT GAGTTATATG TTGCAAAAGG ATCTGCTCTT 2400 

GGAGCACAAG ACGGTGCTTG GAAATATCAT TCTGCCCCTC TTGATTCAAA AGGTGGATAA 2460 

GGCAGAAGCT ATTTCCCGAG CGGATAAAAT TCTTGCGACC TTCCAGCTGA CAGCTGTAAG 2520 

AGACAAGTAT CCTCATGAAC TTAGCGGTGG GATGCGCCAG CGTGTAGCCT TACTCCGGAC 2580 

CTACCTTTTT GGGCACAAGC TCTTTCTCTT AGATGAGGCC TTTAGCGCCT TGGATGAGAT 2640 

GACAAAGATG GAACTCCACG CTTGGTATCT TGAGATTCAC AAGCAGTTGC AGCTAACAAC 2700 

CCTGATCATC ACGCATAGTA TTGAGGAGGC CCTCAATCTC AGCGACCGTA TCTATATCTT 2760 

GAAAAATCGC CCTGGGCAGA TTGTTTCAGA AATTAAACTA GATTGGTCTG AAGATGAGGA 2820 

CAAGGAAGTC CAAAAGATTG CCTACAAACG TCAAATTTTG GCGGAATTAG GCTTAGATAA 2880 
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GTAGAAAAAT AGGGAGTTGG TGAAGATTAT CCTTTACCAG CGCCCTTTTT CTTTTAAAAA 2940 

TGAGAAAATT TCGGTATAAT AGTCAAACAA GGTCAAGGTT TAAAGAGAGA GGTGGGTTTG 3000 

TTATGAGATT TAAAAATACA TCGGATCATA TTGAGGCCTA CATCAAGGCG ATTTTAGATC 3060 

AATCTGGTAT CGTGGAGTTG CAACGGAGTC AGTTGGCAGA TACCTTTCAG GTTGTTCCTA 3120 

GTCAGATTAA CTACGTGATC AAGACACGCT TTACGGAAAG TAGAGGCTAC TTGGTTGAAA 3180 

GTAAGCGTGG TGGCGGAGGC TACATTCGTA TAGGACGGAT TGAGTTTTCT AGTCATCATG 3240 

AAATGCTCCG GGAGCTGCTT TACTCGATTG GTGAGCGAGT CAGTCAAGAA ATTTATGAGG 3300 

ATATTCTCCA GCTTTTGGTT GAGCAGGAAT TGATGACCAA GCAGGAGATG AATTTGCTAG 3360 

AATCAGTAGC TTTGGATCGC GTTTTAGGAG AAGAAGCTCC AGTTGTTCGA GCAAACATGC 3420 

TACGTCAGAT CATACAAGAG GTAGATAGAA AAGGGAAGTA AGATGAACTA TTCAAAAGCA 3480 

TTGAATGAAT GTATCGAAAG TGCCTACATG GTTGCTGGAC ATTTTGGAGC TCGTTATCTA 3540 

GAGTCGTGGC ACTTGTTGAT TGCCATGTCT AATCACAGTT ATAGTGTAGC AGGGGCAACT 3600 

TTAAATGATT ATCCGTATGA GATGGACCGT TTAGAAGAGG TGGCTTTGGA ACTGACTGAA 3660 

ACGGACTATA GCCAGGATGA AACCTTTACG GAATTGCCGT TCTCCCGTCG TTTGCAGGTT 3720 

CTTTTTGATG AAGCAGAGTA TGTAGCGTCA GTGGTCCATG CTAAGGTACT AGGGACAGAG 3780 

CACGTCCTCT ATGCGATTTT GCATGATAGC AATGCCTTGG CGACTCGTAT CTTGGAGAGG 3840 

GCTGGTTTTT CTTATGAAGA CAAGAAAGAT CAGGTCAAGA TTGCTGCTCT TCGTCGAAAT 3900 

TTAGAAGAAC GGGCAGGCTG GACTCGTGAA GATCTCAAGG CTTTACGCCA ACGCCATCGT 3960 

ACAGTAGCTG ACAAGCAAAA TTCTATGGCC AATATGATGG GCATGCCGCA GACTCCTAGT 4020 

GGTGGTCTCG AGGATTATAC GCATGATTTG ACAGAGCAAG CGCGTTCTGG CAAGTTAGAA 4080 

CCAGTCATCG GTCGGGACAA GGAAATCTGA CGTATGATTC AAATCTTGAG CCGGAAGACT 4140 

AAGAACAACC CTGTCTTGGT TGGGGATGCT GGTGTCGGGA AAACAGCTCT GGCGCTTGGT 4200 

CTTGCCCAGC GTATTGCTAG TGGTGACGTG CCTGCGGAAA TGGCTAAGAT GCGCGTGTTA 4260 

GAACTTGATT TGATGAATGT CGTTGCAGGG ACACGCTTCC GTGGTGACTT TGAAGAACGC 4320 

ATGAATAATA TCATCAAGGA TATTGAAGAA GATGGCCAAG TCATCCTCTT TATCGATGAA 4380 

CTCCACACCA TCATGGGTTC TGGTAGCGGG ATTGATTCGA CTCTGGATGC GGCCAATATC 4440 

TTGAAACCAG CCTTGGCGCG TGGAACTTTG AGAACGGTTG GTGCCACTAC TCAGGAAGAA 4500 

TATCAAAAAC ATATCGAAAA AGATGCGGCA CTTTCTCGTC GTTTCGCTAA AGTGACGATT 4560 

GAAGAACCAA GTGTGGCAGA TAGTATGACT ATTTTACAAG GTTTGAAGGC GACTTATGAG 4620 

AAACATCACC GTGTACAAAT CACAGATGAA GCGGTTGAAA CAGCGGTTAA GATGGCTCAT 4680 
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CGTTATTTAA 


CCAGTCGTCA 


CTTGCCAGAC 


TCTGCTATCG 


ATCTCTTGGA 


TGAGGCGGCA 


4740 


GCAACAGTGC 


AAAATAAGGC 


AAAGCATGTA 


AAAGCAGACG 


ATTCAGATTT 


GAGTCCAGCT 


4800 


GACAAGGCCC 


TGATGGATGG 


CAAGTGGAAA 


CAGGCAGCCC 


AGCTAATCGC 


AAAAGAAGAG 


4860 


GAAGTACCTG 


TCTACAAAGA 


CTTGGTGACA 


GAGTCTGATA 


TTTTGACCAC 


CTTGAGTCGC 


4920 


TTGTCAGGAA 


TCCCAGTTCA 


AAAACTGACT 


CAAACGGATG 


CTAAGAAGTA 


TTTAAATCTT 


4980 


GAAGCAGAAC 


TCCATAAACG 


GGTTATCGGT 


CAAGATCAAG 


CTGTTTCAAG 


CATTAGCCGT 


5040 


GCCATTCGCC 


GCAACCAGTC 


AGGGATTCGC 


AGTCATAAGC 


GTCCGATTGG 


TTCCTTTATG 


5100 


TTCCTAGGGC 


CTACAGGTGT 


CGGGAAAACT 


GAATTAGCCA 


AGGCTCTGGC 


AGAAGTTCTT 


5160 


TTTGACGACG 


AATCAGCCCT 


TATCCGCTTT 


GATATGAGTG 


AGTATATGGA 


GAAATTTGCA 


5220 


GCTAGTCGTC 


TCAACGGAGC 


TCCTCCAGGC 


TATGTAGGAT 


ATGAAGAAGG 


TGGGGAGTTG 


5280 


ACAGAGAAGG 


TTCGCAATAA 


ACCCTATTCC 


GTTCTCCTCT 


TTGATGAGGT 


AGAGAAGGCC 


5340 


CACCCAGATA 


TCTTTAATGT 


TCTCTTGCAG 


GTTCTGGATG 


ACGGTGTCTT 


GACAGATAGC 


5400 


AAGGGACGCA 


AGGTCGATTT 


TTCAAATACC 


ATTATCATTA 


TGACATCGAA 


TCTAGGTGCG 


5460 


ACTGCCCTTC 


GTGATGATAA 


GACTGTTGGT 


TTTGGGGCTA 


AGGATATTCG 


TTTTGACCAG 


5520 


GAAAATATGG 


AAAAACGCAT 


GTTTGAAGAA 


CTGAAAAAAG 


CTTATAGACC 


GGAATTCATC 


5580 


AACCGTATTG 


ATGAGAAGGT 


GGTCTTCCAT 


AGCCTATCTA 


GTGATCATAT 


GCAGGAAGTG 


5640 


GTGAAGATTA 


TGGTCAAGCC 


TTTAGTGGCA 


AGTTTGACTG 


AAAAAGGCAT 


TGACTTGAAA 


5700 


TTACAAGCTT 


CAGCTCTGAA 


ATTGTTAGCA 


AATCAAGGAT 


ATGACCCAGA 


GATGGGAGCT 


5760 


CGCCCACTTC 


GCAGAACCCT 


GCAAACAGAA 


GTGGAGGACA 


AGTTGGCAGA 


ACTTCTTCTC 


5820 


AAGGGAGATT 


TAGTGGCAGG 


CAGCACACTT 


AAGATTGGTG 


TCAAAGCAGG 


CCAGTTAAAA 


5880 


TTTGATATTG 


CATAAAAGAA 


TAAAAGTATC 


AGCATCTGAC 


CATAAGTCAC 


AGTGGAGTGA 


5940 


AATTCAATGA 


AAATCAAAGA 


GCAAACTAGG 


CAGCTAGCCG 


CAGGTTGCTC 


AAAACACTGG 


6000 


TTTGAGGTTG 


CAGATAGAGC 


TGACGTGGTT 


TGAAGAGATT 


TTCGAAGAGT 


ATGAAACTAA 


6060 


AACCTATAGC 


TTCTAAACGA 


TCCGTGGTTT 


TCATCATTCA 


ACACAAAATT 


CATATGTTTA 


6120 


TTACCCTCCG 


TCGTATTTGT 


CTTAGAGCGT 


GTGTAGTAGA 


AAAAGAGCAG 


TCTTATCTGA 


6180 


AATTTTTATT 


CTTTCAAAAG 


AGACCTGTTT 


CTTTTTTGCA 


TGTCAAATCC 


GTTCTAGCTG 


6240 


GTATTTGAAA 


AATCAAACTA 


ATATTCAATG 


AAAATCAAAG 


AACAAACTAG 


GAAGCTAGCC 


6300 


GCAGGTTGCT 


CAAAACACTG 


TTTTGAGGTT 


GTAGATAGAG 


CTGACGTGGT 


TTGAAGAGAT 


6360 


TTTCGAAGAG 


TATAAGCTGC 


AAGATGAATG 


ATTTTCTTGT 


ATTGACGTTG 


TTGTTGACAA 


6420 
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AAAGTAGCGG ATAAATGAAA TCCATTCCAT TATCATAGAT GATAGGCTGG TAGGAAATTT 6480 

TCAAATAGCA TACAGGAAAT AGATGTATGG AGTTCTGGTA GTAGAAAGGG AGAGAGATGA 6540 

ACATTTTAGT TGCAGATGAC GAGGAAATGA TTAGAGAAGG AATTGCAGCA TTTCTGACAG 6600 

AAGAGGGTTA TCATGTCATT ATGGCTAAGG ATGGACAAGA GGTCTTGGAA AAATTTCAAG 6660 

ATCTCCCTAT CCATCTCATG GTACTGGATT TAATGATGCC TAGGAAGAGT GGTTTTGAAG 6720 

TGTTAAAAGA AATCAATCAA AAGCACGATA TTCCTGTCAT CGTCTTGAGT GCTCTGGGAG 6780 

ATGAAACTAC TCAGTCACAG GTATTTGATC TCTATGCTGA TGATCATGTG ACAAAACCTT 6840 

TTTCTTTGGT ACTGCTTGTC AAGCGTATTA AGGCGCTTAT CAGACGTTAC TACGTCATAG 6900 

AGGATCTTTG GCGATATCAG GATGTAACAG TGGATTTTAC CTCTTACAAA GCACATTATA 6960 

AAAATGAAGA AATTGATCTC AAACCAAAGG AATTACTGGT ACTAAAGTGT TTGATTCAGC 7020 

ATAAAAATCA AGTTTTAAGT AGAGAGCAGA TATTGGAAGA AATTTCAAAA GATGTAGCTG 7080 

ATTTACCTTG TGATAGGGTC GTTGATGTCT ATATTCGTAC TCTTCGCAAA AAATTAGCTT 7140 

TAGATTGTAT CGTGACTGTG AAAAATGTTG GGTATAAGAT TAGCTTATGA TAAAAAATCC 7200 

TAAATTATTA ACCAAGTCTT TTTTAAGAAG TTTTGCAATT CTAGGTGGTG TTGGTCTAGT 7260 

CATTCATATA GCTATTTATT TGACCTTTCC TTTTTATTAT ATTCAACTGG AGGGGGAAAA 7320 

GTTTAATGAG AGCGCAAGAG TGTTTACGGA GTATTTAAAG ACTAAGACAT CTGATGAAAT 7380 

TCCAAGCTTA CTCCAGTCTT ATTCAAAGTC CTTGACCATA TCTGCTCACC TTAAAAGAGA 7440 

TATTGTAGAT AAGCGGCTCC CTCTTGTGCA TGACTTGGAT ATTAAAGATG GAAAGCTATC 7500 

AAATTATATC GTGATGTTAG ATATGTCTGT TAGTACAGCA GATGGTAAAC AGGTAACCGT 7560 

GCAATTTGTT CACGGGGTGG ATGTCTACAA AGAAGCAAAG AATATTTTGC TTTTGTATCT 7620 

CCCATATACA TTTTTGGTTA CAATTGCTTT TTCCTTTGTT TTTTCTTATT TTTATACTAA 7680 

ACGCTTGCTC AATCCTCTTT TTTACATTTC AGAAGTGACT AGTAAAATGC AAGATTTGGA 7740 

TGACAATATT CGTTTTGATG AAAGTAGGAA AGATGAAGTT GGTGAAGTTG GAAAACAGAT 7800 

TAATGGTATG TATGAGCACT TGTTGAAGGT TATTTATGAG TTGGAAAGTC GTAATGAGCA 7860 

AATTGTAAAA TTGCAAAATC AAAAGGTTTC CTTTGTCCGC GGAGCATCAC ATGAGTTGAA 7920 

AACCCCTTTA GCCAGTCTTA GAATTATCCT AGAGAATATG CAGCATAATA TTGGAGATTA 7980 

CAAAGATCAT CCAAAATATA TTGCAAAGAG TATAAATAAG ATTGACCAGA TGAGCCACTT 8040 

ATTAGAAGAA GTACTGGAGT CTTCTAAATT CCAAGAGTGG ACAGAGTGTC GTGAGACCTT 8100 

GACTGTTAAG CCAGTTTTAG TAGATATTTT ATCACGTTAT CAAGAATTAG CTCATTCAAT 8160 

AGGTGTTACA ATTGAAAATC AATTGACAGA TGCTACCAGG GTCGTCATGA GTCTTAGGGC 8220 
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ATTGGATAAG GTTTTGACAA ACCTGATTAG TAATGCAATT AAATATTCAG ATAAAAATGG 8280 

GCGTGTAATC ATATCCGAGC AAGATGGCTA TCTCTCTATC AAAAATACAT GTGCGCCTCT 8340 

AAGTGACCAA GAACTAGAAC ATTTATTTGA TATATTCTAT CATTCTCAAA TCGTGACAGA 8400 

TAAGGATGAA AGTTCCGGTT TGGGTCTTTA CATTGTGAAT AATATTTTAG AAAGCTATCA 8460 

AATGGATTAT AGTTTTCTCC CTTATGAACA CGGTATGGAA TTTAAGATTA GCTTGTAGAC 8520 

AGATTAGTTT TTTATTAAAG TTCATATAGG GTTAACATAA GTGTGTTATT CTTTGTGTAG 8580 

ATAAAAGAAA GGATACTAAT ATGGTATTAG CGATTATTTT AGTAACATTC TTTATTCGAT 8640 

TGATTTTTTT AAAGCGTTCG ATAGAGAATG AGAAACGAAT CCTTAGCAAT GGCGGGG 8697 



(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4317 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 

AACCATACAT ACGGCAAGGC AAAGCTGACG CGGTTTGAAG AGATTTTCGA AGAGTATTAG 60 

TTGCCTTTAA AGGCATCCAC CATCGTTTGA AATTCTTCAT TTGAGAGAGT AATCCCTTTG 120 

CCCATTTTAG TATGGTCTGG ACTCCAAGCA CGAATATCAA ACTTTGCAGG GGCACCATTA 180 

AAGCTCACAC GGTTAATTTC CTTGGTCCAA CCTTTTTCGT TTTCAGAAAG AGTCAACAAG 240 

TGCTCTTCGA TTTCAAATGT AAATTCTGCC ATTTTCTTCT CCTTTTTTAG TTTCATTAGT 300 

TTATTCGTAA AATCTTGTAG ATTTTAGGAA AATTTTATAT AATATTGATA TAAAAGAAGG 360 

GAGGCCAATA TGAGACATAA ATTCCAGCAA GTTCTAAATA AAATACATGA TTTTTTAAAT 420 

GGATATGACC AACCTGACCA GACTGAAACC AACTCCCTTA CAGCCACTAT TGAAGAGGCT 480 

ATCCAGAAAC AAACCGCTGT TCACCTTATC TTGTCTGAGA CAAGCTTTAC AGGTGACATC 540 

ATCAAATATG ATCAGCAAGG CCAGCAAATT ATCGTGAAAA ATTTTTCCAA AAATGTGAGC 600 

CGGATTATCC GTATAAGCGA TATTCAACGC CTGCGATTTG TCCCCTCAAC TGTCCAAACA 660 

GCCCAAAAAA ATAGATTTAA GAAAGAGTGA GATGTAGTTG CTTCATCCCA CTCTTTTTTC 720 

TTAGCGAATT TGTTCAAAAT GTAAATGAAC TGCGATATGA TCTCCATAAC CACTTCTTTC 780 

CAAGTCACGT TGTAAACGAT AGGAAATGTA GTGTTCTGCA ATGGTAATGT AACCTGCGCC 840 

CAATAAACGA TGTTCAACCA TAGATTGAAT CATACTGATA GTCGCACGTT CCACCTTGGC 900 
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TTCTTGTAAA TCCAAAACTA CCTTCTTAGT GACTTGAGCA AGATTTTGAC GCAAATCATC 960 

TGTCAAAACA TAAACAGTTT GGGCTGCCTT CAAGATGGCT TGGTAAATCT TATCTGGATT 1020 

AAATTCAGCA ATTTCGCCAT TACGTTTGAT TACTTGCATA GGTTTCTCCT TTATTCTTTG 1080 

TTTTCTTTGA TTTCTGCCAG CATTTTTTCT TCTTCTACTG TCAGTTGATA ATGTTCAAGT 1140 

AAATCCGGTC TGCGCTCGTA GGTTTTCTTT AAACTCTCGT ACAATCGCCA CTGACGAATC 1200 

TTTTCATGGT GGCCACTCAT CAATACATCT GGCACGACCA TGCCTCGATA ATCATAGGGA 1260 

CGTGTGTACT GAGGATATTC TAAAAGACCT GAAGAAAAAC TATCATCTTG GTGGCTAGAC 1320 

TCCTTGCCAA TCACTTCTGG AATCAGGCGA ACTGTAGCAT CAATCATGGT CATAGCTGCC 1380 

AATTCTCCAC CAGTGAGGAC ATAGTCACCT AGGGAAATCT CATCTGTTAC CAAGGTCTTA 1440 

ATGCGCTCAT CATAACCCTC ATAGTGCCCA CAGATAAAGA TTAGCTCTTC CTCTTGAGCC 1500 

AAATCTTCAG CATAAGCCTG ATCAAACTGC TTTCCAGCAG GATCAAGGAG AATAACGCGC 1560 

GGATTTTTCT TTTCAATAGC ATCAAAGGAA TCGAAAATAG GTTGTGCTCT GAGCAACATG 1620 

CCCTGACCGC CTCCGTAGGG CTCATCATCT ACATGACGGG CCTTTTCAGC ATTTTCTCGA 1680 

AAATTATGAT ACTGGATATC CAAGAGCCCT TTTTCTCGAG CCTTTCCAAC GATTGAGTGC 1740 

TCCAGTGGAG AAAACATCTC TGGAAAGAGG GTTAAAATAT CAATCTTCAT CGTCTAACCC 1800 

TTCTAAGATT TCCACATCGA CCCGTTTACT TGGAATATCA ACATTGAGAA CCACTGGTGG 1860 

GATATAAGGT AAAAGCAAAT CACGTTTGCC TTTTCGTTTG ACCACCCAGA C ATCATT AG C 1920 

ACCTGGTTGC AGGATTTCCT TGATGGTTCC AACCAAGCTA TCACCCTCAT AGACTTCCAA 1980 

ACCGATAATC TCGTGATAGT AAAATTCACC ATCGTCTAGG TCATTCAAAT CTTCCTCAGC 2040 

GACCTTGAGA CTGTATCCCT TGTACTTTTC GATAGTATTG ATATGGTACA TATCTTTGAA 2100 

TTTAATAATG TCAAAGTTCT TCTGTTTACG GTGGCTAGCG ATGGTCACTG TTTGGACAAA 2160 

CTGATCTTTT TCATCAAACA AAACCAGCTC AGCTCCTTTT TTAAACCGTT CTTCTGCAAA 2220 

ATCCGTCACA GACAAGACTC GCATCTCCCC CTGTAATCCC TGCGTATTAA CGATTTTCCC 2280 

AACATTAAAG TAGTTCATCT TGTCTCCTGT AATCTCCTTT TTTCCATCTT ATTCTAACAA 2340 

TTCTCGAATA ATAGCCGCAA TTTTTTCCGA TTCTGACCAT TGTAAATAAT GGTGATTCCC 2400 

TCCTAAAATG AGTTTAGTAT TGGAAGTCCA ATATTCTGAT TCTCTGTACT CTTTTTCTCT 2460 

ATAAGGCTGA CAAAAAACAA ATACAGGAAT ATGAGCTTCT ATAGATACAT CCTCAAAATC 2520 

TTCCTCAGTA ATCTCTCCAG ATATCTGAAA TTCTGGATCT TGATTTTCCA ACTCTAAGCC 2580 

TTTTTCTTGC ATTAATTCCC AGATTTTTTT ATTCGTTTCA GGACTAAATG TTGCTTGAGT 2640 

TAAGTTCTTA AAATAAAGTT CAGGACCACA CTCGTCAATC AGCCTCATCT GCTCTTCCAT 2700 
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TTCTGGATAA GGATTTTCTG AAAAATCAGC AAACATGACT TTTTTAGTTG TCGGTTCAAT 2760 

TGCTACTAAA GTCTGACGCT TAATTGGTTT CTCGAGTAAT TTGCAAGCTA AAATTCCACT 2820 

CCAACTATGT GCACAAAGTA TATATTCAGA AATTCCTAAT TCTTCAAGTA CTTCATAAAC 2880 

CGCATCTGCA AGATTATCTA GATTTTTTCC AGCTTGGTCA TGAATCGGAC TCCTACCTGT 2940 

GTTCGGAAAA TCAATTGTCA AATAACCAAT TGTAGGAGGA GGTTTTTCAA GTATAAGTGA 3000 

AAAATTTTCA TAACTTGGTA GCAAACCTGC TCCGTTTAAA CAAACTAGCA CTTTCTTTTG 3060 

CTTTTGATAA GTAACAGAGA GGCTACCAAT TTCTGTAGAT ACTTCAAACC TCTTCATAAA 3120 

GAAATCCACT GATTCTATAT AATGAATTAT TAAAAATCCT TATCCTTTAT TTTATCACGT 3180 

TCCAAGGATT TTCTCAAGTT GGAGGAAGGG GACAATATCT CTACTTTCCC TTCAATAATC 3240 

CTTCCAAATT ATGTTTATGT TGGTAATTAA TGGCTGCGGT TTTGTCTTTC TCAAAGACAG 3300 

TCTTGGTAAG GTCAATATGA TTAATAGCTA CGATTGCGAC GGTGTAGTAA ATGATATCAG 3360 

CCAGTTCTCT GGCAAGTTCC TCGTTCGAAT CCTATCCCTT CTTTTCGACC AGAGCGCCTA 3420 

TTCAAAACCT CGACTACTTC TCCGACTTCC TCCACTAACT TCATAAAGAG ACCTTCATCA 3480 

GTCCGAGACT GCTGTTAATG TTCGATTAAG TAGTCTTGGA ATTGCCTAAA CGTTCAATCT 3540 

TTTATAGTAT ATTGAAACTA GAATAGTACA CCTTTACTTC TAAAACATTG TTAGAAATCG 3600 

ATTTGACTGT CCTGATCGAT TTGTCCTGTT CTTGTTTCAT TTTACTATAT CTTCTATTCC 3660 

ACACAAAAAA GCGAGACATC CGTCCCGCCC TTCTTATTTT TCGTCAATAA CGATTCTTAC 3720 

TTTTTTGTAT TCAGTTGGGA CAGAGTAGAC AATCGTTCTT ATCGCAGAAA TAGTGCGACC 3780 

CTTACGACCG ATTACACGAC CCACATCGCT TTGATCAAGA TTCAAATGAT ATTCCAAAAA 3840 

TTCTGGTGTA TCCTCAATCT TGATAGTTAA GGCATCTGGT TGTGAAATTA AGGGTTTCAC 3900 

AATCGCAATA ATGAGATTTT CAATCGTATC CATCTGTCAA CCTACTTTAA ACTTATTTTG 3960 

AAAATTTAGA ATCGTGGAAT TTTTTCAATA CGCCTTCTTT TGAAAGGATG TTACGTACTG 4020 

TGTCTGAAGG TTGAGCTCCA TTAGCCAACC ATGCAAGAAC GCGGTCTTCT TTCAAAGTTA 4080 

CTTGGTTTTC AGCAACAAGT GGGTTGTAAG TTCCAACTGT TTCGATGAAA CGTCCGTCAC 4140 

GTGGTGAACG TGAATCTGCT ACGTTGATAC GGTAGAAAGG TTTTTTCTTA GAACCCATAC 4200 

GAGTCAAACG GATTTTAACT GCCATTTTTA AAGTCTCATT TCTTTAATTT TTTATTTCGG 4260 

TGAAATAGCT GAGCTATTTA GCACATGTTC TATTATAGCA GATTTCTGGC ATGTGTC 4317 

(2) INFORMATION FOR SEQ ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4881 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 

AATTTATTTG ACTGGAAATT GTAGAGGGTT CTCGAAATTT CTTGAATGGT TAAAATAAGG 60 

ACAAGAGAAA ACATGGATAT CTATATCCTT GTGCCAAAAA AACCACTGCC CTCCCCAGAC 120 

CAACCTGAGG AAAGCAGTGA TTCTTATTTT AGGAGTTAGG AATGAATACA CGAAATCAAT 180 

TTAGCTGATT ATTTTTTGTT TTTCAAGAAT TCATCGTATT GTTTTTGCAT TTCGTTCAAT 240 

ACTTTTTCGT AGGCACCTTC AGATTTCAAT TTTTCCATCA ATTCTGGAAT CGCTTTATCT 300 

GGGTCTACAG TACCAGTGTT GATAGCTGTA TCAAATTGTT GCATTGTGTT AGCAATAGCT 360 

GAGATTTCAG ATTTCACATT GTCAGTATTG AAGATAAATC CAAGCGCTGG AGATTCTTTA 420 

GCTTCTGCCA ATTCTTTCTT AGAATTTTCG ATTTGTTGGT CTGTAACGTT TTCGTTGATG 480 

TAAAGGATCC AGTTGTTACC AGTGTTCCAT CCACCCATGT GAGTGTTTCC TTTGTAGCCA 540 

TCAAGAACGC GAACACGGTT TTCTTTACCT TCAATTTTTT CCCAGTTCTT GCCTTCTGGA 600 

CCGTAAACAA GACCGTTCAA GAGTTCTGGG TTCGTATTCA AGAGGTTCAA GATTTCCATT 660 

GATTTTTCTT TGTTCTTAGA GTTGTTTGAG ATGACAAAGT TAGCAACTTG TGTTGTTTGG 720 

TTTTTCTTGA TGAAGTTAGT AATTGGTTTG ATTTGGATAT CTTTGTTGGC AACACGTGAA 780 

AGCAAGCTGT TACCGTAGTC AGCTGGTCCT ACTGTTTCTT CACGAACGAA CCAAGTATCT 840 

TGTTGAAGGT CAAAGGAAGT ATCGCTTGTT GCGACGTCTT TTGGAATGTA GCCAGCTTCA 900 

TAGAATTTGT GAAGAGTCTT CAAGTGTTCT TTGAAACGAG GCACTTCGTA ACGGTTTACA 960 

ACTTTAGTAG TATCGCCTTC AAGGTCGATA ACGAATGGAA GACCGTTTGC TACTGGGTAG 1020 

TCAAAATTAT CAGATGGGAT GAAAACTTTA CCAATAGCAA ATGGTACTAC GTCTGGAGCT 1080 

TTTTCTTTGA TTTGTTTCAA GACTGGCTCA AGAGTTTCGT AAGAAGTAAC ACCTGAAATA 1140 

TCGATACCAT ATTTAGCAAG GAGAGTTCCG TTGAAGGCAA AGTTTTGAGA TGATGCAACG 1200 

TTGGCTGCAA CTGGAACAGC GTAAATCTTA CCATTTACAG TATTACCCTT GATGTAAGCT 1260 

GGGTCAAGTG CTTTGTAAAG GTCTTTACCT TCTTTTTTGT ACAATTCTGT CAAGTCAGCG 1320 

TAAGCACCTT TTTGAGCATT TACAATATAG TTATCTGCAA AGGCAATATC ATAGTTTTCA 1380 

CCAGATGATG TGATAACTGA CATTTTCTTA CCATAGTCAC CCCAGCCAAG GTATTGGATA 1440 

TCCAATTTGG CACCAACTTT TTCTTCAATG ATTTTGTTGG CATTTGCTAA CAATTCATCC 1500 

AAGTTGTCTG GTTTGTCACC GATTTGGTAC ATTTTGATAA CAGGTTTGTC ACCTGAATCA 1560 
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GCAGCTTTTT TGCTGTTACC TGTCAAATTT CCACAAGCAG CAAGACCTGC AGCCAGAGCG 1620 

ACTACACTAG CAGATGCAAA AGCATATTTT TTCCAGTTTT TCATGATAAA AACTCCTTTT 1680 

TTTATTTTTA AACTTATAAA CAATGTAATG ATCTTATACT CAATAAAAAT CAAAGAGCAA 1740 

ACTAGAAAAC TAGCCGCAGG CTGCTCAAAG CACTGCTTTG AGGTTGTAGA TAAGACTGAC 1800 

GAAGTCAGTT ACATATATCT ACGGCAAGGC GACGTTGACG CGGTTTGAAT TTGATTTTCG 1860 

AAGAGTATTA ACTTCACACA AGGGAAGTTG GGAACTGAGA AATGTTATTT CTCAATAAGC 1920 

ACTATTCTTT CACACCACCG ATAGTCAAAC CTTTTACAAA GTAGCGTTGG AAAAATGGAT 1980 

ACAAAATCGC GATTGGAAGG GTTGCAACCA CAACCATGGC CATACGACCT GTTTCTTTCG 2040 

GTAGAGCAAC TCCCAGTTGA CCAATCAAGC CGACCGCTTT GGCAATGTAG TCCATATTTT 2100 

GTTGGATTTG CATGAGCAAA TATTGCAATG GATACAAGTT GTCACTCTTG ATGTAAAGAA 2160 

GGGCGTTGAA CCAGTCATTC CAGAAACCAA GAGCTGTTAA GAGCGTGATG GTTGCGATAC 2220 

CTGGTAGTGA CAATGGCAAA CAGATTTGGA AGAAAATCCG GGCCTCACTG GCACCATCGA 22 80 

TACGAGCCGA TTCTAGAATG GCTTCTGGAA TGGTCTTCTT GAAGAAGGAA CGCATCAAGA 2340 

TGATGTTAAA TGGTGAGAGA AGCATTGGAA CAATCAAGGC CCAAACAGTG TCACCAAGCT 2400 

GAAGTACACG GGTCACCATG ATATAACCTG GTACCAAACC AGCGTTGAAC AACATACTGA 2460 

GAAGGACGAA GATGGTAAAG AATCTGCGAT ACTTAAAGGT TGTCCGTGAA ATAGCGTAGG 2520 

CATAGGTTGT TGTGATAAAG ACATTTGTCA ATGTCCCAAC TACGGTTACA AAGACAGAGA 2580 

TGAAGAGGGC TTGTAGGATT TTATCCTTAA ACTGTGCCAA AAACTCAAAA CCGTCTAAGC 2640 

CAAATTGGGA TGGGAAGAAG CTATAGCCGT ATTGGAGGAG GCTTTTCTCG TCTGTCACTG 2700 

AAATAATGAT AACGAATACA AAAGGTAGGA TACAAGAGAG GGCAATCAAA CCCGAAATGA 2760 

TACTGAAGAA GATATCTGCT TTCTTACTGA AGGAGTGAAT GCCGACATTA TCAATTTTTT 2820 

CTTTTTTAAT TTTCTTTTTT GCCATATTCT CCTCCTTTCT AGAACAAAGC TGAGTTTGGA 2880 

TCGACTCGTC TTGCAAGCAA GTTTGATAGG ATAACCAGAA TCAAACCAAC AACGGATTGG 2940 

TAAAGACCGG CTGCTGCAGC CATACCGATA TCTGCTGTCT GAGTCAAACC ATTAAAGACA 3000 

TATACGTCCA AAACGTTGGT TACATTGTAA AGCTGACCAG CATTGTGTGG GATTTGATAG 3060 

AAGAGACCGA AGTCTGCGCG GAAGATATTT CCGACTGCAA GGATGGTCAA TACAGTTACA 3120 

AGCGGAGTCA ACTGAGGAAT GGTTACGTTG CGAATACGTT GCCACTTGCT AGCTCCGTCC 3180 

ACTGTCGCTG CTTCGTAGTA GGTTGGATCA ATTCCCATGA TCGTCGCATA GTACATGACA 3240 

CTGCTATATC CAAAGCCTTT CCAAATACCT AGGAAAAGTA GGAGATAGGG CCAGATGCCC 3300 
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AGGTCAGCGT AGAAATTGAC TTCTTTGAGA CCAAGACTTT CCAATAGATG ATTGAACACC 3360 

CCTTTATCAA TATTTAGGAA GGCATCTGTA AAGAAACTGA TGATAACCCA AGACAAGAAG 3420 

TAAGGGAACA ACATAGAAGT TTGAAAAATC TTCACCATTC TCTTAGAACG GAGCTCGCTG 3480 

AGGATAATGG CAATCCCTAC AGATACAACT AAACCTAGAA AGATAAAGCC AAGATTGTAG 3540 

AGGACAGTAT TTCGTGTGAT AATAAAGGCG TCTCTTGAAC TAAATAAGAA TCTAAAATTA 3600 

TCGAGTCCGA CCCATTTACT ATTTATGATA CTATCTATGA AACCATTACT GGTCATGTGG 3660 

TAGTCTTTGA AGGCAACCAC GTTCCCAAAT ACTGGAATGT AAAAGAATAG AATCAACCAG 3720 

AGTGCCCCTG GCAAAACCAT CAAGAGAAAG ATCCAGTTGT CTCTCAATGT TTTTGAAAAC 3780 

TTTTTCATAA TTTCCTCCCT TTTTATTTTG ATATCCATCT AAAAATTCTT TTTTAGACTT 3840 

TTGATAACGA TTACATTATT AGTATACTCC TATTTGCAGG TTAGGTTAAA CTCCTAATTA 3900 

TAGAAAAAAC TCCACAAATT ATGTAGCAGA TTTAAAACTT TATCACCACT ATCAAACAAA 3960 

TGTCCTAAAT CAATTGTTTA TTTTATCTCT ATTAGCCCAG TGATGGCGTC ACTCTGTTAT 4020 

AAGCATCCAA CAACGGGGTA TACTGAAAAA TCTCCAGACT AGGGAACTCA GCGATAGTTC 4080 

CTAATCTGGA GATTTTTAAT ATGTTATTAG GCGTTTGCTT TCAACTTAGC AATAACCTCT 4140 

TTAAGATTAT CAATCAACTC TGCTGCAGTA TGCTCAGAGC CTTTTTCATC TGCCAAGAAC 4200 

AAAACTGCTT TTTGAAGTTC TTTTTGAGAG TTTTCAAGGA CATCCTTATC TACTGTTTCA 4260 

AGGTTTGAGT CTTTAAGAAG TTTACTTAAT TCCTTGGCTA ATTTCTTGAG TTTGATTTGC 4320 

AGACTCATCT TCTCCTGCTG TTTCTTTGCC CGCTGTTTGT CCTCCATCCT TAGTTGCTGA 4380 

CTGGCTTTCC TTAATGGACT CTAGGGAAGC AATGGCATCT TTGACTGTTT GCAAGATATC 4440 

ACGTAAACCT TGCTCTGTCA AACTATCATC TGCAAAAGCT TTATTAGCCT CTGCCAAAAC 4500 

CAGACGTGCT GAATCTGTGG TAGGATTCGA TACACCTGTC AATGATCTCA AAAGATTTTC 4560 

TAAGGTTTGA GTCTGCTTAC TAATACTAGA CTAAAATCAA AAAGTATTAT ATAACAGTGA 4620 

TATGAAATCA ACTAAAGAAG AAATCCAAAC CATCAAAACA CTTTTAAAAG ACTCTCGTAC 4680 

AGCTAAATAT CATAAACGCC TTCAAATCGT TCTATTTTGT CTGATGGGCA AATCTTATAA 4740 

AGAGATTATA GAACTTTTAT AGTAGTTTGA AATAAGATGT GAACATCTCT ATCAGGAAAG 4800 

TCAAATTAAT TTATAGAAAT ATTTTAGCAG CCAAGGTGTA CTGTTATAGA TTCAATACAC 4860 

TATACTTGGT GGTTTAGCTC G 4881 
(2) INFORMATION FOR SEQ ID NO: 126: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13121 base pairs 

(B) TYPE: nucleic acid 



WO 98/18931 



PCT/US97/19588 



869 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 126: 

AGGATCCCCG GAAAAGGAGA CTAAAAATGA AGAAAAAATT TCTAGCATTT TTGCTAATTT 60 

TATTCCCAAT TTTCTCATTA GGTATTGCCA AAGCAGAAAC GATTAAGATT GTTTCTGATA 120 

CCGCCTATGC ACCTTTTGAG TTTAAAGATT CAGATCAAAC TTATAAAGGA ATTGATGTTG 180 

ACATTATTAA CAAAGTCGCT GAGATTAAAG GCTGGAACAT TCAGATGTCC TATCCTGGAT 240 

TTGACGCAGC AGTCAATGCG GTTCAAGCTG GGCAAGCCGA CGCTATCATG GCAGGGATGA 300 

CAAAGACTAA AGAACGTGAA AAAGTCTTCA CCATGTCTGA TACTTACTAT GATACAAAAG 360 

TTGTCATTGC TACTACAAAG TCACACAAAA TTAGCAAGTA CGACCAATTA ACTGGCAAAA 420 

CCGTTGGTGT TAAAAACGGA ACTGCCGCTC AACGTTTCCT TGAAACAATC AAAGATAAAT 480 

ACGGCTTTAC TATTAAAACA TTTGACACTG GTGATTTAAT GAACAACAGC TTGAGTGCTG 540 

GTGCCATCGA TGCCATGATG GATGACAAAC CTGTTATCGA ATATGCCATT AACCAAGGTC 600 

AAGACCTCCA TATTGAAATG GATGGTGAAG CTGTAGGAAG TTTTGCTTTC GGTGTGAAAA 660 

AAGGAAGTAA ATACGAGCAC CTGGTTACTG AATTTAACCA AGCCTTGTCT GAAATGAAAA 720 

AAGATGGTAG TCTTGATAAA ATTATCAAGA AATGGACTGC TTCATCATCT TCAGCAGTGC 780 

CAACTACAAC TACTCTCGCA GGATTAAAAG CTATTCCTGT TAAGGCTAAA TATATCATTG 840 

CCAGCGATTC TTCTTTTGCC CCTTTTGTTT TCCAAAATTC AAGCAACCAA TACACTGGTA 900 

TTGATATGGA ATTGATTAAG GCAATCGCTA AAGACCAAGG TTTTGAAATT GAAATCACCA 960 

ACCCTGGTTT TGATGCTGCT ATCAGTGCTG TCCAAGCTGG TCAAGCCGAT GGTATCATCG 1020 

CTGGTATGTC TGTCACAGAT GCTCGTAAGG CAACTTTTGA CTTCTCAGAA TCATACTACA 1080 

CTGCTAATAC CATTCTTGGT GTCAAAGAAT CAAGCAATAT TGCTTCTTAT GAAGATCTAA 1140 

AAGGAAAGAC AGTCGGTGTT AAAAACGGAA CTGCTTCTCA AACCTTCCTA ACAGAAAATC 1200 

AAAGCAAATA CGGCTACAAA ATCAAAACCT TTGCTGATGG TTCTTCAATG TATGACAGTT 1260 

TAAACACTGG TGCCATTGAT GCCGTTATGG ATGATGAACC TGTTCTCAAA TATTCTATCA 1320 

GCCAAGGTCA AAAATTGAAA ACTCCAATCT CTGGAACTCC AATCGGTGAA ACAGCCTTTG 1380 

CCGTTAAAAA AGGAGCAAAT CCAGAACTGA TTGAAATGTT CAACAACGGA CTTGCAAACC 1440 

TTAAAGCAAA CGGTGAATTC CAAAAGATTC TTGACAAATA CCTAGCTAGC GAATCTTCAA 1500 

CTGCTTCAAC AAGTACTGTT GACGAAACAA CGCTCTGGGG CTTGCTTCAA AACAACTACA 1560 
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AACAACTCCT TAGCGGTCTT GGTATCACTC TTGCTCTAGC TCTTATCTCA TTTGCTATTG 1620 

CCATTGTCAT CGGAATTATC TTCGGTATGT TTAGCGTTAG CCCATACAAA TCTCTTCGCG 1680 

TCATCTCTGA GATTTTCGTT GACGTTATTC GTGGTATTCC ATTGATGATT CTTGCAGCCT 1740 

TCATCTTCTG GGGAATTCCA AACTTCATCG AGTCTATCAC AGGCCAACAA AGCCCAATTA 1800 

ACGACTTTGT AGCTGGAACC ATTGCCCTCT CACTCAATGC GGCTGCTTAT ATCGCTGAAA 1860 

TCGTTCGTGG TGGTATTCAG GCCGTTCCAG TTGGCCAAAT GGAAGCCAGC CGAAGCTTGG 1920 

GTATCTCTTA TGGAAAAACC ATGCGTAAGA TTATCTTGCC ACAAGCAACT AAATTGATGT 1980 

TGCCAAACTT TGTCAACCAA TTCGTTATCG CTCTTAAAGA TACAACTATC GTATCTGCTA 2040 

TCGGTTTGGT TGAACTCTTC CAAACTGGTA AGATTATCAT TGCTCGTAAC TACCAAAGTT 2100 

TCAAGATGTA TGCAATCCTT GCTATCTTCT ATCTTGTAAT TATCACACTT TTGACTAGAC 2160 

TAGCGAAACG CTTAGAAAAG AGGATTCGTT AATGGCAAAA TTAAAAATTG ATGTAAATGA 2220 

TTTACACAAG CACTATGGAA AAAATGAAGT CCTAAAAGGA ATTACGACTA AGTTCTATGA 2280 

AGGAGATGTT GTTTGTATCA TCGGTCCTTC AGGTTCTGGT AAGTCAACTT TCCTCCGTAG 2340 

CCTCAATCTT TTAGAAGAAG TCACTAGCGG TCACATCACT GTGAACGGCT ATGATTTAAC 2400 

TGAAAAAACA ACCAATGTTG ACCACGTCCG TGAAAATATC GGCATGGTAT TCCAACACTT 2460 

CAACCTCTTC CCTCATATGT CTGTATTGGA CAACATCACC TTTGCTCCTA TTGAGCACAA 2520 

GTTGATGACT AAGGAAGAAG CTGAGGAATT GGGAATGGAG TTGCTTGAAA AGGTTGGACT 2580 

AG C AG AT AAA GCTAATGCCA ATCCAGATAG CCTATCAGGT GGTCAAAAAC AACGTGTGGC 2640 

CATCGCTCGT GGCCTAGCAA TGAATCCAGA CATCATGCTC TTCGATGAAC CAACTTCTGC 2700 

CCTTGACCCT GAGATGGTTG GAGACGTACT TAACGTTATG AAGGAATTGG CTGAGCAAGG 27 60 

CATGACCATG ATTATCGTAA CCCATGAGAT GGGATTTGCT CGTCAGGTTG CCAACCGCGT 2820 

TATCTTTACT GCAGATGGCG AGTTCCTTGA AGACGGAACA CCTGACCAAA TCTTTGATAA 2880 

CCCACAACAC CCTCGTCTGA AAGAGTTCTT AGATAAGGTC TTAAACGTCT AAACTCAAAC 2940 

TGTAAGGATT TCCTTGCAGT TTTTCTACCT CGTATTGGAA TTTTTGATTT TTCGGAAAAT 3000 

TATGTTAGAA TTAAGTTTAT GAAATGAGGT TTCCTCATAC CTAGCAAGAC TAGGAATAAA 3060 

AATAGAAATT AGGTAGCTAG ATGTCATCTA AGGTTATTGT TACAATTTTC GGTGCGAGTG 3120 

GAGACCTGGC TAAACGCAAG CTCTACCCTT CCCTTTTTAG ACTATATCAA TCCGGCAATC 3180 

TTTCCAAGCA CTTTGCCGTT ATTGGAACTG CCCGTAGACC TTGGAGTAAG GAATATTTTG 3240 

AATCTGTAGT TGTCGAGTCC ATCCTTGATT TGGCAGATAG TACCGAGCAA GCCCAAGAAT 3300 

TTGCTAGCCA CTTCTACTAT CAAAGCCATG ATGTCAATGA TTCGGAACAT TATATTGCTT 3360 



WO 98/18931 



PCT/US97/19588 



871 

TGCGTCAATT ACAAGCTGAG CTTAATGAAA AATACCAAGC TGAACACAAT AAGCTCTTCT 3420 

TCTTGTCTAT GGCACCTCAG TTCTTTGGAA CCATTGCCAA ACACCTCAAA TCTGAAAACA 3480 

TTGTCGATGG CAAAGGTTTT GAGCGCTTGA TCGTTGAAAA ACCATTTGGT ACAGATTACG 3540 

CAACTGCAAG CAAGTTGAAT GACGAACTCC TAGCAACATT TGACGAAGAA CAAATTTTCC 3600 

GTATCGACCA TTATCTTGGT AAGGAAATGA TCCAAAGCAT CTTTGCAGTT CGCTTTGCAA 3660 

ACTTGATTTT TGAAAACGTT TGGAACAAGG ATTTTATCGA CAATGTTCAA ATTACCTTTG 3720 

CGGAGCGCTT GGGTGTAGAA GAACGTGGTG GCTACTATGA CCAATCCGGT GCCCTCCGTG 3780 

ACATGGTCCA AAACCACACT CTACAACTTC TTTCGCTCCT CGCCATGGAC AAACCAGCAA 3840 

GCTTCACAAA AGACGAGATT CGTGCTGAAA AGATTAAGGT CTTTAAAAAC CTCTATCATC 3900 

CAACTGATGA AGAACTCAAA GAACACTTTA TCCGTGGGCA ATACCGCTCT GGTAAGATTG 3960 

ATGGCATGAA ATACATCTCT TATCGTAGCG AGCCAAATGT GAATCCAGAA TCAACAACTG 4020 

AAACCTTTAC ATCTGGTGCC TTCTTTGTAG ACAGCGATCG ATTCCGTGGT GTTCCTTTCT 4080 

TTTTCCGTAC AGGTAAACGA CTGACTGAAA AAGGAACTCA TGTCAACATC GTCTTTAAAC 4140 

AAATGGATTC TATCTTTGGA GAACCACTTG CTCCAAATAT TTTGACCATC TATATTCAAC 4200 

CAACAGAAGG CTTCTCTCTT AGCCTAAATG GGAAGCAAGT AGGAGAAGAA TTTAACTTGG 4260 

CTCCTAACTC ACTTGATTAC CGTACAGATG CGACTGCAAC TGGTGCTTCT CCAGAACCAT 4320 

ACGAAAAATT GATTTATGAT GTCCTAAATA ACAACTCAAC TAACTTTAGC CACTGGGATG 4380 

AAGTTTGTGC GTCATGGAAG TTGATTGACC GTATTGAAAA GCTCTGGGCT GAAAATGGTG 4440 

CCCCACTTCA TGACTATAAA GCTGGAAGCA TGGGACCTCA AGCCAGCTTT GACCTACTTG 4500 

AAAAATTCGG TGCCAAATGG ACTTGGCAAC CAGATATCAC CTATCGTCAA GATGGTCGCT 4560 

TAGAATAAAA AAATTTCCTG CAAGTTTATG CcTTGCAGGA TTTTTGCTTC TGATTAGATT 4620 

AAACCTTCCA AGAGACCTTT CATAAAGTTT TCTGAGTTAA ACTCTCCAAT ATCATCGATT 4680 

TTTTCACCAA AACCAATCAA TTTTACAGGA ATATTGAGTT CTTCACGAAT GGCTAGAACC 4740 

ACACCTCCTC GAGCAGTTCC ATCAATCTTA GTCAAAACAA TTCCCGTTAA AGGTGTGATT 4800 

TTCGAAAATT CTTTGGCCTG TACTAGGGCA TTTTGACCTG TTGATGCATC AAGTGCCAAG 4860 

AAGGTTTCAT GTGGTGCTTC TGGCACAACA CGTTTGATAA TACGACCAAT CTTTTCCAAC 4920 

TCAGCCATAA GGTTATCCTT ATTTTGCAGA CGACCAGCAG TATCAATCAT GAGAATATCG 4980 

ATACCTTCAG TCACGGCACG TTCCATACCA TCAAAGACCA CGCTGGCTGG ATCAGCTTTT 5040 

TCAGGTCCAG TTACTACTGG AACATCTACT CGTCGGCCCC ATTCAGCTAG CTGAGCTACT 5100 
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GCACCCGCAC GGAAGGTATC TGCTGCAACC AGCATGACCT TCTTACCAGC TTGTTTGTAG 5160 

CGGTGGGCTA GTTTTCCGAT AGAAGTTGTT TTCCCAACAC CATTCACACC AACAAAGAGC 5220 

ATAACTGTCA AGTTATCTTG GAAGTGGATG CTTTCATCGT AGCTACCATC CTTTTCATAA 5280 

AGCTCAACCA ATTTCTCAAT GATGACACGA CGAAGTACAT CAGGTTTCTT GGCATTTTCA 5340 

AGCTTGGCTT CGTAACGTAG TTCCTCCGTT AAGTTAGAAG CGACTTGGAC ACCAACATCA 5400 

CTCATAATCA GCAGTTCTTC CAGTTCCTCG AAAAATTCTT CGTCAACAGA GCGGAAGTTA 5460 

GCAAAGAAGG CATTCAAGCG GGCACCGAAA CCTGTGCGAG TTTTCTTAAG ACTGCGGTCA 5520 

TATTTTTCCT GAACAGTTTC TTCTGTTTGA GGAGCTTCTG GTTCAAGCAC TTCAGAATTA 5580 

TTTTCTTCTA CAGTTCCTTC GTGCTCAAGC TTCTCTTCCT CTGGTAATTC TTCTGAGTTT 5640 

GGTAATTCTT CTATTTCTTC TTGAGAAACC CCTACAGCTG GCTCTGAATC CTGACTTTCT 5700 

TCAACTGTGT CTTGGATTTC CTCTTCTTGG AACACAGCTT GTTCAACAAT TTCAACCTCT 5760 

GCTTCTTCCT GAGAAACTTC CTCAACTTCT GTGAAGGTAG GATCAACATC TTCAGACAAA 5820 

TCAAGATTTT CCAGAGCTTC TTTTACAACT TCTTCGATTT TAGGTTCTTC TTTTTTTCCG 5880 

AATAGACGGT CAAACAATCC CATATCTTAG TTCTCCTTTA GCACATATTC TTCGATAGCC 5940 

CAGGCGACAG CTTCCTCATC GTTGGTCATC GGCGTCACTA CATTTGCGGC TGCCTTTACT 6000 

TCAGGAACAG CGTTTTGCAT AGCAACACCA AGACCTGCCC ATTCAATCAT AGAGAGGTCA 6060 

TTGGCCTCGT CACCACAAGC CATCACTTGA CTTTGGTCGA TTCCAAGATG GCTGATTAGT 6120 

TTTGCCAAAC CTGTTGCTTT ATGAACATTC TTTGGTGACC ATTCTAGCAA CATTTCACGT 6180 

GATTTAAAGA TTTCATATTG GTCAAACAAT TCTGGAGAAA TCTTCTGAAT GGCTGCATCC 6240 

AAGGGTTCTT GAGCAAAGGC AGTCACGCAT TTGTTGTAGG TCATTTGACT AGATAAGTCT 6300 

TCAAAGTCCA CTGGAACAAA GGTCAAAGCT GGATTGAATT TGGCATAAAG ACTTTCTTGG 6360 

TCCGATTGGA TTTGATAAAC TGTTCCTTCT GAGATGGCAT CAAGAGGCAG TGATAATTTC 6420 

TCTGTTTCTT CATACAAACG TGCCACATCA TCATATGAAA AGACTGTTTT ATCAAGGATT 6480 

TCTCCTGTAT TTTTCTGAAC TAATCCACCA TTAAAAGTAA TGGTATACTC ATCTTCCTGA 6540 

CCGTCAGTCC CTAACTCATG GAGAAAGAAA TCCATGGCTT TTAAGGGACG ACCAGTTGTC 6600 

AATACGACCT TGATACCACG ATCACGCGCA gCTTGCAAGG TTTCCTTGGT ACGATCCGTC 6660 

AGCCTTTTAT CAGTAGTCAG CAAGGTCCCG . TCCAAGTCCA ATGCAATCAA TTTTATATCT 6720 

GCCATTATAA GCCCTCCATA TAAGCTATAA CCGACCGTTC CTTATGGTGA CCAATCACAG 6780 

TCTTTGCTAA TTCTAAAATT TCAGGTCGTG CATTTTCAGG AGCTACAGGA TGTCCCACAA 6840 

CCTGCATCAT ATGTAAGTCA TTAAGATTGT CTCCAAAAGC CATGACCTGA TCCATTGTGA 6900 
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TACCAAGTTT TTTAACTAAT TCAACAATGG CCACTCCCTT ATCGACATAG TCCAGAACAA 6960 

TATCAATGGA TTCAAAGCCA GTTGTCATGG CCTTAACACC AGGAACGTTT TCGTTTACCC 7020 

AAGCCTCCCC ATCTTCCAGC GTTTCTTCTG TGAAGTTGGT TGTAAATTTG AAAATGTCAT 7080 

CTGTGATATC TTCCAAACTC GCTACTTTTT GGATATTTTC ATTATAGTGC TGACTCACTT * 7140 

TCAAATAGGT CTCATCAACC GTATCTAGAA CATATGAACC CTTCTTACCC GTCAAGAGCA 7200 

GTTTATTGAT ATCTACATAA GGTGAAGTTT TCAGCTTTTC AAAAGTTGCC AGATAAAAGT 7260 

CACGAGACAT AGTCGCTTCA TACAAGTCCT GACCTTGATA CTCTACCAAA CTGCCATTTT 7320 

CCGCGATGAA AATAATGTCA TCACGAACAC CAGCAAATAA TTTTTCTAGA GACAGAAATC 7380 

CCCGACCCGA AGCTACCGCA AAGTAAATCC CTTTTTCCTT GTAGGAAACC AAGAGAGACT 7440 

TGAGACGATC CATATCAAAG CGTCCATTCC CATCTAGGAA GGTTCCGTCC ATATCCGTTG 7500 

CTACTAGTTT AATTGTCATC CTTCAATACT TTCTAAATCT TTTAACTTAA CTGAAACAAT 7560 

CTTTGAAACA CCCGATTCTT GCATGGTCAC TCCATAGATG GAATCAGCCG CTGCCATGGT 7620 

TCCCTTACGG TGGGTTACGA CGATGAACTG GCTGTCCTTG TCAAAGCGGT TGAGGTAATC 7680 

CCCAAAACGT TTAACATTGG CTTCATCCAG CGCAGCTTCC ACCTCATCCA AGATAACAAA 7740 

TGGAATAGTC TTGACACGAA TAATGGAGAA GAGCAAGGCA AGAGCCGATA GGGCTTTTTC 7800 

ACCACCACTC ATGAGATTAA GAGACTGGAT TTTCTTGCCT GGTGGTTGGA CAGAAATTTC 7860 

AACCCCAGCT GTCAGCAAGT CTCCTTCAGT CAAAATGAGG TCAGCCTGAC CTCCACCAAA 7920 

CATCTGCTTG AAGGTCACTT TAAAGGACTC ACGAATGACC TCAAAGGTTG ATTTAAAGCG 7980 

TTCCTTGACC TCATCATTCA TCTCTGTAAT GGTCTCAAGG AGCAGGTTTT TCGCAGACAA 8040 

AATATCATCA CGTTGGCTAT TTAGGAAATC CAGACGGTTG TGAACTTCTT CGTACTGTTC 8100 

AATAGCGTCT AAATTGACAG GACCCAGTGA GCGTATAGCC TTCTCTAAAT CCTTAACTTC 8160 

TTGCTCTGCC AGATTGAGAT TTTCCAACTC ATGCGCCTTT TCTAAAGCTT CTGTGTAGCT 8220 

GATCTGGTAC TGGTCTGTTA ATTGACTTTG TAGATGGCGC AAGCGCTCGC TAACCTTTTC 8280 

TTTCTTGGCT TCAGCACGAG TTTGCTTGCG AATCCACTCT TCATTCTGCT GGCGAGCCTG 8340 

ATCCAAATGA CTAGCAATAT CATCCAGTTG ACCCTCAATA TCATCCAACT CAAACTGCTT 8400 

GCGAATCAAA CCTTGTTGGA GATTTGTTTT TTGAGTTTTG GATTCTTCCG CCTGTTGACT 8460 

GAGCAATTCT GTATCAACCT TCTCAAGATT ATCAATCTTT TCTTGAAGAA GGCGCTGGAT 8520 

TTCCTCTTGT TCAAAATCAA GATTGTCCAA TTCCTTGCCT AAGCGTTCAA TATCAGCAAC 8580 

TTCATAACGT TTTTGCCCTT GCAGTTCTGT CTTAAGCAAA CGAGCTTGCG CTAGCTCTTC 8640 
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CTGCAAGTTT TGATAGCGTT CTTGGATGGC ATTTTTGTTA GACTTAATCT CTTCAATCTC 8700 

AGCTTCCAGA TTTTGCTTGT CACTGGAGAT TGCAGCAAGA CGCTCTTGGC AGTTTTCCTT 8760 

ATCCGCTTGC CAATCTCCCT CGGAAAGACG ATCTATTTCC TCTTCTTGGA GTTTCCAAAG 8820 

AGTTTCCAGT TCTTCAACTT GCTGACTAGT TTGCTGATAA GCGAGGAACA AGCCTTGCTC 8880 

CTGAATACGT GCCTGCTCTC CTTGAGATTT AATAGCTTCT AATGACTCGG TCAATCTGGC 8940 

CATCTCATCT TGCAAGGTCT TCAAAGTCGC CTCTTCTGAA CCCAAGCTTG CTTCTTCTTC 9000 

AGCAATTTCT TTTTGTAATT GCTCCAGTTC TGGCTTGATA AAAATGCTGT TATTCTGGCG 9060 

ATTGGCACCA CCTGCATAAG AACCACCTGT GCGCAACTCT GTCCCATCCA ATGTCACCAT 9120 

ACGAACCTGA TAACGAACTT GGCGAGCTGC TGCACGCGCA TGTTCTACGG TATCAAAGAT 9180 

AGCCGTCGTA GCTAGCAAGT TCTTGAAAAT GGCTTCCAGT CTAGTATCAA AAGTCACCAA 9240 

CTCATCTGCC ATCCCAAGGA AACCTGGGCT TACAGCGATA GCATCTTGGT TCTGACTAGA 9300 

AATCGTACGC GCCTTGATAG TGGTCAAAGG AAGAAAGGTT GCACGACCGG CTCTGTTCCG 9360 

TTTAAGGAAG TCAATAGCCT TGGTTGCCGA CTCTTCATCT TCTACGATGA TATGCTGGCT 9420 

ACTTGCCCCT AAGGCAATCT CTAGGGCAGT TTGATAATAA ACATCAAAGG TCAGATGCTC 9480 

ACTGACTGCA CCAATAATCC CACCTAGGCG ATCTTTTTCT TGGAGAACAC TCTTAACACC 9540 

TGCATAAAAG TTACTATGAT TTCTCAGGAT ATTTTCCAAA CTTTGAGCTC TGGCCTGCTT 9600 

GTTTTTGAGA TTATCCAGAC GGTCAAAGAG TTGGCTTTGT TGAGCTTGAT AGGAAGTTTT 9660 

CTGCTCCTCT TGCTCCTTGG CAATAGCTTG GTAGTCAGCC AATAATTTCT GAACCTGCTC 9720 

CTTGGCAGTT TCAAGCTCTT CCTTTTGCTG ACTAGCCTTC TCTTTAGCTA TAGCTAATTG 9780 

CTCTTTCAGC TTTTCTAGTT GATCTGCTTG TTTTTGAGAA AGCTGACGAC TATTTTCCAA 984 0 

CTCATTCTCA ATACGGGTCA ACTGGTTTGA GACATCCGCT TCTTCTTGTA AAAGAGCTAC 9900 

AAAGCGTTCA CGTAAGAGCT CAATCATCTG ATCAGGATCG TCTGAGAAAG CCAGCAATTC 9960 

AGCTTCTAAA CGATTGAGTT TTTGATTATT TTGGACTAGA TTTCCCTCTA ACAGAGCTAA 10020 

AGAGCTTTCT TTATCAGACT TTTCTTTGCT GAGTGAATTT CTCTTATCCT CCAAAGCAGC 10080 

CAAACGGGCT TGTGCCTCCT GTTGATTCAA GGCCACTTGC TCGGACTCCA GTTTCGATAG 10140 

GGCTAATTTT CTTTCTAAAT CACTAATCAG ACTAGTCAAG TCCATCAAAC TGCCTTGGTC 10200 

TTTGGCCATT TCAGCCTGTA AATCTTGGCG TTGCTTTTTA AGAGTTTGAT TTTCTTCTTC 10260 

TAATTTTTCA CGCTTTTGGT AATAACTCAT CAAGAGTTCT TGAACCTGAG TCAACTCTTC 10320 

TTCTGTCGAC TCTAGTTCAG CCTTATTTTC CTTGATTTGA GCAACCAGAA CATCTAAATA 10380 

AATAGCCTTA CGTTGTCCTT CCAAGTCTAA AAACTTACGG GCATTCTCAG CTTGCTTCTC 10440 
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AAGAGGCTTG 
CTGAGTTTGC 
AGCAGCTTCT 
CTTCCCTTGG 
ATGAATATCA 
ATAGACATGG 
ATCCAGAGTC 
GATGATATCC 
ACGCAGACTT 
ACCTTGGTCA 
TTCCTTTAAA 
GCTAATTTCT 
ACATCAAAAA 
ACATCACCAT 
AACTCGCCTG 
ACATCCTTAT 
TCACGATTGC 
TGGCAATCAC 
GATAGGTCAC 
TGTAGAACAG 
TGCTCATTGG 
TCGATTGCAA 
CTGATAATAG 
TGGAAAGCAT 
AGGATAAAGA 
ATCGAACCCC 
ACTTGTTTTA 
CTTAGTGGGA 
TAAGGTCAGC 



ATTTGATTAT 
TGCAGTTTAC 
TCAAAAATAG 
GAAATAATAG 
CGCAGACGGA 
CGTTCCACCC 
ACAACTACAG 
GGCATCTTGC 
TCTGTAATAT 
AAAACGACCT 
TACATGAATC 
TAGAACGACC 
CCTTATCGTG 
TGACCTGAAG 
CTTCAACCTT 
CCAAAAGAAG 
GACCACCTGA 
GCGCAAAACC 
CTTCAGGCTT 
CGTCTCCTAA 
CATAACTCGT 
AATGATTCTT 
TCCTTTTTAT 
TTGGGAATTC 
AAAAAGCCCT 
CATCTCAAGA 
TTATAACAGA 
AGCATCCCCA 
AACCTGACGT 



CCAACTCGTA 
TCTCGGTTTC 
CTCGTCGTTC 
AGAAGGAATC 
CTTTCTTGCC 
TGATTTCTTG 
AAGCATAATT 
CCCCACGGAG 
TGGACTTTCC 
TGGTCTTATC 
CAGCCCCTTC 
TTGGCCTTGA 
AGCAGGCCCT 
CAACTCTTGG 
AGGAATCATG 
GGCACCAAGA 
TTTTTCTTCC 
AGCTAAACTC 
TTTAGGATAT 
AAATTCCAAG 
ATGAGTAAAG 
TAGTACAGTT 
TATATCAAAA 
TTTAGACAGA 
ATTAAAGGCT 
ACCGGAATCT 
AATTTGCTCT 
TTCCAGATGG 
CCACCTGCAT 
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GATAATGTCC 
TTTTCTGCGA 
CTCAGGCTTG 
TCGTCCCAAT 
GTCAATCTTG 
ACCTGCATCC 
GAGCGGTTTG 
ACTCTTGACA 
AGATCCATTG 
AGCAAAAGAC 
TCAACGGCAT 
CCGATGCTCT 
GTTTCAGAAA 
AGATGGGTTT 
ACTTGATAGA 
AAGGCTTCAA 
CCTTTACCCA 
TCCTCACGGA 
TTTTTATATA 
CGTTCATTGT 
GCAGTTTCCA 
TGTAATTCTT 
AAAGCCCCCT. 
GATTCTCAGT 
TTTTAGGATG 
TACGTGATAT 
AATAACAAGT 
AGTTTTTCAC 
AAGAAATAGC 



TCTAAGCGGT 
GTCTTGTATT 
GAATTAAAAA 
CCAGTATCCA 
TATTCGCTAT 
TTGATAAATC 
CGACTTTCGG 
CTAGACTCCC 
GGTCCAACAA 
TTGAACCCCT 
TTTTGGCAGC 
TACCTTCAAC 
TCACCTGATA 
TATAGTCTGT 
TAAATTCCTT 
AGGCATCACC 
ACTTGATAAA 
CAATCATAGC 
GATATTCTGA 
GTGAAATTTT 
GTAACTTTTT 
TCATACCAAC 
GAGTCACTCT 
TTTAGCGGCA 
TTTACATCCA 
CCATTACACT 
TTTTTGGTCA 
GATCACATAA 
ACTTTGAAGG 



CCAGATTATC 
TTAAAACTCC 
TCTCCTCAAC 
AGAAGAGGTC 
CTCCACTACG 
CGTCATGATT 
TTCCAGCAAA 
CCAAAGCCCA 
CTGCCGTCAC 
GAATTTCGAT 
TTCCTGCTCT 
AAGAACTTCT 
ACGAATAGCC 
AATCATCTCA 
GACCTTGGCC 
AAGAATGGTG 
CTGGTCAAAC 
ACGGAGTTTT 
AATCAATAAC 
TAAGAGGCGG 
GTCTGCAAAT 
CTCTTTCTAA 
AAAACGGGAC 
AATTTGGGTC 
CCCTGAGGGA 
AAGGGTGGAA 
AAGACCCCGT 
TCAACGTGTT 
TCTTGTTCCA 



10500 
10560 
10620 
10680 
10740 
10800 
10860 
10920 
10980 
11040 
11100 
11160 
11220 
11280 
11340 
11400 
11460 
11520 
11580 
11640 
11700 
11760 
11820 
11880 
11940 
12000 
12060 
12120 
12180 
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TCTCAGTTAA AGTGTCTTGC AGATGACCTT TAGCAGGAAG CAAGATACGT TTGCCTTCCA 12240 

CATTTTTGTA AGCACCTTTT TGATATTGTG AGGCTGAACC ATAATATTCT TTGAACTGTT 12300 

CACCATCGAC TTCAATCGTT TTCCCTGGAC TTTCAATGTG TCCTGCAAAG AGGGAACCAA 12360 

TCATGATCAT GCTAGCACCG AAGCGGATAG ACTTAGCAAT ATCACCGTGA GTACGAATTC 12420 

CTCCATCAGC GATAATCGGT TTACGCGCAG CCTTGGCACA CCAGCGTAGA GCAGCCAACT 12480 

GCCAACCACC TGTACCAAAA CCAGTCTTAA CCTTGGTGAT ACAAACCTTA CCAGGACCGA 12540 

TTCCGACCTT AGTAGCATCC GCACCAGCAT TTTCCAATTC ACGCACAGCT TCTGGTGTTC 12600 

CCACATTTCC AGCAATGACA AAGGTATCTG GCAATTCTTT CTTGATGTGT TGAATCATAG 12660 

AAATCACGCT ATCCGCATGA CCATGAGCAA TATCAATAGT GATATACTCA GGAGTATCAG 12720 

CCTTGAGCTG GCTAACAAAA TCATACTCAT AATCCTTAAC ACCGACAGAG ATAGAAGCAA 12780 

TGAGCCCTTG ATTGTGCATT CGTTTAATAA AAGGAATGCG TCCTGCCTCA TCAAAACGGT 12840 

GCATAATGTA GAAGTAACCA CCTTTAGCCA GTTGCTCTGC TACATTTTCA TCCAAAATCG 12900 

TCTGCATATT CGCTGGCACA ACAGGTAGTT TAAAGGTGTG ATTTCCTAAA GTGACACTTG 12960 

TATCCGCTTC TGCACGGCTT TTAATGACAC ATTTATTTGG AATCAATTGA ATATCTTCGT 13020 

AATCAAAAAT TGGAAATTCA TTTAACATAT CGATGTCTCG TTTCTTTTGT AATGACCTAC 13080 

CTATGCTCTT GCATCACTAC GCCTTTTCCG ACGTTTCCTG G 13121 



(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9578 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 

CCGAATGCAA TGTTTACGGT TGAACTTGAA AATGGACATC AGATTTTAGC AACAGTTTCT 60 

GGTAAAATTC GTAAAAACTA TATTCGTATT TTAGCGGGAG ATCGTGTTAC TGTCGAAATG 120 

AGTCCATATG ACTTGACACG TGGACGTATC ACTTACCGCT TTAAATAATC GAAAAACTTG 180 

GAGGGATAAG AAATGAAAGT AAGACCATCG GTCAAACCAA TTTGCGAATA CTGTAAAGTT 240 

ATTCGTCGTA ATGGTCGTGT TATGGTAATT TGCCCAGCAA ATCCAAAACA CAAACAACGT 300 

CAAGGATAAG ATAGAAAGGA GAAAACATGG CTCGTATTGC TGGAGTTGAT ATTCCAAATG 360 

ACAAACGCGT AGTAATCTCA TTGACTTATG TTTATGGTAT CGGACTTGCA ACATCTAAGA 420 

AAATTTTGGC TGCTGCTGGA ATCTCAGAAG ATGTTCGTGT ACGTGATCTT ACATCAGATC 480 
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AAGAAGATGC TATCCGTCGT GAAGTGGATG CAATCAAAGT TGAAGGTGAC CTTCGTCGTG 540 

AAGTAAACTT GAACATCAAA CGTTTGATGG AAATCGGTTC ATACCGTGGT ATCCGTCACC 600 

GTCGTGGACT TCCTGTCCGT GGACAAAACA CTAAAAACAA CGCCCGCACT CGTAAAGGTA 660 

AAGCTGTTGC GATTGCTGGT AAGAAAAAAT AATATAGGAG GTAAAAGTCT TGGCTAAACC 720 

AACACGTAAA CGTCGTGTGA AAAAGAATAT CGAATCTGGT ATTGCTCATA TTCACGCTAC 780 

ATTTAATAAC ACTATTGTTA TGATTACTGA TGTGCATGGT AATGCAATTG CTTGGTCATC 840 

AGCTGGTGCT CTTGGTTTCA AAGGTTCTCG TAAATCTACA CCATTCGCTG CTCAAATGGC 900 

TTCTGAAGCT GCTGCTAAAT CTGCACAAGA ACACGGTCTT AAATCAGTTG AAGTTACTGT 960 

AAAAGGTCCA GGTTCTGGTC GTGAGTCAGC TATTCGTGCG CTTGCTGCCG CTGGTCTTGA 1020 

AGTAACAGCA ATTCGTGATG TGACTCCAGT GCCACACAAT GGTGCTCGTC CTCCAAAACG 1080 

TCGCCGTGTA TAATCATCGC ATTACACTGC TTTTCGTTTA AGAGGGAGTA ACTAAATGAT 1140 

CGAGTTTGAA AAACCAAATA TAACAAAAAT TGATGAAAAT AAAGATTATG GCAAGTTTGT 1200 

AATCGAACCA CTTGAACGTG GCTACGGTAC AACTCTTGGT AACTCTCTTC GTCGTGTACT 1260 

TCTAGCTTCT CTACCAGGAG CAGCTGTGAC ATCTATCAAC ATTGATGGTG TGTTACATGA 1320 

GTTTGACACA GTTCCAGGTG TTCGTGAAGA CGTGATGCAA ATCATTCTGA ACATTAAAGG 1380 

AATTGCAGTG AAATCGTACG TTGAAGACGA AAAAATCATC GAACTGGATG TTGAAGGTCC 1440 

TGCTGAAGTA ACAGCTGGTG ACATTTTGAC AGATAGCGAT ATTGAAATTG TAAATCCAGA 1500 

TCATTATCTC TTTACAATCG GTGAAGGTTC TTCTCTAAAA GCGACTATGA CTGTTAACAG 1560 

TGGTCGTGGA TATGTACCTG CTGATGAAAA TAAAAAGGAT AATGCACCAG TTGGAACACT 1620 

TGCTGTAGAT TCTATTTATA CACCAGTTAC AAAAGTCAAC TATCAAGTGG AACCTGCTCG 1680 

TGTAGGTAGC AATGATGGTT TCGACAAATT AACCCTTGAA ATCTTGACAA ATGGAACAAT 1740 

TATTCCAGAA GATGCTTTAG GGCTTTCAGC ACGTATTTTG ACAGAACATC TTGATTTGTT 1800 

TACAAATCTT ACTGAGATTG CTAAGTCAAC TGAAGTGATG AAAGAAGCTG ATACTGAATC 1860 

TGACGACCGT ATTTTAGATC GTACGATTGA GGAACTGGAC TTGTCTGTGC GTTCATACAA 1920 

CTGTTTAAAA CGTGCCGGTA TCAATACTGT GCATGATTTG ACAGAAAAAT CTGAAGCAGA 1980 

GATGATGAAA GTACGAAATC TTGGACGCAA GAGTTTGGAA GAAGTGAAAC TCAAACTCAT 2040 

TGATTTGGGT CTTGGATTAA AAGATAAATA AAGGAGGAAT ACATGGCTTA CCGTAAACTA 2100 

GGACGCACTA GCTCACAACG TAAAGCAATG CTTCGCGATT TGACAACTGA CCTTTTGATC 2160 

AACGAATCAA TCGTGACAAC TGAAGCTCGT GCTAAAGAAA TCCGTAAAAC TGTTGAAAAA 2220 
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ATGATTACTC TAGGTAAACG TGGTGATTTG CATGCACGTC GTCAAGCAGC TGCTTTCGTA 2280 

CGTAATGAAA TCGCATCTGA AAACTATGAT GAAGCAACTG ATAAGTACAC TTCTACTACA 2340 

GCACTTCAAA AATTGTTCTC AGAAATCGCA CCTCGTTATG CTGAACGTAA CGGTGGATAC 2400 

ACTCGTATCC TTAAAACTGA ATCACGTCGT GGTGATGCAG CGCCAATGGC GATCATCGAA 2460 

TTAGTATAAA ATCATCAATT TTGTTGAGTG TTATGATGAT GGAGTCTTGT GCTCTTAGTC 2520 

TAGCTCTGGT CTACCGCTAG GATTTCGGTC CTAGCGGGAA CACTCATCAT AAGTTGGGAT 2580 

AGTAGACGCT TGTTTACGAA ATTGTTTTTT TCTTAAGAAC AACTTCGTAA GCAGGCGTTT 2640 

TTGAGTATTT TCGTTAGAAT TATGCTATAC TATTTGAAAA GAATCCTGTT TAATGTTAAG 2700 

GTTTCTTATT TTAAGAAGAA TTGGAGTTTA CTTATGAAAG CCATTATAAC TGTTGTTGGT 2760 

AAAGATAAAT CTGGAATTGT TGCAGGTGTT TCTGGTAAAA TTGCAGAATT AGGATTGAAT 2820 

ATTGACGATA TCTCTCAAAC TGTCTTGGAT GAATATTTTA CGATGATGGC TGTTGTATCT 2880 

AGTGATGAAA AGCAAGATTT TACCTATCTT CGTAATGAAT TTGAAGCTTT TGGGCAAACT 2940 

TTGAATGTAA AAATCAATAT TCAGAGTGCA GCGATTTTCG AAGCTATGTA TAATATCTAG 3000 

GAGGTCATCA TGGATATTAG ACAAGTTACT GAAACCATCG CCATGATTGA GGAGCAAAAC 3060 

TTCGATATTA GAACCATTAC CATGGGGATT TCTCTTTTGG ACTGTATCGA TCCAGATATC 3120 

AATCGTGCTG CGGAGAAAAT CTATCAAAAA ATTACGACAA AGGCGGCTAA TTTAGTAGCT 3180 

GTTGGTGATG AAATTGCGGC TGAGTTGGGA ATTCCTATCG TTAATAAGCG TGTATCGGTG 3240 

ACACCTATTT CTCTGATTGG GGCAGCGACA GATGCGACGG ACTACGTGGT TCTGGCAAAA 3300 

GCGCTTGATA AGGCTGCGAA AGAGATTGGT GTGGACTTTA TTGGTGGTTT TTCTGCCTTA 3360 

GTACAAAAAG GTTATCAAAA GGGAGATGAG ATTCTCATCA ATTCCATTCC TCGCGCTTTG 3420 

GCTGAGACGG ATAAGGTCTG CTCGTCAGTC AATATCGGCT CAACCAAGTC TGGTATTAAT 3480 

ATGACGGCTG TGGCAGATAT GGGACGAATT ATCAAGGAAA CAGCAAATCT TTCAGATATG 3540 

GGAGTGGCCA AGTTGGTTGT ATTCGCTAAT GCTGTTGAGG ACAATCCATT TATGGCGGGT 3600 

GCCTTTCATG GTGTTGGGGA AGCAGATGTT ATCATCAATG TCGGAGTTTC TGGTCCTGGT 3660 

GTTGTGAAAC GTGCTTTGGA AAAAGTTCGT GGACAGAGCT TTGATGTAGT AGCCGAAACA 3720 

GTTAAGAAAA CTGCCTTTAA AATCACTCGT ATCGGTCAAT TGGTTGGTCA AATGGCCAGT 3780 

GAGAGACTGG GTGTGGAGTT TGGTATTGTG GACTTGAGTT TGGCACCAAC CCCTGCGGTT 3840 

GGAGACTCTG TGGCACGTGT CCTTGAGGAA ATGGGGCTAG AAACAGTTGG CACGCATGGA 3900 

ACGACGGCTG CCTTGGCCCT CTTGAACGAC CAAGTTAAAA AGGGTGGAGT GATGGCCTGC 3960 

AACCAAGTCG GTGGTTTATC TGGTGCCTTT ATCCCTGTTT CTGAGGATGA AGGAATGATT 4020 
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GCTGCAGTGC 
TCTGTTGGAT 
ATGATTGCGG 
ATTCCCAAAG 
GTTATGAAGG 
GCACCAATTC 
TTAGACGTGT 
CACTGAATTA 
TTTTCATAAT 
GATATAATAG 
CATGGCAAGA 
ACTGCTGAAG 
CAGTTTGAGC 
CTTGAAGAAC 
TGGTGTTTCG 
CGTATCTTTA 
GTAGAGGTCG 
GAAGGCTTTG 
GTCAGCCATG 
GGTCTGGATA 
GTTGTCGGTG 
TAATCAGTCT 
AAAACAGCCG 
ATTGCTTTTA 
AGTTGACAAG 
AGTCATTGTT 
TGTGACTCAA 
CTTAGCAAAG 
TGCTATTTGT 



AAAATGGCTC 
TGGATATGAT 
ATGAAGCAGC 
GAAAAGAAGG 
TTAATGGGGC 
ATAGTTTTAA 
ATACTATAAT 
GTTTGAATTT 
AATCTCCTTC 
AAGCAAACGG 
CCATGTTTAA 
GTGAACGAGG 
GTGCTTATTC 
TTGGCTTGCA 
GTAGTTTTGA 
ATGTGGACCA 
ATACAGCTGG 
AAATGATTGC 
GAATGACTAT 
ATGGTAGCGT 
ACCGTAGTTA 
AGACTTGCTT 
AGGGCACTCC 
GAGATTTTCA 
GTTGCGGCTA 
CTTGTTTTGA 
GATAAGATGG 
GTATTGAGAC 
TGTAAGCTAG 



TCTTAATTTA 
TGCCATCCCA 
AATCGGTGTT 
CGATATGATT 
TTCGTCTGTC 
AAATTAAGAA 
CATTAAATAA 
GATTTTCATC 
AAAAGTCGCC 
AGGACGGAAA 
CACGATTGGT 
GATTCAAGAG 
GAGTGATTCT 
GGGGGAAATC 
TGGAGCCTAT 
CGTTCACCAA 
TTGGGCTGAA 
AAAAGAAATG 
TGGAACCATT 
GACAATCCTT 
CCGAGAGCTA 
GCCATGAGCT 
TTTCGGCTGT 
TAAACAAGAG 
CACCGTAATT 
CATTGTAGAG 
CTCCAGAAAT 
GACTACTTCC 
CAACAGATGA 
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GAAAAACTAG 
GAAGATACGC 
ATCAACATGA 
GAGTTTGGTG 
GACTTCATCT 
AATAGGAGAA 
AGACCTCCTA 
TAATATCTTT 
TGTATGGGTG 
ATGGTAAAAG 
CGCGCGCAAG 
TTAGGAATCG 
GGTCGTACCA 
CCTTATCGCA 
GATGGCGATC 
TTGTCTTATG 
GGCTGGGAAA 
GAAGATCAAG 
GTTTATCTGA 
GAATATGAGG 
GGACGTGAGA 
AGGGATTTGA 
TTTTGATGTG 
CAAGGAACCT 
TCCTCTGAGA 
GAGGATAGAA 
GATACCAGAT 
TAAGCTTTCA 
GCGAATAGTA 



AAGCTATGAC 
CTGCTGAAAC 
AAACAACAGC 
GTCTATTAGG 
CTCGCGGTGG 
ATTTTAAGTT 
ATATTATTTG 
ATTTAATGAA 
GCTTTTATTT 
TACGATTGTA 
GTTGGAGCGA 
GTTTGCGAGA 
TTCAGACCAT 
TGGACAAGCG 
TTTTCATGGG 
CTGAACTGGC 
AACTCAGTGG 
GTGGAGGTAA 
TTAATGGCAT 
ACGGCCAGTT 
AGATGGAAGA 
TAAGAATATC 
GAAAACTAAA 
ACTGTTAGAA 
ACCTCTGTAT 
GTAGAGAGTT 
AGCATCATTG 
GCAGCTTCTT 
TAAGGTAATC 



GGCTATCTGT 
TATTGCGGCT 
TGTTCGTATC 
AACTGCACCC 
ACAAATCCCA 
CTATTTAAGA 
AAACAGATAA 
CTCCTAAACT 
TATCATTCAT 
TTTGGTACGT 
TACTCCCTTA 
ATCTGATCTA 
GGGAATTATC 
TATCAGAGAA 
CATTATTCCT 
TGAGGGCTTG 
CCGAATCAAG 
CGCCCTTGTT 
GCATCCGCAT 
TAGGGTTGAA 
AGGCTCTATT 
AAGATAAGAA 
GTGTAATGCT 
CAGTCAGGAT 
AAATAGCTAC 
TTGAAATCAT 
GAGTTGTAAT 
CAATACTTGG 
TTCTGGCAGA 



4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
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TAGAGACATA ATCAAGATGA AAGCAGTCCC TGTAATCATA AGAAATCCAC TTCCAAATAG 5820 

ACCAGTATTG AAGGAAGAAA TGAAGGCAAT CCCTAGAACG GTTCCTGGTA CAATATAAGG 5880 

TACCATACTG AGGCTGTCAA TTAAGTTTGT AAACAAATTC CGTTTTCTAA CGGCTAGGTA 5940 

GGAGATAAAT GTCGCAAATA GAACAACTAG AACTAAGGCA ATCAAAGGGA TACGAATGGT 6000 

ATTGAAAATA GCAGATCCCA TACGATGGAA AGCTACCTTG TAACTGTTTG GAGAATAACC 6060 

TTTAACAGAT ACCATACCTG ATGTTTTTAG GAAAGAGGTA TAAATTAAGT AGATTTGAGG 6120 

TAAAACAGAG ATAAAGATAA TTCCGTAGAC TGTTGCATAA ATGGCAGCCA TTTTTCCTTT 6180 

TGTAGTTTTT TTAGGCTCAA TTGGATGGAG CAGATTCATG CTGAAACTGT AGCGGTTTGC 6240 

AATGTGTTTT TGGATAAGGA AAATTGCCAA GGCAATGATA ATCGCCATAA TTGCAAAAGC 6300 

AGAATTTCCT CCAACCTCGC TAATAAATTG GGTATAAATC AGGACAGGGA AAGTCCGATA 6360 

CCCTTCGCCA ATCAACATAG GCGTTCCAAA GTCTGAGAAT GCTCTCATAA ATACAAGCAA 6420 

GGAGCTGCTA GTAAGGTTGG AACTAGGAGA GGTAAAACAA CCGTTACGAT AGGTTTAAAT 6480 

CCGAAGGACC CCATGCTTTC AGCTGCTTCA AGTAGAGAAT TGTCAATACT GTTCATTGTT 6540 

CCAGCAACAT ATAGAAATAC CAGTGGGAAT AGTTGCAGTG TAAAGACAAG TACAATTCCT 6600 

TTGAATCAAT AAATATCGAT AGCTGGAAGA TAAAGGGCAT TTGTCAAAAA TTTAGTGATG 6660 

ACCTCATTTC GTCCTAGCAA GAGAACCCAG GAGTAGGCTC CTACGAAAGG AGCTGACATG 6720 

GAAGCAATGA TAATCAATAT TTGTAGAAAT TTCTTCCCCT TGAAGTCATA CATAGAGAAG 6780 

AGATAAGCTA ATAGGGTTCC TACAACTAAG GAAGTGATAG TAGCGGTAAT GGAAACCTTG 6840 

AAACTGTTGA CTAGTGTCTC AGAGTAGTAG GCTTTACTAA AGAAAGTGAC AAAATTAGCT 6900 

AGTGAGAATT GTCCTTCATG TATAAGTGCT TGCTTGAGCA CGGTAACGAT AGGATAAACG 6960 

AGAAAGATAG GATAGGTAAG AAAGAGGAAG AAAGAGGAAA CTGTCCAAAT ATTTAGTTTT 7020 

TTACGTTCCA TGGTTGACTC CTTTTATCAG GTTTTGGGAA CCATCTGCAG AAAAGATGTT 7080 

TAATTTTTGC GTATTGATTC GTAGACGAAT ACGATTGCCT TTTTGTAGAT CTTCTTCAAA 7140 

AGTTGATTCT TCACTAACTT GAATTTTTGA GGCAAAACCT GTCTCAATGA AATAATCCGT 7200 

ATTTAGTCCA AGATAGACGC TATCTCTAAT AGTTCCTTCA ATATCTCCAG ATTCATCTTT 7260 

GATAAACTCT TCGGGACGAA TGCTTACATG AATAGCTTGC TCCTCAACCT GATCAAGAGC 7320 

TGGCATTCGA AGGGCATAGC CATCTGAAAA GACGATATAA GCGCCGTCGC TCCGTTTTTC 7380 

AAGATTGGCA GGGATAATAT TTGTGCGTCC GATAAAGGTT GCCACAAACT CATTAGCTGG 7440 

TTTATGATAG AGTTCTTTTG GTCGGCCGAT TTGTTGGATC ACCCCATCTT TCATAACAGC 7500 

AATTTGGTCT GAAATAGCCA TGGCTTCTTC TTGGTCGTGG GTTACATAAA CAGTTGTAAT 7560 
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TCCCACTTCG TGTTGGATTT CTCGGATGGC TTGACGCATA TCCAAGCGAA GTTTGGCCTC 7620 

CAGATTACTA AGTGGCTCGT CCATGAGGAG AACACTTGGA TTAACCGCTA AGGCGCATGC 7680 

CAAGGTGACA CGTTGTTGTT GTCCACCACT GAGTTTATCG GGCTTTCGAT CCGCATATTG 7740 

i 

AGCAATTTGC ATGAGTTCAA GATACTTGTT GGTCTGTTGA ATCAATTCTT CTTTTGGAAC 7800 

CTTCTTTTGC ATAAGACCAA AAGCAACGTT GTCTCGGACA GTCAAATGTG GGAAAATAGC 7860 

GTAGTTTTGG AAAACCATCC CGATATTGCG TTTGCTGGGT TCCATATTAT TGATTTTTGT 7920 

ATCATCGAAG TAAAATTCTC CACCTTCGAT ACTGTTGAAA CCTGCAATCA TACGAAGAAG 7980 

GGTCGTTTTC CCACATCCTG AAGCTCCAAG AAGGGTAAAG AGACTTCCTT TTGGAATTGT 8040 

AATGTTCAAA TTCTCAATAA CAGGGACATC GTGGTAGATT TTTTTGGCGT TAATAATTTT 8100 

GATCTCACTC ATAGTGAACC TCTTTTACTG TTTAGATTGG ATATCTGTAA AGACTTCGTT 8160 

GTATTTCTTA ACGATATCTG ATTTATTCTT GATGACATAA TCATAATCTT CAGTGAGTGT 8220 

TTTGATTTTG TCAATTGGTT TCATGTTTTC GCTTGTTTTA GCATTTTTAC GAACAGGACG 8280 

GTTAGTAGTG GTTGTACCAA GTGTATCTTG TACTTCTTGA GAGATAATAA AATCGATAAA 8340 

TTTCTTGGCA TTTTCCATAT TTTTAGATTT TTTAACGATA GCAGCACTAG CAGGTAGGAA 8400 

GACGGTTCCT TCTTTTGGAT AGACTACCTT AATGTTAGCT CCGTCATTTA AGAGTTTAAC 8460 

TGCTGGATCT TCATAAGAGA GACCAACAGC CATTTCTCCA TCAGCGACTA CTTTATAGAC 8520 

ACTAGATGAA CTTGAACCGA TTTTACCATC AATAAGTGTG AAAAGATCTT TTACATAAGA 8580 

CCAAGCCTTA TCATCTTTGT AACCACCTTG AGCTTGTAGC ATATTTGTTA ATTGAGCAAA 8640 

GGCGCTAGAA GAGTTTGCTG GGTCAGCAGT TGCGATTTTT CCTTTTAGTT CAGGTTTGAA 8700 

AAGATCGTTA TATCCTTCGA TGTTCATGCC TTTAGTTAAA TCAGGGTTGA CGATTAAAAC 8760 

ACTACCATCT AGTGTATAAG GAGTAGAGTA GCCAGTTGTG TTTTGATATT CTTTGATAAC 8820 

ATTATCATTT TCTTTTGAAG TATAGTTTTC AAAGAGTTCT CCGTGGGTAG TATATTGTGT 8880 

ATAAGAACCA CCAAAGATAA CATCAGCTAC AGGAACTTCT TTTTCTGACT CTAGTTTTTT 8940 

GAAAAGTTCT CCAGTACCAG CTTGAATCAG TTCTACTTTG AT AC CAT ATT TTTCTTCAAA 9000 

GGCAGGAATA GTTGCTCCAA TTAAGCCCTC TGAGTTTGGT GAATAAACGA CTAGCGAACC 9060 

GCCGTCTCCT TTATCAGATG AACTGTCATC GGCAGATTCA TTAGAAGAAC AAGCAGCATA 9120 

ATACATCCAT TTCTTTTTCA TGATGGATAC CTCCGTTGTG TTATTTAAGT TTATTTTAAA 9180 

ACAATGTAAG CGTTTTTAAA ACATACAATT CTATTCTATA GTGTATTGAA TCTATAACAG 9240 

TACACTTTGA CTGCTAAAAT ATTTCTATAA ATTAATTTGA CTTTCCTGAT AGAGATGTTC 9300 
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ACATCTTATT TCAATTCACT ATATTAGAGT AAAATTCTCT ACAAAAAGAA GAATAGCCTA 9360 

TTTTACTATT CTTCTGAGTG ATTTCAATTC CTTTGGGGAA ATATGGAGAT ACTTTTTAAA 9420 

TCCTGACAAA TGGTTGTTTC TTTTTCTAAA TCGGTGATAC TGTATCGGAG AATGCGCGTG 9480 

AGGTCACAAA GGCTGCGATA GAGCTTCTAT GGAGAATTTC TTTTTGGAGA GATTTTTTAA 9540 

AGGAATGAGA CATCCGCTAC CTCCTTGGAA GGTTTTTG 9578 



(2) INFORMATION FOR SEQ ID NO; 128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13440 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 

CGGGCTGTTG TGACGATTCT TATTTCTATC TGTGTTATCT TTTTGGGAAC TATTTTGGGT 60 

GTTGTCTTGG CTTTTGGGCA ACGTTCAAAG TTTAAACCGC TTGTTTGGTT GGCCAACTTG 120 

TACGTTTGGA TTTTCCGTGG GACACCGATG ATGGTTCAAA TTATGATTGC CTTTGCTCTT 180 

ATGCATATCA ATGCTCCGAC TATTCAGATT GGAATTTTAG GTGTTGATTT TTCGCGTCTG 240 

ATTCCAGGGA TTTTGATTAT CTCTATGAAT AGTGGTGCTT ATGTTTCGGA GACTGTTCGT 300 

GCCGGAATCA ATGCGGTTCC AAAAGGTCAG CTAGAAGCGG CTTATTCGCT AGGGATTCGT 360 

CCTAAAAATG CGATGCGTTA TGTGATTTTG CCACAAGCAG TCAAAAATAT CTTGCCAGCA 420 

TTGGGGAACG AATTTATCAC CATTATCAAG GACAGCTCCC TCTTATCAGC TATTGGGGTC 480 

ATGGAGTTGT GGAATGGGGC TACAACAGTT TCTACAACAA CCTATCTACC TTTAACACCA 540 

CTTTTATTTG CAGCATTTTA CTACTTGATT ATGACCTCTA TTCTGACAGT AGCCTTGAAA 600 

GCTTTTGAAA AACATATGGG ACAAGGAGAT AAGAAATAAT GACAGAAACC TTGATAAAAA 660 

TTGAAAATTT ACATAAATCC TTTGGAAAGA ATGAAGTATT GAAGGGCATC AACCTCGAGA 720 

TTAAAAGAGG AGAAGTTGTC GTTATCATCG GTCCTTCAGG GAGCGGGAAA TCTACCTTGC 780 

TTCGCTCTAT GAATTTGTTG GAAGAAGCAA CCAAGGGGAA GGTTATCTTT GAGGGAGTCG 840 

ATATTACGGA CAAGAAGAAT GACCTGTTTG CCATGCGTGA GAAGATGGGC ATGGTTTTTC 900 

AACAATTCAA TCTCTTTCCT AATATGACTG TGATGGAAAA TATCACCTTG TCCCCTATCA 960 

AGACCAAAGG TGACAGTAAG GCCGTTGCAG AGAAAAGAGC TCAGGAACTT TTGGAAAAAG 1020 

TTGGTTTGCC AGATAAGGCA GACGCTTATC CACAGAGTTT GTCAGGTGGC CAGCAACAGC 1080 

GGATTGCCAT CGCGCGTGGG TTGGCTATGG AACCAGATGT TTTGCTCTTT GACGAGCCAA 1140 



WO 98/18931 



PCTYUS97/19588 



883 



CTTCAGCCCT 


AGATCCTGAG 


ATGGTTGGAG 


AAGTTCTGGC 


TGTTATGCAA 


GATCTAGCCA 


1200 


AGTCAGGAAT 


GACCATGGTT 


ATCGTAACAC 


ATGAGATGGG 


ATTTGCCCGT 


GAGGTGGCAG 


1260 


ATCGTGTCAT 


CTTTATGGCA 


GACGGTGTGG 


TTGTTGAAGA 


CGGAACACCT 


GAGCAGATTT 


1320 


TTGAACAAAC 


CCAAGGACAA 


AGGACTAAAG 


ACTTCTTGAG 


TAAGGTTTTA 


TAAGTTAGCT 


1380 


TTGTTTAGCT 


ATTTGTAGCC 


AGCTTTAAAC 


GTTAAAGAGA 


AGATTAGTGA 


AAAGCTCAAC 


1440 


CAGAGCTTTT 


TCTTATAGTT 


TAAAGCTATA 


GGATTGCCTA 


GGAAAGAAGT 


GTTAGAGCTA 


1500 


CATTGTATTT 


TTTGGTATAA 


TTAAAGATAT 


TTGTAAGAAA 


AGAGAAGTGA 


TATGACACAG 


1560 


ATTATTGATG 


GGAAAGCTTT 


AGCGGCCAAA 


TTGCAGGGGC 


AGTTGGCTGA 


AAAGACTGCA 


1620 


AAATTAAAGG 


AAGAAACAGG 


TCTAGTGCCT 


GGTTTGGTAG 


TGATTTTGGT 


TGGGGACAAT 


1680 


CCAGCCAGCC 


AAGTCTACGT 


TCGCAACAAG 


GAGAGGTCAG 


CCCTTGCGGC 


TGGTTTCCGT 


1740 


AGCGAAGTAG 


TACGGGTTCC 


AGAGACCATT 


ACTCAAGAGG 


AATTGTTAGA 


CCTGATTGCT 


1800 


AAATACAATC 


AGGATCCAGC 


TTGGCATGGG 


ATTTTGGTTC 


AGTTGCCATT 


ACCAAAACAC 


1860 


ATTGATGAAG 


AGGCGGTTCT 


ATTGGCTATT 


GACCCAGAAA 


AGGATGTGGA 


TGGTTTCCAT 


1920 


CCTCTAAACA TGGGGCGTCT 


TTGGTCTGGT 


CATCCAGTCA 


TGATTCCTTC 


GACACCGGCA 


1980 


GGAATTATGG 


AAATGTTCCA 


TGAATATGGG 


ATTGACTTGG 


AAGGTAAAAA 


TGCAGTCGTC 


2040 


ATCGGTCGAT 


CCAATATTGT 


CGGAAAACCT 


ATGGCCCAGC 


TTCTTTTGGC 


AAAGAATGCA 


2100 


ACAGTAACCT 


TGACTCACTC 


ACGTACTCAT 


AATCTTTCCA 


AGGTGGCTGC 


AAAAGCAGAT 


2160 


ATTCTGGTTG 


TTGCAATCGG 


TCGTGCCAAG 


TTTGTGACTG 


CTGACTTTGT 


CAAACCAGGT 


2220 


GCGGTAGTCA 


TTGACGTTGG 


GATGAACCGC 


GATGAAAATG 


GTAAGCTCTG 


TGGGGATGTT 


2280 


GATTATGAGG 


CGGTTGCCCC 


ACTTGCTAGC 


CACATTACGC 


CAGTCCCTGG 


AGGTGTCGGT 


2340 


CCTATGACCA 


TTACTATGCT 


GATGGAGCAA 


ACCTATCAGG 


CAGCACTTAG 


GACATTGGAT 


2400 


AGAAAATAAG 


ATAAAAATTT 


TCTGAGGAAA 


GTGTATTTTC 


TATAGCTATA 


TCTAAAATGA 


2460 


TAGAAATGAA 


TATTAAATTT 


TAGAAATAAG 


TTTATAAAAG 


GAGGTTTGCG 


CCTCCTTTTT 


2520 


GTTGTATAAT 


GGAGTGAGGT 


GATTAGATGA 


TTTTAAAAAT 


TTATAATGGG 


GAATATAGTT 


2580 


TACAATGGGA TGGAATATAC 


TACTTAGCAC 


TAATTGATTA 


TCCAAATATT 


CAAGAGTGGG 


2640 


AATTAGAAAA AATTGCTAAA TTTATAGCTT ACGAAAAACT TCATAAACGT 


CAAACAAGTA 


2700 


TTGAGTGTGC 


TGATTCTTGT 


TTAAAAAAAG 


AAATTTTAGA 


TTACATCTGT 


CAGCATCCCT 


2760 


TTCTGCCACC 


ATTTACTCCT 


ACAGATAAAA 


GAGTAGCCTC 


GACTTATGAC 


CTACATAAGA 


2820 


GGTTAGTGAC 


TTCAGACTAC 


TGTAGTCATA 


CTACGACTAT 


AGATGCAGCG 


ATTTCTATTT 


2880 
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1 1AAAACTGG 


TCGTCTTTTA 


TCTGCTGTGA 


AAGCCTTTGG 


GCGAGATGCT 


GAGGAGTTGG 


2940 


111 ILAjAI AG 


TCGAAATGCT 


GCATCTGATC 


CGATAGATTA 


TTTTGACTAT 


GTCATGTTAG 


3000 


IajCj 1 L.AAA 


TACAAGTTCT 


GGTTATCGAT 


TGGCGATGGA 


GCGTTTATTA 


GGTCGAGCTC 


3060 


(-11 L AGAGAA 


AGAATTACAA 


GACAAGTTTA 


TTCCTGGAGT 


AAGTTTTCAT 


TTTATCTATA 


3120 


LnuAl 1 HjAT 


TAAAGTTCCT 


GGTTATATTT 


TTGATGGTTA 


CCATGCTGTA 


AAAATTAAGG 


3180 


ft a nvoTm a * 
Al_ A 1 T AA 


TTTATTAAGT 


GAGTTGTATA 


TTTGCATTAT 


TCCAACTCAT 


AATAAGAGCC 


3240 


HJ\1 I IVjAAAA 


TATTATTCCA 


ACCAAAATAC 


AAGATAGGGT 


GTATTATCTT 


GACTATGCTG 


3300 


GAGAAGACTT 


AGAAGAGTGG 


ACTAAGAAAG 


TCTATCAAGT 


TGTTTTAAAA 


CAATCAGATA 


3360 


aaggatagtt 


GAGGAAAAAA 


CGATGAAAGT 


GATTGATCAA 


ACCTTACTAG 


AAAAAGTCAT 


3420 


tattgaacgt 


TCTTGTACAA 


GTCATAAAGG 


AGACTACGGT 


CGTCTGCTGT 


TGCTTGGTGG 


3480 


GACTTATCCT 


TATGGTGGTG 


CCATCATCAT 


GGCTGCTTTA 


GCAGCTGTAA 


AAAGCGGTGC 


3540 


AGGATTGGTA 


ACCGTTGGAA 


CGGACAGGGA 


AAATATCCCT 


GCTCTACACA 


GCCATTTGCC 


3600 


TGAGGCTATG 


GCCTTTTCTC 


TGCAAGATCA 


GTAATTGTTA 


CAAGAGCAAT 


TGGAGAAGGC 


3660 


AGAAGTTGTC 


TTGCTGGGGC 


CTGGTTTACG 


AGACGATACG 


TTTGGAGAAA 


ATCTTGTAAA 


3720 


ACAGGTCTTT 


GCTAGCTTAA 


AAAAGAATCA 


GATTTTGATT 


GTAGATGGAG 


GGGCCTTAAC 


3780 


CATCCTTGCT 


AGGACAAGTT 


TGTTGTTTCC 


ATCTAACCAG 


CTTATCTTAA 


CTCCCCACCA 


3840 


AAAAGAATGG 


GAAAAACTGT 


CTGGTATTGC 


TATTGAAAAG 


CAAAACGAAG 


GTACAACATC 


3900 


TAGTGCCCTG 


ACTTCTTTCC 


CTCAAGGAAC 


AATTTTGGTA 


GAGAAAGGTC 


CAGCTACTCG 


3960 


TATTTGGCAA GTTGGCCAGT 


CTGATTATTA 


CCAGTTAAAG 


GTTGGCGGTC 


CCTATCAGGC 


4020 


GACTGGTGGT 


ATGGGTGATA 


CACTGGCTGG 


AATGATTGCA 


GG ATTTGCAG 


GCCAATTTCG 


4080 


ACAGGCCAGT 


CTCTACGAAC 


GTGTGGCAGT 


AGCAACCCAT 


CTTCATTCAG 


CCATAGCCCA 


4140 


AGAACTATCT 


CAAGAAAATT 


ATGTGGTCTT 


GCCGACGGAA 


ATTAGTAATT 


GTCTTCCTAA 


4200 


AGTAATGAAA 


AGATATGTCT 


AAAATAGTTA 


GACAAAAAAT 


GTTGATAATT 


TGTATCATTA 


4260 


TTCTTAATTC 


ACAAAAAACG 


AACGTTTAGT 


ATTCTTCTTG 


CTAAGAAACT 


AAATTTGTTC 


4320 


GTTTTTTTAC 


TCTTGTAAAT 


CTATTTTTGT 


TAGAGTTGAT 


TTGGTTTACA 


TCCGTACTTA 


4380 


AATTGATTTG 


TTAGAGCTCT 


ACTTTTATTA 


AAAAAATTCA 


ATTTCAAGGA 


TAAATAAGCA 


4440 


GTATTCTAAA 


GGTACTTTTA 


GATGAAATAA 


AAGCCTTTAC 


ATGGTATAAT 


AGAGGTAGCT 


4500 


CTTTAATGGA 


GGTGTTTGAG 


TGGAAAATCT 


GAAGAAAATG 


GCAGGTATCA 


CGGCTGCTGA 


4560 


ATTTATCAAG 


GATGGGATGG 


TTGTAGGGCT 


AGGAACAGGT 


TCTACTGCCT 


ATTATTTTGT 


4620 


CGAAGAAATC 


GGTCGTCGAA 


TCAAGGAAGA 


AGGCTTGCAG 


ATTACAGCTG 


TGACGACTTC 


4680 
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TAGTGTGACC 


AGTAAACAGG 


CTGAAGGGCT 


CAATATCCCG 


CTCAAGTCTA 


TTGACCAAGT 


4740 


AGACTTTGTC 


GATGTGACAG 


TCGACGGGGC 


GGATGAAGTG 


GATAGTCAGT 


TTAATGGAAT 


4800 


CAAAGGCGGT 


GGTGGTGCCC 


TTCTCATGGA 


AAAGGTGGTC 


GCAACACCAT 


CAAAAGAATA 


4860 


CATTTGGGTG 


GTGGATGAAA 


GCAAGCTGGT 


CGAAAAACTA 


GGTGCTTTTA 


AATTGCCAGT 


4920 


AGAAGTGGTT 


CAGTATGGTG 


CAGAGCAGGT 


CTTTCGTCAT 


TTTGAACGAG 


CTGGCTACAA 


4980 


ACCAAGTTTC 


CGTGAAAAAG 


ACGGCCAACG 


TTTTGTGACC 


GATATGCAGA 


ATTTTATCAT 


5040 


TGACCTCGCC 


TTGGATGTCA 


TTGAAAATCC 


AATTGCTTTT 


GGACAAGAAT 


TGGACCATGT 


5100 


CGTTGGTGTT 


GTGGAGCATG 


GTTTATTCAA 


CCAAATGGTG 


GATAAGGTAA 


TCGTTGCTGG 


5160 


ACGAGATGGA 


GTTCAGATTT 


CAACTTCAAA 


AAAAGGAAAA 


TAGAAGGGGG 


CATAAGATGT 


5220 


CTAAATTTAA 


TCGTATTCAT 


TTGGTGGTAC 


TGGATTCTGT 


AGGAATCGGT 


GCAGCACCAG 


5280 


ATGCTAATAA 


CTTTGTCAAT 


GCAGGGGTTC 


CAGATGGAGC 


TTCTGACACA 


CTGGGACACA 


5340 


TTTCAAAAAC 


AGTTGGTTTG 


AATGTCCCAA 


ACATGGCTAA 


AATAGGTCTT 


GGAAATATTC 


5400 


CTCGTGAAAC 


TCCTCTTAAG 


ACTGTAGCAG 


CTGAAAGCAA 


TCCAACTGGA 


TATGCAACAA 


5460 


AATTAGAGGA 


AGTATCTCTT 


GGTAAGGATA 


CTATGACTGG 


ACACTGGGAA 


ATCATGGGAC 


5520 


TCAACATTAC 


TGAGCCTTTC 


GATACTTTCT 


GGAACGGATT 


CCCAGAAGAA 


ATCCTGACAA 


5580 


AAATCGAAGA 


ATTCTCAGGA 


CGCAAGGTTA 


TTCGTGAAGC 


CAACAAACCT 


TATTCAGGAA 


5640 


CGGCTGTTAT 


CTATGATTTT 


GGACCACGTC 


AGATGGAAAC 


TGGAGAGTTG 


ATTATCTATA 


5700 


CTTCAGCTGA 


CCCTGTTTTG 


CAGATTGCTG 


CCCACGAAGA 


CATTATTCCT 


TTGGATGAAT 


5760 


TGTACCGTAT 


CTGTGAATAC 


GCTCGTTCGA 


TTACCCTTGA 


GCGTCCTGCC 


CTTCTTGGTC 


5820 


GCATCATTGC 


TCGCCCTTAT 


GTAGGTGAAC 


CAGGTAACTT 


CACTCGTACG 


GCAAACCGTC 


5880 


GTGACTTGGC 


TGTATCTCCA 


TTTTTCCCAA 


CTGTTTTGGA 


TAAATTGAAT 


GAGGCTGGTA 


5940 


TCGATACTTA 


TGCTGTGGGT 


AAAATCAACG 


ATATCTTTAA 


CGGTGCTGGT 


ATCAACCATG 


6000 


ACATGGGTCA 


CAACAAGTCA 


AATAGTCATG 


GAATTGATAC 


ACTATTGAAG 


ACTATGGGAC 


6060 


TTGCTGAGTT 


TGAAAAAGGA 


TTCTCATTCA 


CAAACCTAGT 


TGACTTTGAT 


GCCCTTTACG 


6120 


GCCATCGTCG 


TAATGCTCAC 


GGTTACCGTG 


ATTGCTTGCA 


TGAGTTTGAT 


GAACGCTTAC 


6180 


CTGAAATTAT 


CGCAGCTATG 


AGAGAGAATG 


ACCTTCTCTT 


GATTACTGCG 


GACCATGGAA 


6240 


ATGACCCAAC 


GTATGCAGGA 


ACGGATCACA 


CTCGGGAATA 


TATTCCATTG 


TTGGCCTATA 


6300 


GCCCTGCCTT 


TAAAGGAAAT 


GGTCTCATTC 


CAGTAGGACA 


TTTTGCAGAT 


ATTTCAGCGA 


6360 


CTGTTGCCGA TAACTTTGGT 


GTGGAAACTG 


CTATGATTGG 


GGAAAGTTTC 


TTAGATAAAT 


6420 



i 
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TGGTATAAGA TGACGCGCTA TGCTTTGCTG GTGAGAGGTA TCAATGTTGG TGGTAAGAAT 6480 

AAGGTCGTCA TGGCGGAGCT TCGTCAAGAA TTGACAAACT TGGGACTGGA AAAGGTTGAG 6540 

AGCTACATCA ATAGTGGCAA TATTTTCTTT ACTTCGATAG ATTCCAAAGC CCAATTGGTT 6600 

GAAAAGCTAG AGACTTTCTT TGCAGTCCAT TATCCATTTA TTCAGAGCTT TTCTTTACTG 6660 

AGTCTAGAGG ACTTTGAGGC GGAACTTGAA AATCTACCAG CTTGGTGGAG CAGAGACTTG 6720 

GCACGAAAAG ATTTTCTCTT TTACACTGAG GGTTTGGATG TGGACCAAGT CATCGCGACA 6780 

GTTGAAAGTT TAGAGCTGAA AGATGAAGTG CTTTATTTTG GAAAACTTGG GATTTTCTGG 6840 

GGGAAATTTT CTGAAGAATC CTATTCTAAG ACTGCCTATC ATAAGTACTT GCTGAAGGTG 6900 

CCTTTCTACC GCCACATTAC TATTCGTAAT GCTAAAACCT TTGACAAAAT TGGTCAAATG 6960 

CTAAAAAAAT AATAAAGGAG ACACACAATG ACATTTTTAA ACAAAATCCA TGAAACTGCT 7020 

ACTTTCCTGA AAGAAAAGGG AATTGCAGCC CCTGAGTTCG GTCTAATCCT TGGATCAGGA 7080 

CTTGGAGAAT TGGCAGAAGA AATCGAAAAT CCAGTTGTAG TAGACTATGC TGAGATTCCA 7140 

AACTGGGGCC GTTCAACAGT AGTCGGTCAT GCTGGTAAAT TGGTATATGG TGAACTGGCA 7200 

GGTCGCAAGG TCTTGGCTCT TCAAGGGCGT TTCCATTTCT ATGAAGGGAA TCCTCTGGAA 7260 

GTGGTGACTT TCCCAGTTCG TGTGATGAAA GTTCTTGGAT GTGAAGGTGT TATTGTAACC 7320 

AATGCAGCTG GCGGTATCGG ATTTGGTCCT GGTACCTTGA TGGCTATCTC AGACCATATC 7 380 

AACATGACGG GGCAAAATCC ATTGATGGGT GAAAACTTGG ATGACTTTGG CCCACGTTTC 7440 

CCAGATATGT CTAGGGCCTA CACACCAGAA TACCGTGCCA CTGCCCATGA AGTGGCTAAA 7500 

AAACTTAATA TCAAGCTTGA TGAAGGTGTC TATATCGGAG TTACTGGTCC GACTTATGAA 7560 

ACACCAGCAG AAATTCGTTC CTATAAGACA CTGGGAGCAG ATGCAGTTGG TATGTCTACG 7620 

GTTCCTGAAG TTATCGTGGC AGCCCACTCT GGCTTGAAAG TTCTGGGAAT TTCATGTATC 7680 

ACTAACTTTG CGGCCGGTTT CCAAGAAGAA CTCAATCACG AAGAAGTTGT AGAAGTGACT 7740 

GAACGTGTTA AAGGTGATTT CAAAGGCTTG CTTAAAGCGA TTCTTGCTGA ATTGTAAGAA 7800 

AAAAGATTTA AAAGGGGGAG TGCCTCTGTT TTTTCAGGAT TGACTGCCTA TCCGGATTAA 7860 

AGAAGAAACA GAGGAATACT ATGAGCTTCT TCCTGCTCTT ATAACTGAAA GAAGCGGAAG 7920 

AATAGGTATG TCTGATCTGA TAGCCAGCAT TGTGAAAGAC AAGATTCTAG GATACTAGCA 7980 

TTAGCTTCCT AGCCAAGCAG ACTAGTATGA TAAGGAGAGA TGAGAATGAA TTGACTTTCT 8040 

GAATTTCTCA GTCTTATCAT ATATAGCACA ATGAGATTTC GCTTGAGTCT GCTTGTAAAT 8100 

AAACGAAAAG AAAGATAAGA AATAATGAAA ATTGGTCAAC GAATTATGCG CTTTGGCATA 8160 

AAAAATTAAG TATCGGAGTT GTATCTGTTG TAGTCGGCTT TGATTTCTAG CTCCAGCTGG 8220 
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AATTTCAGCC AATGAAGTAA AGCAAGATGT AACATCTGAA GTGGTAATAG GTGTGCTAGA 8280 

TTCTAAGGAG GAATTGAAAG AGTCAGAAAA TGATGCTCCA AAACTAGAAA CTCCTCTTAG 8340 

AGAGGAGCCA AGACTAGCTC CTCAAACGCT TCCGGAAGCA AGTGAAGTTC TTGAAAACAA 8400 

AAGGGAAGAG TCAAAAGTAG AGATAACATA ACCAGCTCAA GCGGATGATA TCCGCAAGGT 8460 

TGTTGGGGAA TTAGCCAAGG ATATAAGTAT TACTAAGTTG TATATGACAG GTCATTCTCT 8520 

TGGATGTTAC CTAGCTCAGA TTGCAGCGGT TGAAGCTTAC CAAAAATATC CTGATTTTTA 8580 

TAACCATGTA TTGAGGAAAG TGACAACTTT CAGTGCTCCT AAAGTGATTA CTTCCAGAAC 8640 

TGTTTGGAAT GCTAAGAATG GTTTCTGGGA TGTTGGTTTG GAAAGTCGTA AATTAGCTGT 8700 

TAGTGGAAAA ATTAAGCATT ATGTGGTTGA TAATGACAAT GTTGTGACTC CCTTGATTCA 8760 

TAATAATCGT GATATTGTTA CATTTACAGG TAATTCACGC TTTAAACACC GTTCTCGTGG 8820 

CTATTTTGAA AGTCCAATGA ATGATATTCC TAACTTTAAT ATTGGTAAAC AAGCTACCTT 8880 

GGATAAACAT GGTTATCGTG ATCCGAAATT GGATAAAGTG CGATTCTTTA AGAAACAGGC 8940 

TCTGCCTCGA TCTTCTAGTC AACCAAGCGC TGAACCAATG GAAAATATTG CCTCAGGAAA 9000 

ACAGGTTACT CAAAGTTCGA CAGCTTTCGG AGGAGATGCT AGAAGAGCTG TGGATGGCAA 9060 

AGTCGATGGT AACTATGGTC ACAATTCTGT CACTCATACA AACTTCCAAT CTAAGCCTTG 9120 

GTGGCAAGTA GATTTGGCTA AAGAAGAAAC CATTCGCCAA ATCAATATTT ACAACCGAAC 9180 

AGACACTGCC CAGGATAGAT TGGCAAACTT TGATGTCATT CTTTTAGACA GTTCTGGTAA 9240 

AGAAATTGAG TGAAAACGTA TAACATCTCC TAAAGATGTG TCAGCACAAA TTACGATTAA 9300 

CCATAAAAAA GCGCGCTATG TTCGGATTGA GCTAGAAGGC TATAATGCCC TCAGTCTTGC 9360 

AGAAGTTGAA GTTTTCTGCT TTATAGCTAC GAATGCTGAA ACGGCGACAC AAGTTTCTAA 9420 

GCCAGTTCAA CCAATCAGTC AGACTCCTGT GAAGGATAAA ACATTGACAA TTCAACACAG 9480 

TGGAGCTTAC ATTGCCCGCT ACTCCATAAC TTGGGAAGAA GTTCCAGTAG ATAAAGATGG 9540 

AAACCAAGTT GTTCGTAGTC ATTCTTGGGA AGGAAGCGGT CGCAACCAGA CTGCAGGTTT 9600 

TGTCCTCAAC CTCCCAATCA AAGAAAATAT GAGAAATCTG CGAGTTAAGA TTGAGAAAAA 9660 

GACGGGCCTA CTATGGAATA GATGGCAAAC AATCTATGAA AACAGACCAA TTTTAGCTCA 9720 

ACCCCACCGT AAAATTACCC ATTGGGGTAC GACATTGAAT TCCAAGGTGA GTGACGATGA 9780 

TGTCTTGTAA TCTGATGGTA GAATGACAGT TAGTTTGTCT AGTTTATAAG AAAGTACTAC 9840 

CTGAGCTTGA ATAGGACTCA GGTAGCTCTC TATGAAAGAA CAAAATTAAT ACTCAATGAA 9900 

AATCAAAGAG CAAACTAAGA AACTAGCCGC AGGTTGCTCA AAGCACTGCT TTGAGGTTGT 9960 
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AGATAAGACT GACGAAGTCA GTCACATATA TAATCCAAGG CGACGTTGAC GTGGTTTGAA 10020 

GAGATTTTCG AAGAGTATAA ACAGAAAGGT AGAGCGCGTG TTCTAATTTG AACACGAGTA 10080 

GAAAACTTTT CTAAAAACAA AAACGAAAGG ATGGGTAAAC TGTATTCGCT GAACTGAATA 10140 

CGGGCGACTC TCCTCTAAAT CAAAATTAAG AAAGGAATTG ACCCCACCCT AAAAGTAGTG 10200 

GGAAAAAGAT AGTTGATCTA GCGAGCATCG CTCACTGCGC CCAACTCCTA TTTTCCCTTC 10260 

GCTTTTTGAT GGGTTTGGTA TCTTTCTCAA TATAAAATAT AAAATAAAGA AAGGTAGAGC 10320 

GTGTGTTTTG ATTTGAACAC GAGCGGAAAA CTCGGAAAAT AGATAATCTG ACTGAAAAAT 10380 

CAGGATTTCT CGTCAGGTTC CTAATTTTCA GTCGTTTTCT TCTCGCTCTT TGTATCATAA 10440 

ATTATGTCTA TCCATATTGC TGCTCAGCAG GGTGAAATTG CTGATAAAAT TCTTCTTCCT 10500 

GGGGATCCTC TTCGTGCTAA GTTTATTGCG GAGAATTTCC TTGATGATGC TGTTTGTTTT 10560 

AACGAAGTGC GTAACATGTT TGGTTACACT GGTACTTACA AGGGTCACTG TGTATCTGTC 10620 

ATGGGAACTG GGATGGGAAT GCCATCTATT TCGATTTATG CGCGTGAGTT AATCGTAGAC 10680 

TACGGTGTGA AGAAATTGAT TCGTGTGGGA ACTGCAGGTT CTTTGAATGA AGAGGTTCAT 10740 

GTTCGTGAAT TAGTTTTGGC GCAGGCGGCT GCAACCAACT CAAACATCGT TCGTAATGAC 10800 

TGGCCACAGT ACGATTTTCC ACAAATTGCT AGCTTTGATT TGCTTGATAA AGCCTACCAT 10860 

ATCGCCAAAA AACTTGGTAT GACTACTCAC GTTGGGAACG TTTTGTCATC TGATGTCTTT 10920 

TACTCAAATT ACTTTGAAAA GAATATCGAG CTTGGTAAAT GGGGAGTCAA GGCTGTGGAA 10980 

ATGGAAGCAG CAGCTCTTTA CTATCTTGCT GCCCAATACC ATGTTGATGC GCTAGCTATC 11040 

ATGACCATCT CTGATAGCTT GGTCAATCCA GACGAAGACA CAACTGCAGA AGAACGTCAA 11100 

AATACCTTCA CTGATATGAT GAAGGTTGGT TTGGAAACCT TGATTGCAGA ATAATTATAG 11160 

CCAAAAAGGG GCTCTTTGTC AACTGTAGTG GGTTGAAAAA AAGCTAAGCT TGAGAAAGGA 11220 

CAAATTTCGT CCTTTCTTTT TTGATATTCA GGGCGATAAA AATCCGTTTT TTGAAGTTTT 11280 

CAAAGTTCCG AAAACCAAAG GCATTGCGCT TGATAAGTTT GATGAGATTA TTGGTCGCTT 11340 

CCAGTTTGGC ATTAGAATAG TGTAGTTGAA GGGCGTTGAC GATTTTCTCT TTGTTCTTTA 11400 

GAAAGGTTTT AAAGACAGTC TGAAAAAGAG GATGAACCTG CTTCAGATTG TCCTCAATGA 11460 

GTCCGAAAAA TTTCTCAGGG TCTTTGTTCT GAAAGTGAAA AAGTAAGAGT TGATAGATCT 11520 

GATAGTGGTG TTTCAAGTCT TCTGAATAGC TTAAAATCTT GTCAAGAATT TCTTTATTTG 11580 

TTAAGTGCAT GCGAAAAGTA GGGCGATAAA AACGTTTATC GCTsArTTTA CGACTATCCT 11640 

GTTGGATGAG TTTCCAGTAA CGCTTGATAG CCTTGTATTC ATGAGATTTT CGTTCAAACT 11700 

GATTCATAAT TTGAACACGA AAACGACTCA TGGCACGGCT GAGATGTTGG ATAATATGGA 11760 
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AACGATCTAG AACGATTTTA GCACACGGAA AAAGCTGTTT AGCCAAGTCA TAGTAAGGAC 11820 

TAAACATATC CATCGTAATG ATTTTCACTT GACAACGAAC GGCTCTATCG TAGCGAAGAA 11880 

AGTGATTTCG GATGACAGCT TGTGTTCTGC CTTCAAGAAC AGTGATAATA TTAAGATTAT 11940 

CAAAATCTTG CGCAATGAAA CTCATCTTTC CCTTAGTGAA GGCATACTCA TCCCAAGACA 12000 

TAATCTTTGG AAGCCGAGAA AAATCATGCT CAAAGTGAAA GTCATTGAGC TTGCGAATGA 12060 

-CAGTTGAAGT TGAAATGGCC AGCTGATGGG CAATATCAGT CATAGAAATT TTTTCAATTA 12120 

ACTTTTGAGC AATTTTTTGG TTGATGATAC GAGGGATTTG GTGATTTTTC TTTACCAGGG 12180 

GAGTCTCAGC AACCATCATT TTTGAAsAGT GATAGCACTT GAAACGGCGT TTTCTAAGGA 12240 

GAATTCTAGA AGGCATACCA GTTGTTTCGA GGTAAGGGAT CTTAGACGGT TTTTGAAAGT 12300 

CATrTTTCTT CATTAGACTT CCACAATCAG GGCAAGATGG AGCCTCATAA TCCAGCTTAG 12360 

CGATAATTTC TTTGTGGGTA TCCATATTGA TGATATCTAG AATCTTGATG TTTGGGTCTT 12420 

TAATATCGAG CAGTTTTGTG ATAAAATGTA ATTGTTCCAT ATGATTCTTT CTAATGAGTT 12480 

GTTTTGTCGC TTTTCATTAT AGGTCATATG GGACTTTTTT TCTACACAAA AATAGGCTCC 12540 

ATAATATCTA TAGTGGATTT ACCCACTACA AATATTATAG AGCCCAAAAA GGAAGCCCTT 12600 

TATGAATTGT AGGACTTCCT TTTCTTATCC AGAAATTGAT CTAGCTCTCT CTGATTTCGA 12660 

AGAATAGTGA CTTTATGTGA ATATTCTTGG CAAAGTTTTT GGTAATTTTC TTTTTGAGTT 12720 

TTGCGGACGC CCATCCCAAA GAATCCATCT GATAAACTCC CACTCAAAGC GTTCAGGGCA 12780 

ATCTACCGCC ATACTTTCTC TGACTTTTCC ACGGTATTTA AGATAACGCT TAAAGGCTCT 12840 

AAAGAGACAG GTCAATGGCG AAAAATTGAG AAAGATGATT TGGTCAGCTT CTTGCATTCG 12900 

TTCTTGGTAG TAGCACCAAG AATAATTACC ATCGATGACC CAAGCTTTAT GCTTGGTGAG 12960 

AAAGTTTTTT ATCTCGGTTA ACATCCATTC GCAGTCACTG TCTTGCCAAC CAGGTTGAAA 13020 

TTGGAGTGTG TCCATGTGCA GTTTTGGAAT GGAGTAGTAG TTAGATAACT TTTCTGCTAT 13080 

AGTTGACTTA CCAGAACCAG AATATCCGAT AATTGCGATT TTCATTTTCT ACCTTTTCCT 13140 

ATTTGGAGAC AAAAAAACAG CCTCTATGGA CTGTTTCTTA TTTAACAAGT TTAGCTGAAA 13200 

GACGAGCTTT ATCGCGGCTT GCTTTGTTTT TGTGAATCAA ACCTTTAGTT TCTGCTTTAT 13260 

CGATAGCTGA GCTAGCAGCA CGGAAAAGTT CTTCAGATGG GTTTGCTTCG AAAGCTTTTA 13320 

TAGCAGTACG CATAGCTGAT TTTTGAGCTG AGTTCTTTTC GATTCGTCTA ACGTTCAATT 13380 

CAGCGCGTTT GATAGCTGAT TTAATGTTTG CCAATGGTCT TACCTCCATA TTTACTAACT 13440 
(2) INFORMATION FOR SEQ ID NO: 129: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8512 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 

CCTTTTTTCA AAAACTAGAT ACTAGTCTAT CAAAAGTAGG AAAGGGTTTC AAGAAAATTG 60 

ATTGGAAATT TTTTGAAAAT CATAGAACTA TTAGCTAATC CCTAGTATTG AAAAGACTGG 120 

ATAGCTTCTT TCAGGTCATC TTGTAAACTA TTTCTCTGGT CAAGTTGGAC ATAGACTTCC 180 

ACCAGACAGG ATCTAAAGTT GGAAAATTTG TAAAAATCCT CCCTTTCTTC TATCGGAAAA 240 

TCAACAGTTT TTATCCAAGA AGCTACTTGT TCTTGCTCCA ACTTCCCTTG TAAAATAGGT 300 

TCATAGATCA CTCTTGCTAA ACGCCAATCC TCATCATCTG TAAAGCGAAT CGACATTCTT 360 

TTAAATAGTT GGCCAAGTAT ATCAAATACT TCATGAACTC TGTTTTTAGG AAAGTCTGGA 420 

TGACAAACCA CCTCTGTCAG TAAATCGGCT CCATGTGCAA AAGCGTGAAC CCAACCATAC 480 

TGACTTGAGA AACCCCTTGT ATCCTTTTCT TTTGAAAGAT AGTGCAAGCC TTGATTTAAA 540 

AGGACATTAC GAATTTCTGG AGAAGGATTT CCCAAATGAT CAAACAACCA CTGGATTTCT 600 

TCCTGGTTAT AATTTGGTTT TTCTTCTGCT ATTTTTCTTA GTAAATCTTG ATACATGGTC 660 

AATACCTCTA CATTTCTAGC AACTGTTCAA AAAGGCAGTC TTAAATGACT CAATATTGAA 720 

TTCTCAATTA AATACAATCT GATATAAAAT GACGTAAATA ACTATCAATA CCAGTTCTAC 780 

AGTAAGTTCA AATTTAACAT CACGACCTTC AACGACATTT TTGAAAATAG CTACAACTAA 840 

GACAAATAGA ATGACGCTTA ACAAGCCCAT AAACATCATT CTAAAAAATT TTTCTATTCC 900 

CCTACTCTCC CAACTCAGCA CTATAGGAGA TAATCTGGTC AACTGTGTCA GACAAGAATT 960 

GGATGGTATC ACGGAGTGGT TTGTCTGTTG AAATATCAGC ACCGATAATC ATGGCTGACT 1020 

CAAGTGGTGT CTTGCTACCA CCTGATTTOA GGAGATTGAG CCAGTCTTCA GCTCCAGTTT 1080 

CAGAATGTTT TAGATGAAGG TAACCAGCAG TCGAGATAAC TAGTCCTGCT GAGTAAGTGT 1140 

AACTATACAA GCCCATATAG TAGTGAGCTT GGCGCATCCA AGTCAGAGTT GCATCATCGT 1200 

CAATTTCAAT AGCATCTCCC CAGAAATCCG TCAAAACTTC CTTCATAATG CTGTTGAGCT 1260 

TGCTTGCTCC AAAGGTCTCC CCTTCTTCAA TCAATGTATA CACCTTACGC TGGAAGGCGG 1320 

CTTCCAAGAG GTGGGTGATA AAGTTATGGA AGTAGGTGTC TGTCAAGCGA TGAGCCAGAG 1380 

CGAAGCGTTT TTGACGTGGG TCATTAGACT GGTTCTCCAA GTAATCACTG AGTAGCAATT 1440 

CATTGAAGGT TGACGGTGCT TCAACATAGT AGGTCGACAT ATGGGCATTG AAGTAACTTT 1500 
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GATGATTGTC TGAAAAGATG AATTGACCAG AATGCCCGAT TTCATGAATC AAGGTATAGA 1560 

CATCGCTCAA ACGGCCTGTC CAGCTCATGA GTACATAAGG GTGTACGCGA TATGGGTCCG 1620 

CCGCATAACC ACCGGAATCC TTGCCACTGT TAGCAGCAAA GTCCACCCAG CGCTCTTCTT 1680 

GGTAACGAGC AACTTCCTGA CAATATTCTT GCCCCAAAGG TTCTACCGAC TTCATGACCA 1740 

AATCATAGGC ATCGTCAATA GTCACTTCAG GATTCAGGGC GCTGTCCAAG TCCAATTTCC 1800 

AGTCTGCAAA GGTCATCTTT TCAAGACCAT TTACCTTGGC AACATGCTTG AGGTATCTCT 1860 

GAGCGACTGG TGCAAAGTCC TTCATGATGA GGTCAATCTG GCGGTCAAAC ATGACACGGT 1920 

CCACTTCTTG TTCAGCTAGA AGATAGTCAA AGACAGAGTC GTATCCCTTC ATATCAGCCA 1980 

AGAGTTTTTC AGACTTGACC TGAGCCAGAT AGGCTGCTGC AGCCGTATTT TGGTGCTTAC 2040 

GAAGTCCCTC TGAGAAGGAA CGGAAGGATT TCTCACGAAC CTCAGCATCC TCATGGTTTT 2100 

GGTAGAAATT CTCATAGGTC ACAAAGCTGT TTTTGTAGGT CTTGCCATGG GCTTCAAAGT 2160 

CAGCCATTTC AAAATCCCCA GCTCGCATCT TAGTATAAAT GTCCTGCGGA CTGTAGAAAA 2220 

CTTCACCGAG ATTTGTCAAG GCCTTCTCCA CATCTGCCCC TAAGTAGTGG GCTTTTTTGA 2280 

TTTTAGCCTG ACGAATGGCA GCTGTTAAAT GTGGCAATTT ACCCAAACGG TCCAAGACTT 2340 

CCTCATCTGC TGCCACCAAG GCATCGTCAA AGAAGGTCAA GGCTACGCTG GCATCTGTTT 2400 

CAAATTCCAT CCCAGCTTGG GCAATATTGG CAAATTCGTC ATTGCTATAG TCCGTCGTCT 2460 

GAGGCATAAA ACCATAGTTG CCAATATGGC TCATCTGAAT GTAGATCTGT TCCAATTCCG 2520 

CAAAGGCCTT CTCGAAATCC TCAAAAGTGT GAAGATTGCC CTTGTAATCA CGGCTAAACT 2580 

GGTTGATGTC TTCGCGAGCT TTCTCGATTG CACGCAAGAA ATCCTCACGG TCTTGGTATA 2640 

GGGCTGTTAA GTCCCAGAGT TCCTTCTCTG GAAATTCTGA ACGGTGTTTT TGTTCCATTT 2700 

TCTTCCTCTT ATTTCTCTAA TTCTACTAAA ACACTAAGGG CTGATAAAGC GTAAAGCGGT 2760 

GCTGTTTCTG CTCGCAAAAT ACGAGGACCT AGGCCTGCCA AAACGGCTCC TTTAGCTTCA 2820 

AAACTTTCGA TTTCTGCAGG TGAGAGACCG CCTTCTGGAC CAAAGATAAA GAGCAGTTTG 2880 

GCTCCTGTTT CAAGACCAGT GACTGCTTGC AGAAGCGCAG CGGCTTCTCC TTCTTTAGCT 2940 

GATTCTTCAT AGGCTACTAT GATAGAGTCA AACTGGTCCA GCTGAGCTAG AAAATCTGCT 3000 

TTTTTCTCGA AAAGTTTAAT ACTTGGTACA ATATTACGCT TGCTTTGCTC GGCTGCTCCA 3060 

AGGGCAATTT TTTCTAGTTT TTCAACTTTT TTACCCAATT TCTTGCCATC CCACTTGGCA 3120 

ACTGACCAGT CTGCAGGAAA GGCCCAGATT TGGCTAGCCC CCAGTTCGGT TACTTTTTGA 3180 

GCGATGAACT CCAGCTTGTC TCCCTTGGGA AATCCAGATG CGATGGTCAC TTGGACTGGT 3240 



WO 98/18931 



PCT/US97/19588 



892 

AGTTCCACAT TGTCATTTAA TTCTTGGACC AACTCAAACT GACGATTTTC CATATCCAGC 3300 

ACGCGCGCCA AGCGCTTGAT GCCATCATCA AAGACTAAGG TAACCTCATC CTCTTCTTTC 3360 

AAGCGCATAA CCTGAAACAT ATGCTTACTG GTTTCCTTGT CCTCGATAGT GACAGGAGAG 3420 

ATAGCACTGC CTTTTACAAA ATACTGCTGC ATGCTAGCCT CCAATCACAC CAGAGATATC 3480 

CTTGGTTTTC TTAAAGACAC AGGTATTCCA TTCCCCTTGA ACCATGTGAG TTTCGAGGAA 3540 

AAATCCAGCT GACTCAGCCG ACTGGCGCAC CATGTCCAAC TTGTCCTTGA TAATGCCACT 3600 

CATGATCAGG TAGCCTTCAT CCTTTACCAA GCGATAAGCA TCGTCTATTA GATGAATGAG 3660 

GATATCCGCC AAGATATTAG CCACAATCAC ATCTGCCTCA ATTTCCACAC CCTTAAGCAA 3720 

ATCTCCAGCC GCTACATGGA TATTTTCCAT GCCAGGGTTG AGCTCAATAT TTTCCTGAGC 3780 

CACACGAACC GCCACATCAT CCAGGTCATA GGCGAAAATT TCTTTAGCCC CCAGAAGCGA 3840 

GCTGGCAATA GAGAGAACCC CTGAACCAGT CCCCACATCT AGCACCGTTT CGCCACCACG 3900 

AAGAACCTGT TCCAAGGCAA AAAGGCTCAT CTTGGTAGTT GGGTGGGTTC CAGTACCAAA 3960 

AGCCATGCCA GGATCCAGCT TGATAATCAT TTCCCCCGCA GTCGCCTCAT AGTCTGTCCA 4020 

AGAGGGAACG ATGGTCAAAT CATGAGTGAT ACGAGCAGGT TCATAGTATT TCTTCCAGTT 4080 

GTCTGCCCAG TCTTCCTCAG CCAAGGCAGT CGTACCTATT TTTAACTCTC CCAAATCCAT 4140 

AAAATCTGTC AATTCTGCTA GACGAGCCTG CAAATCCGCC TCAACCACTG TCACATCCAC 4200 

CGTGTCAGGG TAGTAGGCTG TCACTACGAT TTCTTCTTGC TGCTCCACCT CTGGGAAAAT 4260 

CTCTCCAAAG CGGTCCACAT TTCCCACATA GTCCATACTG TCTTCGATTG CGACTCCTTG 4320 

CGCTCCCAGC TCAATCAAGA GATTGGAAAC CAACTCCTCT CCCTCACGCT TCACTGTAAC 4380 

TTTTAACTCT TGCCATGTTT CCATTATTAA TACCAAGCCC GTAAAACACA AAACCAAAAT 4440 

AGGAAATTCT CTGAAGACGC TTGTGTCTAA GAGAAGTTTA TCTTTTTGGC ACAGTGTTTA 4500 

GGGCGGGTTC AGTTTAGAAA TGTAACTGAA CCATCCTTTC TAATCACTTA CTTTTAAATA 4560 

ATCTTTTAAT CTCTCTTGCA ACTGAGGCAC AACTTGACTG GAACTAAGAA ATTCCTCAAC 4620 

ATTCATCAGC TGATAGCCCT GTCCTTCATC TCCGAAGATG ATATTGTCAA ATTGTTCTTG 4680 

TCTTAGCTGA CCAACCATAA AGACCGATTT CTTGCCTTTA AAAATTACGC TAGGATAAAT 4740 

CTTGCTCCAA AGCAGACAGT CTTCATCTAA ATGAATTCCC AGTTCCTCAT AAACTTCACG 4800 

CCGAGCGCAT TCAAAAGGGC TTTCGTCCCC TTCACGGCCA CCACCTGGCA GTTCCCACAT 4860 

ATTGGCCCAG GGAATACTTG CCTTATCATC GCGTAAGATA GTCAAAAGCT TATCCCCACA 4920 

AAACAAAGCA ATCTTGCAAC CTGTGAAATC AGAAATTTCT AGTTCCATCT TCAGTTCCTT 4980 

CTAACATTTC CTTTTCCAGC TCGGCTAACC AGTTTTCATA ATATCTTTTC TCATCCCTCA 5040 
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ACATTCGACT ACTATCCATT TTCTGTCTAG CAATCTTGAG AGCCTTACGA GTTCGATCTA 5100 

CATCTTTCTT CACCTTTAAT TGATACCAGG CTTGTATCAC TTGAAGATTG GACAGTTTGA 5160 

GAGACAGAAA CGATTTGACC TGTCGAATAC TAGCATATTG CTCCGCTTGC TCAAAATCTC 5220 

CTTCCAACAA GGCGATATGA AGCAGGGATA GTTGGGCAAC TGTCTGCATC ATCGGAGTAG 5280 

TTGTCCTCTC AAGTAATGCT TGAAACTGCT GTTTAGCTAC TTCTTCCTTC CCTTCCAAAA 5340 

-TGGAAACTTC ACCTTGCATA CCTAATACAC CATCCGCAAA ACTCCCTCGT GCATCCTCAG 5400 

GAACTGCTTG AACAAAGTCT TTCAAATCAT ATTCTTGAGG AGCTAGCAAG GTCTGGGCAG 5460 

AATGTCTCAA TACCAGGTAG GCGTATTTGG TATTTTCAGG GTGTTGTAGT AATTCCCAAA 5520 

TTTTTGCTCC ATCGGTGATG TCGACTGGCA AAATGTTATT TAGGAAGAAA GATAAATTAA 5580 

GAAAAATCCA AGTCCCTGCA AAATACCAGC TTCTTGTCAA AAATCCAAAC AATATCGCCA 5640 

ATAATATCAA GCCGAGATGA ACCATCAAGC CTCCTGAAAG CATCAGGATG ATTCTTTGAT 5700 

CGCTTTCATC CTCTTTTAAA CCAATGTATT GAGCACCAAC ATTTTTCAGA ATGGCTGTTC 5760 

TACTAAGATG AAACCTGCCT GACTTTTTGG TCAAAATAAA ATGTCCTAAT CCAAAAGCCA 5820 

CCAGCCGATA GCCTGTCAAG TAGCCACAAA AAGCATGACC CAGCTCATGA AGAATAAAGA 5880 

TTAAATACAT GCTTAGAAGA GCGAAGGCAT AACCAAAAGT AAAGGCTAAA ACTGCGGAAT 5940 

ACCCCAACTC TGCAAATGCG ATTGTTCCAC AAGCAAAAGC TAGCATAATA AAGACAACAG 6000 

CTAGCACATA AACCAAATAA GTCCCAATTT TCTTCATAAC ACCTCCAACC AACTCCTAGT 6060 

ATCTTGGATA AGGATAAAAT TCTCCCTTTT CCAAGCCAAT TTTTCCTTCT TCAAAGACTT 6120 

CTTGGTTCCA TTCCATGACA AATTCCTCTG CTTCTGGGTC TTCCAAAAAG TCCATGAGGA 6180 

CATCTAGCCC AACCTCAGCA GTATCTTTAA GGAAAAGCGC AAAATAAGCT AAAAATTCAC 6240 

GGGAAAATCC TTTTTTAGGC AGGTAAGGAA TAACAGTCAA ATAGTCTTCC TCATTGACTG 6300 

TTGACTTGGC AGGATTGTAG AAAAGGACCG CTTCCTCAAA AAGAATGTCA TCTGATGAAA 6360 

CCTCTCCGTC TTCATCCACC ATCTCCACAC CGCAGCATTT TGCGCTTCCA ATAGAAAACT 6420 

CACTTCTACC GCATGGTTGC GTTTGTCCCA GCTAATCTCA AAGTCAAAGG GAAAGTTCTT 6480 

GTCCAACTCT TCCTCTAAAA TATCTAAAAA TCCGTATGTT GCCATTTTGT CCTCTTTCTA 6540 

TGCGACTCTT TAATCGCCCC GATTGCTCGG AAATATGCTA AAATAGATAC TACCATCTTA 6600 

CCACAAAATT ATTTTATGTC CTAATTATAC CATATTACCT CATTTAAACC CTTGGTATCA 6660 

GTGATTTTCT TAAAAGTCTG ATTTCTTCAT TTCTCATAAA AATCAATATA AAAAGCCCTC 6720 

GAAAGGGCTA ATAAATCTAT AAAATCAATA GGCGAGTAAC TAGCACAAGT GGACGTGCTT 6780 
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TTTTATTGAC TATTACCACG ATACCACGCT TAATCTTAGG CTTGAACTTT CTTATCTGCA 6840 

ATAGCGTCTG TCAAAGTCTG AGAAAAGTTA AGCCCCATTT CTCGTCCCAA CTTATCTGCC 6900 

CATTTTGGTA TGGTCAAAGT CTTTTTAATG GGTTCCTGAC TTCCTAGGTA TTCTGATACA 6960 

TCAACAGATA CCATAGAAAT AAAAGATTTA TCAAGGTCAT AGGTTGACAC GAAATCTTCA 7020 

TCATCTTTAA AAGGATCATT ATCAATTAAA GACAAGCTAT TGATATCTGA TGGCTGAGGT 7080 

AACTCTCCAT CACTCTCTAT CAAATCTGCA ACAGTTATCC CTAGCCACTC CGACCCCATA 7140 

GCCAAAGCCT CAGAAATCCC CTCTCCTTGT GTAGCTGAGT ATTCAAAATC TGGGAAATGG 7200 

ACAAAATAAG TCGCTTCTGT TCCGTCTGTG TCGTCATAAT AAAATAAAGC TGGATACGTA 7260 

ACTAACATTT CACTACCTCC ATATCAAAAA GCAGGGACTG AATTTTACAA CCCAGCTTGC 7320 

TTTCTTATCC CTCTTTCAGT GTACTTATTC AGCTCACCAT GAAGGATTGT GATAGGTCTT 7380 

TCCCCTTGCT TTTCCATTTT AATATGGGAG CCTTTACCX3C CTCTAGTCTT TATCCAACCA 7440 

TGGGCCGTAA GGAGTTTAAC CATCTCTTTT TGTGTCATAG GCATAGCGCT TTTACCTCCT 7500 

GACAACACCA TTATAACACG TGTTACACGT ATTGTAAAGG AGTGATACTT ATTATTCTAT 7560 

TATACATAAA AGCCCCTAGA TGTGGTTCTA AGGGAAGCCA ATTTATTCAT ACCTATTTTT 7620 

CTAATGAGTA GTAAAAACTG CTTCTTTATC GAGCAATTCA TCATCTGTAT AGTCAATTGT 7680 

AAAAGTATCT CGATCTAAGA CAGATTGAGG CGGAGTTGAA TGAATCATAG GAACACTGCG 7740 

TACTCTATAT TTTTTATCTC CAATTTTTAC AAACTGATAC TCTTCGAAAA TCAAATTCAA 7800 

ACCACGTCAA CGTCGCCTTA CCGTACTCAA GTACAGcCTG CGGCTAGTTT CCTAGTTTGC 7860 

TCTTTGATTT TCATTGAGTA TGATTAACTC TCAAGTCTTC GAAATCAGGA TTTTCAACAG 7920 

TTATTACAAG GAGGCGATTT ACTACTTCAA AAACATCAAT TATTCTATTT TTCATATTTT 7980 

TTCAACCCAT TATTAGAATG AACTTCTTGG TAAGCAAAAT CAAGTTTAGA TTTAATGTTT 8040 

TCGTACAAAT CTAAAATCTC TTTTGGAGTA TCTTCCCGGA AGAAAAGTTT TCTTTTCCCT 8100 

GAAATAACTT GATCACTAAG AATCCAATGA CGAATTTGTT TTGTAAAAAT CAAAATTTCC 8160 

TGACTTGGTA GTTCCATCAT TTCCATTGCT TATCACCTCT CTTTTCATTA TAGTTCATAC 8220 

AATGACATTC AGCAATATTA TTTCTCAAGT CAGCACTTCC ACTTCTTTAG GCTCAACTAT 8280 

CCTATTTTGA GCTTTAAGGA AAATCAAATC TCTCATGCTG ATACCTCTCC TCATTAAATT 8340 

AAATAGTAAA AAAGATTCTA TCTCACTCCC TGATTATTAC AAAACCATTG AAATATCACA 8400 

ACTAATAGGC TAGAATGGAC ATAGTAAGAT ATAGTAGATG AGTCATTCTA CTCAAATCCA 8460 

CGTTAGAAAG GACTGCTATG CCAGACAATC TCGCCGTTCG CATGCGCCCn GG 8512 
(2) INFORMATION FOR SEQ ID NO: 130: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2869 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 

CTCGTTTCAA GGTTGAGTCT CTTGCAAATC TTGTTCGCGT TCTTCCTTTT GCCAAGGCAT 60 

CTCTCCCATG GTTGGTGCcA GCCATTGTTG GAATCTTGCT CTCATTGGTT CTACCAAACA 120 

AGCAAGAAAG CGATGTTTTT GAAATGGAAT AATCACTTAA ATCACTTTTG TAGCCAAGTC 180 

TACAGGAGTG ATTkTCTTTT TTTATCCGAT GATAAATGTG TTATAATAGG TAG CG AAAGA 240 

GGTGAAGAAA TGAATCAAAC AGTAGAATAT ATCAAAGAAC TGACAGCCAT TGCGtCGCCA 300 

ACAGGCTTTA CTCGTGAGAT TGCGGACTAT TTAGTCAAGA CTCTAGAAGG TTTTGGTTAC 360 

CAGCCGGTTC GCACATCCAA GGGCGGTGTC AATGTAACTA TTAAAGGTCA AAATGATGAG 420 

CAACATCGCT ATGTGACTGC CCATGTAGAT ACGCTTGGTG CTATTGTCCG TGCTGTCAAA 480 

CCAGACGGCC GTCTCAAAAT GGACCGTATC GGTGGCTTTC CTTGGAACAT GATTGAAGGA 540 

GAAAACTGTA CCATTCATGT GGCTAGCACA GGTGAAAAAG TATCAGGAAC CATCCTCATC 600 

CACCAAACTT CTTGCCATGT CTATAAGGAT GCAGGAACTG CAGAACGCAC GCAAGACAAT 660 

ATGGAAGTGC GTTTGGACGC CAAAGTAACT AGTGAAAAAG AAACTCGTGC TCTTGGCATT 720 

GAGGTCGGTG ATTTTATCAG TTTTGACCCA CGAACTGTCG TGACAGAGAC AGGTTTTATC 780 

AAGTCTCGCC ATTTGGATGA CAAGGTCAGT GCGGCGATTT TGCTCAATCT CCTTCGCATT 840 

TATAAGGAAG AGAAGATTGA ATTGCCCGTA ACAACTCATT TTGCTTTTTC AGTCTTTGAA 900 

GAAGTGGGAC ACGGTGCAAA CTCTAACATT CCTGCTCAGG TAGTAGAATA TCTGGCTGTG 960 

GATATGGGAG CCATGGGAGA TGACCAGCAA ACAGACGAAT ATACAGTGTC TATCTGTGTC 1020 

AAGGATGCTT CTGGACCTTA TCACTATGAC TTCCGTCAAC ATTTGGTGGC TTTGGCGAAA 1080 

GAGCAAGATA TTCCATTTAA GCTGGATATC TATCCATTTT ATGGTTCGGA CGCTTCAGCG 1140 

GCTATGTCTG CAGGGGCAGA AGTCAAACAC GCCCTTCTCG GTGCTGGTAT AGAGTCTAGC 1200 

CATTCCTATG AGCGTACCCA TATTGACTCG GTGATCGCAA CAGAACGAAT GGTCGATGCT 1260 

TATCTTAAGA GCACGTTGGT GGACTAATAT GTGCCTTATT TGTCAGAGAA TTGACCTCAT 1320 

CAAGAAGGAA GAAAATCCTT ACTTTGTCAA AGAGTTGGAA ACAGGCTATC TTGTGGTTGG 1380 

AGACCACCAG TATTTTGAAG GCTATAGTCT CTTTCTAGCC AAGGAGCATG TCAGCGAATT 1440 
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GCACCATTTG AAAAAGGAGA 


CAAGACTCCG 


TTTTCTAGAA 


GAAATGAGTT 


TAGTCCAAGA 


1 JUU 


GGCAGTTGCC AAGGCCTTTG 


CTGCTGAGAA 


AATGAATATC 


GAACTGCTAG 


GAAATGGCGA 


± JOU 


TGCTCATCTT CATTGGCATC 


TGTTTCCACG 


ACGGACAGGT 


GATATGAATG 


GTCATGGTCT 


1620 


CAAGGGTCGT GGACCAGTCT 


GGTGGGTTCC 


CTTTGAAGAA 


ATGACAGCAG 


AAACCTGCCA 


1680 


AGCAAAACCG GATGAGATTA AAAGATTAGT 


CAAACGTTTA 


TCGTCAGAAG 


TAGATAAACT 


1740 


ATTAGAAATA AAGGAGTAGA AATGAAGAAA AGATACCTAG TCTTGACAGC TTTGCTAGCC 


1800 


TTGAGTCTAG CAGCTTGTTC 


ACAAGAAAAA 


ACAAAAAATG 


AAGATGGAGA 


AACTAAGACA 


1860 


GAACAGACAG CCAAAGCTGA TGGAACAGTC 


GGTAGTAAGT 


CTCAAGGAGC 


TGCCCAGAAG 


1920 


AAAGCAGAAG TGGTCAATAA 


AGGTGATTAC 


TACAGCATTC 


AAGGGAAATA 


CGATGAAATC 


1980 


ATCGTAGCCA ACAAACACTA 


TCCATTGTCT 


AAAGACTATA 


ATCCAGGGGA 


AAATCCAACA 


2040 


GCCAAGGCAG AGTTGGTCAA 


ACTCATCAAA 


GCGATGCAAG 


AGGCAGGTTT 


CCCTATTAGT 


2100 


GATCATTACA GTGGTTTTAG 


AAGTTATGAA 


ACTCAGACCA 


AGCTCTATCA 


AGATTATGTC 


2160 


AACCAAGATG GAAAGGCAGC 


AGCTGACCGT 


TACTCTGCCC 


GTCCTGGCTA 


TAGCGAACAC 


2220 


CAGACAGGCT TGGCCTTTGA 


TGTGATTGGG 


ACTGATGGTG 


ATTTGGTGAC 


AGAAGAAAAA 


2280 


GCAGCCCAAT GGCTCTTGGA 


TCATGCAGCT 


GATTATGGCT 


TTGTTGTCCG 


TTATCTCAAA 


2340 


GGCAAGGAAA AGGAAACAGG 


CTATATGGCT 


GAAGAATGGC 


ACCTGCGTTA 


TGTAGGAAAA 


2400 


GAAGCTAAAG AAATTGCTGC 


AAGTGGTCTC 


AGTTTGGAAG 


AATACTATGG 


CTTTGAAGGC 


2460 


GGAGACTACG TCGATTAATA CTCTTCGAAA ATCTCTTCAA ACCACGTCAG 


CGTCGCCTTA 


2520 


CCTACTGACT GCGTCGGTTC 


TATTCACAAC 


CTCAAAACAG 


TGTTTTGAGT 


cGATTCGTCA 


2580 


GTTTTATCTG CAACCTCAAA 


GCTGTACTTT 


GAGCAstGCG 


GCTAGCTTCC 


TAGTTTGCTC 


2640 


TTTGATTTTC ATTGAGTACA AAAAGTAAAC 


TTTTCTCTTG 


CAATTCCAGA 


TAAATAGTGT 


2700 


ATAATGGATG GGTATGTGAA 


AAACATACTT 


GTGGGAGGTA AAAATCTCTA 


ATTACCGCCA 


2760 


AAACCACAAA GGAGGATTTA AAAATGGCTA AAAAAGTCGA AAAACTTGTA AAATTGCAAA 


2820 


TCCCTGCTGG TAAAGCTACA 


CCAGCTCCAC 


CGGTTGGACC 


TGCTCTTGG 




2869 


(2) INFORMATION FOR SEQ ID NO: 131: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6186 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 
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CTGAATCCCT 


TATAGGAGTC 


CAGTAACTTT 


TTAGCCTCTA 


CTTTGCCTTC 


ATAGGCAGCT 


60 


TCAACATCAT 


TAAAAAAAGA 


ArGCACTGAA 


GCAAGTTCTT 


CAGTGCTCCA 


CGACAAATCT 


120 


AGTGGGTAAC 


TATACTGTTT 


GTTCATTAAC 


TAATACCAGC 


TCTCATTCTT 


GCTTCTTTTA 


180 


GTTCTTGCTT 


ACGATAACTA 


CGAGGGAGAA 


AAGCACGAAT 


CTCATCTTCA 


TTAAAACCGA 


240 


TTTGCATACG 


CTTGGCATCA 


ATAATAATTG 


GACGACGCAA 


AAGACTAGGA 


TACTGCTCAA 


300 


-TCAAATGAAG 


CAATTCCGAT 


ACCGAAATAC 


TCTCTACATC 


AATATTCAAT 


TTTTGAAAAA 


360 


TTTTTGAACG 


AGTTGAAATG 


ATGTCATCAG 


TACCATTTTC 


GGTCAAGGAA 


AGGATGTGTT 


420 


GCAATTCTTT 


TCTTGTTAAA 


GGACTGGTCA 


TAATATTGTG 


TTCCACAAAG 


GGAACTTATG 


480 


TTTTTCTAAC 


CAGGCCTTAG 


CCTTACGACA 


TGATGTACAG 


CTCGGTGATA 


GAAATAGTGT 


540 


AATCATGCTT 


TTCTCTTCTT 


ATCTATACTT 


TGCTACTTCT 


ATTATACAAA 


AAAATAAAGC 


600 


GCTTGACTAG 


GGATTTTTAG 


AAAAAAAGCC 


TATTTTTTCA 


AGAAAAATAG 


GCTTTTTGCG 


660 


AACGATTGAC 


ACAATTGGAT 


TTGGTTAATT 


CACTCTTAAC 


GATGGTTTTA 


AACGATATAT 


720 


ATTTTTATAT 


ATGTAAATTA 


AAAACATCTT 


TCCTTTCACT 


TCCTACGACT 


TTTCAGATAC 


780 


AGATAGCCAA 


AGAAGTTTTC 


ATAGAGGGCA 


AAAAAGAGGA 


GGAAGGCATG 


AAGAAAGAAG 


840 


GTCTCTGGCA 


AAATCATAAT 


AACAGGATCC 


TTGGCTGGAT 


CAAAAAGCCA 


GGTATCATCT 


900 


CCCACAAAGA 


GAATTTGATG 


GAAAAGAGTA 


AAGAATTGGT 


CAAAACCAAT 


CAAAACTCCC 


960 


CCAAGTCCAA 


TCATCACAGG 


TAAGACTACT 


AGAGCCAGGA 


GACTTTTTCG 


ATAAAGAGAC 


1020 


AAAAAGTCCT 


TTTTCACAAT 


CCTATTGACA 


AAGACATAGA 


AACTTGGCAG 


TGTCACTAGA 


1080 


GCTACTAGCT 


GAACCAAATG 


AAAGAGATTC 


TTGACCACTG 


CGAAATGGTG 


CAGACCAGCT 


1140 


GCTGACGAAC 


GAAAATCAGG 


CATCTGTAAG 


ACCTGACTAA 


AAGGATTGGT 


CAGATAATTC 


1200 


ATCAAGATAT 


GAAAATTGTA 


TTGAATGGTT 


TCTGGTTTTA 


GATAGACTCG 


ATTCGTTAAG 


1260 


TTTAGCCACT 


GAATCTCCAT 


AGGATAGAAA 


ATCCAAGCCA 


GATAAATGGT 


CAGAAGGATG 


1320 


GAGAGGGAGA 


GGAGAAAGAG 


CATAGAGCCC 


CAAAAGATCA 


ATTTAGTTTT 


CATCAAAATC 


1380 


CCACTCCGCA 


AGGCTAGAAA 


CCACATGTGT 


CGGTGCGATT 


GGCAGGCCAG 


CTACTTCTTC 


1440 


TGCCTTAGTA AAACCTGTCG 


TCACCAAGAG 


CGTTGGAATG 


CCATTGTCAA 


TCCCAGCCCG 


1500 


AATATCAGTC 


AAATAATTGT 


CCCCAACCAT 


GATTAACTCT 


TCACGTTCCA 


AACCTAAGTG 


1560 


CTCAACCGCC 


TTGTCCATAA 


TGATGGCATT 


TGGTTTTCCG 


ATATAAACCG 


GCTTCACTCG 


1620 


TGTCGCTACT 


TCAAGCAGCG 


TAATCAGTGA 


GCCAGCACCT 


GGCAAAAGAC 


CGCGTTCCGT 


1680 


CGGGATGTTG 


AGGTCAGGAT 


TGGTTCCGAT 


AAAATGGGCA 


CCCTTTTGAA 


TAGCAAGAGT 


1740 
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TGCTGTGGCA AATTTTTCAT AGTCGACTTG CCAATCCAGA CCAACTACCA CGTAGGCAGG 1800 

TTTTTCCTTG TCTTCCACAT AACCAGCCGC CTTGATGGCT TCCTTGAGTC CTGCTTCTCC 1860 

GACGACATAG ACGGTCTTTT CAAGCCCCAA ATCATTCATA TAGTCGATGG TTGCCAAAGT 1920 

CGCTGTGTAG ACAGTCGATA GGGGCGTATC GATATTAAAA TTCTGAGCCA ACATCTCCTT 1980 

AACACTCTCT GGAGTGCGGG TTGTATTGTT GGTTACAAAG AGATAGGGAA TGTCCCGCTT 2040 

TTGCAATTCA TGAACAAAAG TCTCTCCAGC AGGGATTCGG TCTTTCCCCT TATAAATGGT 2100 

TCCGTCTAAA TCAATTAAAT AGCCTTTATA TTTCATCTAT TTCTCCCTAA GCCTTTTTTA 2160 

TTTCTTGCCA AGTAATGATT GCTTGGGCAT TGATAACCCC ATCACTTGTA ATTTCATGCT 2220 

TGCTTTCCAG TCCAGTCCGT TCAACAGCCG ATGTAATCAC CCCACCTGGT CGAACTTCCT 2280 

TGACATACTT GAGGTTGATT TTCTTGGGAA TATAGTGGGT CAAAAAATCC GCTCCCATGA 2340 

CCTCAAAAAT CCAGTCCAAG TATTTACTGT TATTGACATG ACCATTCATA TCCAAGTCGT 2400 

AAAAACGAAC ATGGTAATCC TTGCTGATCG GTTCTTCCAA GGACTCATAC TTCGGTCCAC 2460 

GGATAAGTTT TTTATCAAAA TCAGACTGGT AAGGAGCCAC AATCTCAGGT TCAACAACAT 2520 

GGACTTTTCG ACTGTCGCGG TCCATGAGAA CAAAGGTCGC CATCATGTGG ATGAGCTCCT 2580 

GCTCCGCTTC ATTATAAATA GTAAAGCGAC GGTAGCAAAA AAGTCGATTG TAGCTCAAGG 2640 

CTTCCGTTTC GATGGTAATT TCTTCCGCAA AACGAGGCAA ACGAACCACC TCAATATCAT 2700 

ATTCTACGAT AATCCAGACC AGATTATATT CTTCCAAAAT GGCCTTATCA CTAACTCCCA 2760 

GTTCAATCGA CTGCATCCCT GAAACTTGCA GTGACAGCAA AATCACATCT GGAAGTTTGA 2820 

TATGACCGTT CATATCAGCC ATATCAAAAG GAATTTTCAT TTTCATTTGA TAAGTTAAGC 2880 

CCATGATCCT ACTCCAAAAT AAATCGTTCT GCTACAGTAT CTCCCAAAAA GAGACCTCTC 2940 

TTTGTCATGC GAACGTGGTC ACCCTCAATC TGCATGAGGC CTTGTTGAAC CAAATCTCTG 3000 

ACAATTTCTC CATAAAGTCC AGCAAAAGAC TGTCCAAATT TTTCCTCAAA TCGCGCCATG 3060 

GAAACCCCGG ATTTCTTGCG GAGTCCCAAG AACATTTCTT CTTCCATTTG CTCCTTTTGA 3120 

CTCAGGTGAT CTTCTGTAAT ACAAGCATTG CCTTCCTCAA CCGCACTGAG ATAATGACGA 3180 

ATGGGACCAT GATTTTTATA GCGTACTCCA TTGACATAAC CAGATGCCCC TGCACCAATA 3240 

CCATAGTATT CAGCATTGTC CCAGTACATG AGATTATGAC GACTTTCAAA ACCGGGTTTG 3300 

GAGAAATTAG AAATCTCATA ATGCTCAAAA CCCGCTCGCT CCAGCTCTGC AATGATGTAC 3360 

TCAAACATCT CCGCTTCTAG TTCCTCCTTA GGCAGAGGCA ATTTCCCACG TCGCATCCGG 3420 

TTCATAAAGA CCGTATGGTT TTCTAAAATC AAACTATACA AACTCATGTG GGGAATATCC 3480 

AATCCAATGG CTTTAGCCAC ATTTTCCTTT ACTTGCTCCA TGGTCTGACC AGGCAGAGCA 3540 
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TAAATCAAAT CAATGGAGAT ATTGTCAAAA CCAGCCAGTT TCAGGCGATC GATATTTTCA 3600 

TAAATATCCT TCTCCAAATG ACTGCGCCCA ATCTTTTTCA ACATCTTATC ATCAAAGGTC 3660 

TGGACACCTA GCGAAACACG ATTGACAGCC GAATTTTTCA AAACAGCTAT CTTATCCGCA 3720 

TCCAAATCGC CTGGATTGGC TTCAATGGTC AACTCTTCCA AGACAGACAA ATCCAAGTTT 3780 

TTAGTCAAGC CATTCAGTAA CACCTCCAGT TGCGGAGCCG ACAGGGCTGT CGgTGTTCCA 3840 

CCACCGATAT AAAGGGTTGA CAACTTTTCA ATATCATAAG AACGAAACTC TTCCAGCAGA 3900 

TGCTCTAAAT AGCTGTCGAC TGGCTGATTT TTGATGAAGA CCTTTGAAAA ATCACAATAA 3960 

TAACAAATCT GGGTACAAAA TGGGATGTGC ACATAGGCTG ACGTTGGTTT TTTCTGCATA 4020 

GTAATTATTA TACCACAAAG ACTAGATTCC AGATAAAAAT CACCATCCCC AGATACATAG 4080 

TCCGTCCGGA GATGGTGATG GTTTATTCTT CTGTTATATC AATCACAATC TCTTCTGAGT 4140 

CATCAAGAGC TTCGGCTTTT TCTTGCCATT GCTCCTTGAG ATTATTTAAT TGATTTTTTG 4200 

ATGCTTCTGT CGCTTGAAAA GCATAGGATT TAGTTTGAGC AAGTATACTG TCCACAGTGA 4260 

TTTCACCTGA CTCAACCTGT TCTTTTGTTT TCAGAACAAA ATCTGTAGCC TGCTCCTTAA 4320 

CTTCTGTCAG TTTTTCACAG ACTTGCTCCT TGGCATACTC CGGATCTTCT CTCAAATCAT 4380 

CTAGAAAATC TTGAGCCTGA CTGCAAACTT GTTTGCCCTT ATCACTTGTT AAAAACAAGG 4440 

CAAGAGCTGC ACCTGAAACG GTTCCTAAAA GGATTGAGGA TAATTTACCC ATAAGGATTC 4500 

TCCTTTTTTA TTTTTTGAAA AATTTACTTG CAAGACGAAG AGCTGACAGA CTTGCACCAG 4560 

TCTTGAGTGT TTTTGAACCA GCTGATGAAG CTTTCTTGCT CAAGACACGC GCATGGTCAT 4620 

TGAGGTCTGA AACAGATAGA GATAAATCTG CAACAGCACT GAAGAGTGGA TCAATCGTAG 4680 

CCACCTTGAC ATTGATATCA TCTGCCAAGA CATTGACCTT AGCCAACAAC TCATTGGTGT 4740 

GATGCAAGGT CACATCCACA TCTGAAGTCA AGGTTTTAAT CGTCTTTTCT GTTTCATCGA 4800 

TGACACGACC AAGCTTTTGT ACAGTAATGA TCAGATAGAC CAAAAAGACA ATCAAAGCTA 4860 

GGGCAACAAG AATATATGCA ACTTCTAACA TTTAGTTTTC CTCCTCTGTA ATATAGTAAG 4920 

GGGCCTTCTT TCGATTTTGA TAAATAACGA TCATTATACC GAGACCGATA AGGACAACTG 4980 

ACAGCCATTG GGACACTCGA AAGCCGAAGA ACATGAGACT ATCTGTTCGC ATACCTTCGA 5040 

TAACCATACG ACCGAAACCA TACCAAATCA AGTAAAAGGC CGTGATATGA CCTCGTCTGA 5100 

GACTCTTCCA TTTCCGTCTA AAAATCAGAA TCAAGGCAAA GCCAAGCAGA TTCCATAGAG 5160 

ACTCATAAAG GAAAGTCGGT TGACGGTAGC TCCCCTCAAT ATACATCTGG TCACGGATAA 5220 

AGCCAGGTAG ATAATCCAGA TTATCCACTG TTGCACCATA AGCTTCTTGG TTAAAGAAAT 5280 
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TACCCCAACG CCCCAAACTT TGAGCAATCA TAACGCTAGG CGCCGCAATA TCTAGAAAAT 5340 

CCCAAGTATT GATGAGTTTA CGGTCAGCAA AGATATAGAG CACAAGAGCC CCAGTTATCA 5400 

AACCACCGTA AATGGCCAAA CCACCATTCC AAATGGCAAA AATCTCTCCT AAATTCTGAC 5460 

TATAGTAATC AAATCGGAAA ATAACATAGT AGAGACGAGC TCCTAAAATA GCCAAGGGAA 5520 

AGGCTACTAA GATAAAATCT AAAATATCGT CTGGTATGAT CTTCTTTCTA GGTGCTTCTT 5580 

TCATGGTCAA ATAAACCGCA AGAATCAAGC CTGTCACAAT ACATAAGGCA TACCAACGAA 5640 

TGGCTAGGGG TCCTAGTTGA ATAGCAATTG GATCAAGCAT TTTGCACCTC ATTTCGAGCG 5700 

ATTAGACTTG TCAGTCGTTC GTCGAACAAA CGGGTCGCAT CAAAGCCCAT TTCCTTGGCA 5760 

CGATAATTCA TGGCAGCTGC CTCAATCACA ACAGAGATAT TACGACCTGT TTTAACTGGA 5820 

ATACGAATAC GAGGAATGtA CGCCAGAAAC TTCAAGTTCC TCTGCATTAT TTCCAAGACG 5880 

ATCAAAGGTC TTATGCGTAT CGTAATTTTC CAAATAGACA GCAAGCTGAA CCTGTGAAGA 5940 

ATCCTTGACA GCACTCGCAC CGTAGAGACT CATAACATCG ATAATACCAA CCCCACGAAT 6000 

TTCAATCAAG TGTTTCAAAA TTTCAGCTGG TTCACCCCAG AGAGTAATCT CATCCTTGGC 6060 

AAAGATATCG ACACGGTCAT CGGCTACCAA ACGGTGACCA CGTTTGACAA GCTCAAGACC 6120 

TGTCTCGCTC TTACCAATTC CACTATCTCC CTGAATCAAG ACGCCCATCC CATAAATATC 6180 

CATCAA 6186 

(2) INFORMATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 9541 base pairs 
(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 

GAAAATCACA ACCCTTTTTG CAAAATTTTT GAGATTATTT TCACAAACTT GATTTTTCAA 60 

AGTATACTCA ATAAAAATTA AAAAAATCCA CTACGTCAAG GCGAGGCTAA TGTGGTTTGA 120 

AGAAATTTTC GAAGAGCGTG AATGAGTATC ATCTATAGTA AAATAAAAAA ACTGAACAAT 180 

TTGGTTGGGG ACAGCCAAAC CAATTTCTCA CAATGTTTCA GAAACAAGGG TGTGCTATTC 240 

CAATTTCAGC CTACTATAAC TGTCATAGAT TGCTGAAACA AAGTCTAGGT AAAAGTCTTC 300 

ATAATAAAAA GACCTCCTAT CAAGTGTTCA AAAACTTTGA TAGGAGGTCT TGTTTTGTGA 360 

AAATATTTAT CAAATTTTCT ATACAAGTGA GCTGTTAGCC AGGTTCTTTC TATTCTTTCA 420 

ATTTCAATGA ATGGATTTTT TACTAATACT CATAACTGGG AATTTGTCTG TGTAAAAATA 480 
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GCGAGATAGA 


TGGTATTTAT 


AAAACACTCA 


AGACAGCTAG 


ACTAATATCA 


TTTAAAACAT 


540 


TATCTTCTTT 


TGAGCGACTG 


TTGGTTACCA 


ACATAGCTAA 


ATTTCCTGCA 


TTTTCAAATT 


600 


GATAGGGTTC 


TGATTTAGCA 


TTCACAACCA 


CCAAGAGGTG 


TTCTTTGCCG 


TGAACTTCAT 


660 


AGATAAGGTA 


GCCGCTATGT 


TCAATCGCAG 


AATGCACAAA 


GACATGATGG 


TAAATTTCAT 


720 


CATAGCTAGA 


GTAAGAAAAG 


GCACCAGTTT 


TTGTCTTCAA 


TCGGATGACT 


TGACGGATAA 


780 


ACTCAATACT 


GTCTTGACGC 


TCATTAATCA 


AGTTCCAGTT 


CACTTGGTTC 


ACACTGTCAG 


840 


GAGCATTATA 


GCTATTCATC 


GCACGCTCTC 


TATCATCATG 


GGTCAACTCA 


CCATTTTCAC 


900 


CAGTCGCAAC 


CAGTTTGGTA 


CGACCAAATT 


CTTGACCGAT 


TTCCATAAAG 


GCCATCCCCT 


960 


GCATGAGCAG 


ATTCATGGCT 


GTGGCAGTTT 


CGACCTTGCG 


CATGATTTGC 


TCTGAACTTT 


1020 


GGTCTGGATG 


AAGGGTTGCC 


AATAAATCGT 


GAAGATTGTA 


ATTGTCATGG 


GCTTCTACAT 


1080 


AGTTAAGCAC 


CTGATTTGGA 


TGTGTATAGC 


TTCCTAATTC 


ACGACTTCCT 


AGGATTG CTT 


1140 


TAGCTAGAAT 


TGGCTCTGTC 


GCAGCACCAC 


TGACAAAACC 


TGACTTGATA 


G C ACCAT AAA 


1200 


CTTCTCCCCC 


TTTGACAGCA 


TCGCGCTGAT 


TGTCATTAAA 


GAAACCAATA 


TTTGGCATCT 


1260 


GGTAGGCATT 


GTCCTTCTTG 


GCCTTATCAT 


AAGGGGCAAG 


ACCTGTTCCC 


ATATCCCATC 


1320 


CTTCTCCATA 


GAGGATAATG 


TTGGAGTCGA 


TTTCATCCAA 


GCTTTGACGA 


ATCATCTGCA 


1380 


TGGTCTTGAC 


ATCATGAATC 


CCCATCAAGT 


CAAAACGGAA 


GCCGTCAATA 


TTATATTCCT 


1440 


GCACCCAGTA 


TAGAAGAGAA 


TCAATCATAT 


ACTTGCGAAA 


CATTTCGTGT 


TCACTGGCTG 


1500 


TTTCATTTCC 


AACACCCGTT 


CCATTCTGGA AGGTACCATC 


TGGATTCATA 


CGATAATAGT 


1560 


AATCAGGGAC 


TGTTGTTTGG 


AATGGTGCAT 


CAACAACTGA 


GAAGGTATGG 


TTATAGACTA 


1620 


CATCCATAAT 


GACTCCAATA 


CCCGCATCGT 


GATAAGCTTG 


AACCATCACC 


TTCAAATCAC 


1680 


GAATGACCTG 


AGCTGGATCA 


TCTGGATTAG 


TTGAAAAACT 


AGTTTCTGGC 


GCGTTATAGT 


1740 


TTTGTGGATC 


ATAACCCCAG 


TTGTAGGTTA 


CATTTCCATC 


CTCATCGTAT 


TCTTTATGAC 


1800 


GGTCTGCAAT 


TGGTTGCAAT 


TGAACATAAT 


TGTAGCCCAG 


CTTCTTGATG 


TAATCAAAAG 


1860 


CAGTTGACTG 


GCCGTATTGG 


TTAACTGTTC 


CAGCCTGAGC 


AGCACCCAAG 


AAAGTTCCTC 


1920 


GAAGATGTTC 


ATCTACACCC 


GATGTAGGTG 


ATTTAGTCAA 


ATCACGAATG 


TGCATTTCAC 


1980 


AGATAACTGC 


CTTACATGGA 


TTTTCCAAGC 


GCCAAGTAGC 


CTCCGAACCG 


TGCTTAACCT 


2040 


CGAAGTTTTC 


AACTTGCTTT 


TCTACATGGC 


TCAGAATAGC 


TGAACGTTTG 


CCATCAGGGC 


2100 


TGGTCGCGAT 


TGTATAAGGA 


TCACGTGTCA GTGTTTGGTG 


ATGAGGGAAT 


TGGACTTGAT 


2160 


ACTGATAAGT 


CTTACCTACC 


AAATCTTCTT 


CAACATCCAA 


ACTCCAGACA 


CCGATTGTAT 


2220 
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TGTCCTTATG ATTATAAGAG TAGCTATTGC CTCTTTTCAT CTCAAAAGTC TTCCAAACGG 2280 

GTG CATC ATT AGCAGCTGAT TCATAAACGA CAACTTGCAC TTCTGTCGCT GTAGGTGACC 2340 

AGAGAGAAAA ATGAGCCTGA TTGTCCTCTA CACGGCAACC CAATTCTCCT TGGTAACCCC 2400 

AATGATGATC AAAACTAGCA CTGTTAATGG CCTTATCAAA GGCAAAAGGA TTTTGATTTT 2460 

TATAGAAAGG ACTGGCAATA GCAGGATTTT CAGAGTAATA AATCCTATCA TCGCCTTCCA 2520 

AAATCCAGAC CTCTGTTAAT AGGGGATAGT GATTAAAACG GATAGAATAT TCTTTACTAG 2580 

TTTGACCTGT ATGAACCACA AAATTCAAGC TTTCTATAAC ATGTGAACTT GGGTGTTCAA 2640 

AGCTAAATAA AGCTCCAAAA TAATCTTCTT TGTAGGTTAG CAAATCAATT CGTTGATCCT 2700 

GACTTTTTAC AAAGGAGCAA GTGTCATATT CTCCATTCTT ACGATGGTAA TGAATGCGCA 2760 

TAGGGTAGTT ATACATTTTT TATTTTTCCT TTTTACTTTG TTTCTATTTC ACTAATAAAT 2820 

TTTTGTCAAT CTCGTCTCAA TTAACAGACA TAGTCATATT CTCTAAACTC TGTTTTTAAA 2880 

CGATCCATTA CAAACTTTCT AGCCATGCCT CATCTCTGAC CTGGATACCA AGTTCTTGTG 2940 

CTTTTTGCAG TTTACTTCCA GCGTCTGCAC CTACCACGAC GAGGTCGGTC TTTTTAGAAA 3000 

TACTACCTGT CACTTTGGCA CCCAGACTTT CGAGTTTACT TTTAGCTTCT GAGCGCTTGA 3060 

GTCGTTCCAA TTTTCCTGTC AATACCACGG TCAAACCTGA CAAGGCCGCA TCCGCTACTA 3120 

CCGTCTGTCC TTTATAGTCC AGATTGACCC CAGTTTCTTT CAATTCTCTG AGCAGAATTT 3180 

CAGAGCCTTC TGTCGCAAAA TAAGTCTGAA GACTTTTGGC AATCACGCCA CCTAGACTTT 3240 

CAATACTAGC CACTTCCTCT GAATCTGCCT GAGACAGATT TTCAATTGAA TGGAAATATT 3300 

GAAGTAAAAG CTGACTAACC TTGCTTCCGA CATGACGAAT TCCCAAACCA AATAAGAGCT 3360 

TCTCGGCAGA ATTTTCCTTT GATGCTTGGA TAGCCTGATA CAGTTTAGCA GCGGACTTTT 3420 

CCTTAACTCC CTCTAAAAGG AGGAAATCCT CTTCTTGCAA ACGATAAATA TCCGCCACAT 3480 

CCTTGACTAA ATTAGCAGCA AAAAGCTTCT CAACAATAGA TGGACCAAGG CCTGTAATAT 3540 

TCATAGCATC ACGAGAAGCA AAGTGAATCA AGCCTTCCAT GATTTGAGCA GGGCAACGCG 3600 

GATTGATACA ACGTAGGGCC ACTTCATCTT CAAAGTGCAA CAAGTCAGAG TTACAACTTG 3660 

GACAGTTTGT AGGGATATCT AGTTTTTCTT CAGAAACCCG TTTGGACTCT ACCACACGTA 3720 

AAACGGCAGG GATGATGTCA CCAGCCTTAT ATACAATGAC CGTATCGTCT TTTCGGATAT 3780 

CTTTTTCAGC AATATAATCT ACATTGTGCA GGGTCGCACG GCTAACAGTC GTACCGGCAA 3840 

GTTGTACTGG TGTTAGATTA GCAGTTGGAG TTACAACACC GGTACGGCCA ACTGTCCAGT 3900 

CAACTGATAA GAGTTGAGCT TCTTTTTCTT CGGCAGGGAA CTTGTAGGCT ACTGCCCACT 3960 

TTGGAGCCTT AACTGTAAAA CCAAGTTCTT CTTGACTTGC TAGGTCGTTG ACCTTGATTA 4020 
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CCACTCCATC AATATCGTAA GGCAGATTTT CCCGTTCCTG TCCTACTTCT TGGATAAAAT 4080 

TCCAGATTTC ATCTATGTTT TCAGCCAAGA TTCGCTTAGG ATTGACCACA AAACCTAGTT 4140 

GTTCTAGGTA CTTCAAACCC TTTTCTTGGC TATCACGAGT TGAAGGGCTG GCTTCTTGAT 4200 

AGAGAAACGT TGCAAGATTA CGCTTGGCAA CTACTGCTGT ATCCAACTGA CGCAGAGTTC 4260 

CTGCTGCCGC ATTACGAGGA TTAGCAAATT CAGGCTCTCC ATTTTCTTGG CGCGCTTGGT 4320 

- TAACTTGGTC AAAGGAAGCG CGTGGCATGT AACATTCCCC ACGAACTGTG ATATCTAGTT 4380 

CTTCTGGCAA AGTCAAAGGG ATGTCCTTAA CACGCTTGAG GTTTTCTGTG ATATTTTCAC 4440 

CAATTGAACC ATCTCCACGT GTTACCCCAG CAACCAAAAT CCCCTTTTCA TAAGTCAGCG 4500 

AGATAGATAA GCCATCGATT TTCAGCTCAC AAATATAGGT CGGATGAGCC ACTTCCTTAC 4560 

GAACACGCGC ATCAAAAGCA TCTAGCTCCT CACATGAAAA AGCATCCTGC AAA CT AT AAA 4620 

GAGGATACTG ATGACTGTAT TTTTCAAAAC CATCTAAAAC CTTGCCACCA ACACGATGAG 4680 

TCGGACTGTC TGCTAGCACT TGCTCTGGAT AAGCAGTTTC TAACTCGACC AACTCACGGT 4740 

AAAGGCGGTC ATACTCACTG TCTGAAACCG AGGGATTATC GCTGGTATAG TACTCAGTCG 4800 

CATAGCGATT GAGCAAAGCG ACTAACTCAT TCATTCTTTT ATTCATAAGA CCATTTTACC 4860 

ATAAAACAAG CCCTCCTCAC AAACGAGAAG GGCGGAAAAA ACACTTAGTT TGAAATTATT 4920 

TTTGAAACTC AAGCAACCTT ATATCAATTT TTCAAAATGA GTTCGAACAT ATCCGAGAGC 4980 

TAAGAAATAT AAGGCTACAA CTCCAAGTCC AATAATCAAG AAAGAATAAA GATGGACACT 5040 

TGGCAAGACT GTCATAAATC CTTTTGCAAT AGGCATAAAT AGAATAGCTA AGGTAAAAAT 5100 

TGTACTCAGT ACTCTTCCAA GAAATTCGCT CTCAACCTTG GTTTGTACTT GAGTAAAAAA 5160 

GTGAATATTA AAAATCGTCA TAAACAATTC ACAAACTAAA TTTCCAGAAA AGGAAAGAAA 5220 

AGTTGGAAGT GGTAATCCCA TCATAAAAAC TCCGACACCT GTCAAAGCCA GTAAAATCAA 5280 

AAGATTATAA ATATTAGCTT TAATTTTACT AGCTAGAAGA GCCCCAATGA TGGAACCAAT 5340 

AGCCCCCATA GTTAAAATAC TTGCATAGGC TCCTTCTGAC CCGTAAAGCT GATTCGAAAA 5400 

GGGAAGTAGA AATTCAAAAG CTGCAAAAAA GAAATTAACG CTGGAAGCTA CCAGCAAAAG 5460 

GAAGAAAATT TCTTGCTGAT GCCAGATATA GTGTAACCCA TCCTTGATAT CTACAAAAAT 5520 

ATCTCTCCCA GTAAAAGCCT TTTTCTCTTG AACTTTTGCT TCCTCTTTTG GAAGGAAAGC 5580 

CACTAGAACA AAAGCAATGA AAAAAGTCAG CGAGTCTAGC AGTAGCGTCA T AT G GAG ACT 5640 

TGCAAACTGT AAAACAAGGA AGGAAAGAAC AGGAGAGCTA ACACCTACAA CCTGCAAAAC 5700 

CAGCTCTAAG CGAGAATTAT AGATCACAAT CTCATCTTTC TCCACCACTT CAGTTATGAT 5760 
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AGCTTTATTG 


GCTGTGPGAG 


AAAAGGCAAA 


AfiPAATAftPP 




fappaafa af 
I AbLAALAA I 




CAAAGCGCCA 


ATCATCCAGC 


TATPATTPPT 
mi x x v« v* x 


TATnaaar^aa 

1 A 1 unnnu AA 


fiTapppafap 
A x AuLLAuAL 


a a ar*a ritpp 
AAAGAA1 \_V_L 




ACAAACAAGA 




1 1 nnnn IV. X 1 


ipp a pp ipiR 

AI_VjAL.GAGAA 


AAACGG 1 1_ I\j 


ft ft ft f ft ft f°*TT\j~* r°* 

AAA I AACTCC 


5940 


GCCAAAGGGA 


TTG APG AG A A 


T Af2 AT^IYI a P 


n ipptp ap & a 


7\ ff/"»ff/> ft f ft O ft 

A 1 L. 1 vjA I At_ A 


Fn/rr'pfiix ft ft ft ^> 

I iCCTaaaAC 


6000 


TGTCTGTCCT 


ATACPTPfr A 




a /"* a a rr* R 
t- L. AG AQ_ Alt A 


mnwnoo ft m ft ft m 

TTTCCATAAT 


CATAGAGCAT 


6060 




1 lrtl iuAlAb 


rrrr 1 a cr^r* r*T 
UH-L-ALvjtsU 1 


AATCAACTGC 


ACTGCATAGC 


G ATT CAT ATT 


6120 


AAAGPTPPTP 


ippi a 71 rrwrwrvrv*" 
ILAAAl 1 1 I \j 


AAACTATTGT 


atcaaaaccg 


AAAGGAGCTT 


TTTATTTTTT 


6180 


PP PTT A TT Tfl 


uunnAAl 1AA 


CTTTTGACAA 


ft rnmmnvfto/^ffi ft 

atttttcgta 


GTGTTCCTGA 


TAATAGGCTA 


6240 


1 \3<~ 1 L. 1 UG 


AAGACCTAAC 


ACATCAAAAA 


TATGCATGGC 


CTCTTGCATC 


TGCTTACAGC 


6300 


r v T w TVTn v Pa r 1 a 

LllLlil At- A 


CTGTCCTTTT 


TGATATAAGG 


CAAAACCTTT 


TAAATAATGG 


AAAACATTAC 


6360 




CTTAATACCT 


TTGTCAATAA 


TCTTCTCTGT 


ATAAGCCTCA 


AAATAGTTGG 


6420 


V_/\ i 1 A1AAAA 


AGAAGAATGC 


TCTAAACAAT 


GCTGGTAACA 


ATTGAGGGCC 


AAAATCAACA 


6480 


/"""PR a rpprp/-»rpm 


A 1 uurCGAL. 1 A 


ATCTCTTGGT 


AAAATTCCTC 


CCTCTCCATA 


ACTTCTCTAC 


6540 


PAATPPfiAfiT 


wLn 1 Ala Itl 


ACATCGTAGA 


AACTATAGAG 


GTTACCGAAA 


AGAATCAACT 


6600 


PATAPATfWP 


CCATTCTTCT 


GTTTTGAAGA 


GATAATCTGC 


TACCTTACCC 


AAATCATCCT 


6660 


GPTTPATATP 


IT a 1PTPPT1 
A 1 AAL. 1 UVt^-A 


TCTCTTTGAC 


ft ft ft m/^ ft ^ ft /~*/^ 
AAATCAGACC 


TTGTAGCAAA 


ATCCAGTTCA 


6720 


GCTPAAAATA 


AAvauljvsAL* 1 V> 


G 1 LuAftt. J. t_ 1 


if ft /*• ft /^irifTwmn/** 

i AGACTTTTC 


AAGTTGTTCT 


CTTTGAAGCT 


6780 


TTTG A A AA pr 


luUUi 1 A I l_L» 


TTTGAATAGT 


ft ft ft rvn/^rv< ft m 

AAAGTGGGAT 


AATCTGTGCC 


ATCATAGACA 


6840 


CATGTTPATfi 


Al lAluAAAA 


TTCCTTGCCT 


cp a rn/"i/^» » qv"* ft ft 

1 A rCC ATGAA 


ATTTTCGATT 


GTTACATGAA 


6900 


TGTTATPCAA 


A ATPTP A A An 
Aft 1V.1 wUUiu 


aiftppjvspi 

nnftLuuuAuft 


PTWr 1 a /"v*f /"» 
L. IuLLAWj 1 


AGACTCCCCA 


ft p/^fr^P ft n ft 

AGCTCAAAGC 


6960 


GAGATAAPTfi 


rtVjrlOvj 1 AJjAo 


PlPPlipT'PPP 


CTGCTGCTTC 


^wTwnfn iv ^ iv iv iv 

CTTTAAAGAA 


rt> iv iv frvnm/^/^ iv 

TAATTTCCAC 


7020 


TTGTTCGAAA 


TTP A Pfi A A AT 


ipttttpp a a 
H»- 1111 LLnn 


PftTPTTPPAT 


CTTTACACCT 


»n/yv^ ft m ft ft 

GCTCTGATAA 


7080 


TTCTTCCCAC 


TCAAGCATAG 


CTTCTTCCTG 


ACGATGGCTG 


ATTTTGTCCA 


GCTCAGCCTG 


7140 


TAATTCCATG 


AGTTTGTCGG 


CATCGTTTGT 


TTCCAACATT 


TGTTCAGAAA 


TGGCTTGGCT 


7200 


TTGACTTTCT 


AGCTCTTCAA 


TTTCAGCTTC 


TAGACTTTCG 


ATTTGTCGCA 


TGAGTTTGCG 


7260 


AACTTCTTTT 


TGACTTTCTT 


TCTGGGCCTG 


ATAGTCATTG 


ACTGGACTTG 


CTTCCTTTGC 


7320 


TTGATTGCTA 


GTTGAAGCTT 


CCTCAGTCTG 


ACTCATTTCT 


GCTGTTGCTT 


TCTTCTCAAC 


7380 


ATAGTAGTCG 


TAATCTCCAA 


GGTAGAGAGT 


TGAACCATTC 


TCAGACAATT 


CCAAAACATG 


7440 


AGTTGCCACA 


CGATTGATAA 


AGTAACGATC 


ATGACTGACA 


AACAGCAAGG 


TTCCATCAAA 


7500 


GTCAATCAAG 


GCATTTTCTA 


GCACTTCCTT 


ACTATCAATA 


TCCAAGTGGT 


TGGTCGGCTC 


7560 
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ATCCAGAATC AAAAAGTTAT TGTTTTCCAT AGACAATTTA GCTAAAAGCA AACGAGCTTT 7620 

TTCGCCACCA GATAGCATGC CGACTGATTT TTTAACATCA TCTCCTGAGA AAAGGAAGGC 7680 

TCCAAGACGG TTGCGGATTT CAACTTCTGG TGTCAGTTTG AAATCATTCC AGAGTTCATC 7740 

CAGCACCGTA TTACTTGGTG TCAGCTTGCT TTGGGTTTGG TCATAGTAAC CAACCTCAAC 7800 

ATTAGCGCCA AAGCGCTTTT CTCCCTTGAT AAAAGGAATC TGGTCCACAA TAGACTTGAT 7860 

AAAGGTTGAC TTGCCGATAC CATTTGGACC AACGATAGCG ACAGCATTCA TCTTACGAAG 7920 

ATCTAGGTTA ATCGGTTGTG ACAAGACTTC CCCGTCATAG CCAACAGCTG CATTTTCAAC 7980 

AGTCAAAACA ACATTGCCCG ACGTTTTTTC AGACTGGAAG GTCATGTTGG CTGATTTCTT 8040 

GCCAGCTTCA GGCTTGTCCA AACGTTCCAT TTTTTCCAGT TGTTTACGGC GAGATTGAGC 8100 

ACGTTTAGTC GTTGAAGCAC GAACTAGATT GCGATTGACA AAGTCTTCCA GAGCAGCGAT 8160 

TTCCTTCTGT TGCTTTTCAT AGTTTTTTGC CTCAGTAACT AGCTTTTGCT CCTTCAATTC 8220 

GACAAAACGA GAGTAATTCC CCACATAGCG ATCCAAGGAA TGCTTGGTCA AATCTAGCGT 8280 

AATTGTCGCA ACCTTGTCCA AGAAATAACG GTCGTGGCTG ACGATAATGA GGGCACCGCT 8340 

ATAGTTTACC AAGTAATTCT CTAGCCAGGC GATGGTTTCA ATATCCAAGT GGTTAGTTGG 8400 

CTCGTCCAAG ACCAAGAGAT TGGGCTTTTC AAGGAGCATT TTGGCAAGTG CCAAACGAGT 8460 

ATTTTGACCA CCAGAAAGCT CAGCAATTTT CATCTGCCAC ATAGACTCGT CAAACTTGAA 8520 

TCCATTCAAA ATCGCTCGAA TATCAGCTTC ATAGGTAAAG CCACCTGCTT GGCGAAAATT 8580 

CTCAGATAAG CGGTCATAAT CTGACATCAG TTTATCCAAA TCCTCACCAG ACTTTTCACC 8640 

CATCTCCAGC TCCATCTGAC GCAGTTGTCT CTCCGTCCGA CGCAAATCAT TAAAGACATG 8700 

AAGCATTTCA TCGTAGATGG TATTTTCAGA CTCAAAACGG CTATCTTGGG CTAGGTAAGA 8760 

CAGAGAAATA TCTTTTTTCT TATTGATTTC TCCGCTAGTT GGCTCCTCTT CTCCAACTAA 8820 

AATCTTCAAA AGAGTAGACT TACCTGCACC ATTTTTCCCA ACAAGAGCAA TCCGATCTCG 8880 

TTCATCAACC TGCAGGTTGA TATTATCGAA AAGAACCTCT CCTGCAAAAG AACGTTCAAT 8940 

TTTATTAGCT TGTAAAATAA TCATACAAGT AGTATAGCAT GTTTCCCTAA GGCATTCAAG 9000 

ATAATCGTAA GTCTTTTAGT ACAACTTTTA TAACATAAAA TAAACTAAAT TATGTATATT 9060 

TTATATTAGA TTACTTCACT ATCTTGTTGG ATTTTCTAAC CAGCTAATCT TGTTTCAAAT 9120 

AGTTATCGCA CAAGTCTATT ATTTAATTCT TTTCATCATT TACGTACGTA TAGCAGATTG 9180 

AAATAAGATG AGAACAAATC GATTGGGAAA GTAAAATTAA TTTCTATAAA TGTTTTAGCA 9240 

ATTGTTTCGT ACT ATT TT AG ATTCAGTCTA CTATATACAA TATTTTCGGA ACATTCAACT 9300 
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TTTTAACTCT ATTTATTACT AGATTTCATA ATTAAAAAAC CTACTGACCA AGCTAGAAAG 



9360 



CTTGATACAA TAGGCTTTTT AAAGACTGAT TATTTAACAG CGTCTTTAAG AGCTTTACCA 



9420 



GCTTTGAATG CTGGTACTTT AGAAGCTGCA ATTGTCATTT CTTTACCAGT TTGTGGGTTG 



9480 



CGACCTTTAC GTTCTGCGCG CTCACGAACT TCAAAGTTAC CAAAACCGAT CAATTGAACT 



9540 



T 



9541 



(2) INFORMATION FOR SEQ ID NO: 133; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3502 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 

TTGACTATCC TATCATGCTT TCTAAGGTCT ACTCAAGAAA ATCATTTTCA AGTTTTCACA 60 

CCTTTCTCAA AAAAGTTAAA AAATTTTCTC AAAAACGCTT GACTCTGACC TAAGGCGAAG 120 

GGTTATACTA TCATTGTAAG GAGGAAATCA TGTACCATAT AAAAGAAGCT GCGCAGCTTT 180 

CAGGTGTCTC TGTCAAGACC CTGCATCACT ATGACAAGAT AGGACTCTTG GTCCCCTTAA 240 

AGTCGGAAAA CGGCTATCGA ACCTACAGTC AAGAGGATTT GGAACGCCTT CAGGTCATTC 300 

TTTACTACAA ATATCTAGGC TTTTCTTTAG AGAAAATAGC AGAGCTGTTA AAGGAAGAAA 360 

GGACAGATTT ATTGCCCCAT TTGACTAGGC AGTTGGACTA TCTAACTCGC GAAAGGCAAC 420 

ATCTGGATAC CTTGATTTCC ACCTTGCAAA AAACTATTCA AGAACAAAAA GGAGAAAGAA 480 

AAATGACCAT TGAGGAAAAA TTCACGGGAT TTAGCTATCA AGACAATCAA AAATACCACC 540 

AAGAAGCGGT AGAGAAATAT GGTCAAGAAG TCATGGGACA AGCGCTCGAA CGCCAAAAAG 600 

GTCACGAAGA CGAGGCTACG GCCGCCTTTA ACCAAGTCTT TCAAACTTTG GCACAAAATC 660 

TTCAAGTTGG TTTACCTGCA ACAGCAACCG AAAACCAGGA GCAAGCAGCC AAGCTCTTGC 720 

AAGCCATTCG CACTTATGGA TTTGACTGCT CTATTGAGGT ATTCGGTCAT ATCGGTAAAG 780 

GTTACGTCTA CAACCCAGAG TTTAAGGAAA ACATTGACAA GTTTGGTTCT GAAACAGCCC 840 

AGTACACGTC AGATGCCATT GCGGTTTACG TTCAGACAAA TGCAGAATAA ATAGGCTAGG 900 

AATTTCCTAG CCTATTTTTT ACTTCAAATC ATAAAGCCAG TCGTCACCGT TTTTGTAGTA 960 

AAAGAATTCA CTGAGATCTT CTTCTAGAAA CACACGAAGC ATATCAGACA TATCATCGGT 1020 

TGCAAGTTTT AGATGAGAAA GATTTTCAAA GTCCTCCCAC CAAACTTTCC CTTCGTCTGA 1080 

AGACTGGAGT TCACCAGTAA AGTGTTCTGT CTTGTAAAAA AGGACGACAT AACGATAATC 1140 
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CTTGTCGTCA 


TACCAGTTTT 


TGATACCACA 


GAGTTGGGGT 


TTGGAAATGA 


TCAGACCAGT 


1200 


TTCTTCTTTC 


ACTTCACGAA 


TG AC AG CATC 


GACAAAGGAT 


TCGCCACGTT 


CAACATGACC 


1260 


ACCAGGAAAA 


GTAATGCCAG 


ACCAGTCGGG 


ATTAACTCGG 


TCTTGGACCA 


GGACCTTATC 


1320 


TCCGTTTTTA 


ATCATACACA 


TGTTAACAAA 


TTCGACTGCC 


TCTCTTCTGT 


TCATTCTTCA 


1380 


CAACCTTTAA 


TCTTTAATCA 


TAATGCAGAC 


TTCCCGCCAC 


CCAGCCGGTA 


CAGAGGGCAG 


1440 


- AAGTG ATGTT 


AAAGCCACCC 


GTGTGGGCAT 


TGATATCCAT 


AACTTCGCCT 


GCAAAGTGGA 


1500 


GGCCAGGTAC 


CAGCTTACTT 


TCAAGGGTTT 


TAGGATTGAT 


TTCCTTGAGA 


CTGACTCCAC 


1560 


CCTTGGTAAC 


AAAGGACTTT 


GCAAGGGACA 


TTTTTCCAGT 


TACAGGAATT 


TTAAGTTCTT 


1620 


TAATGGACTG 


GACAAGTTGT 


TCTCGTTCCT 


TTTCAGTCAG 


TTGTTTGACT 


TTTTCAGGAT 


1680 


ATCCTTGTAC 


AAAAAATTCG 


GCCAAGCGTT 


CTGGTAACAA 


GGTTTTTAAA 


GCGTTTTTCA 


1740 


AGGATTTTTC 


CCGATTTTCT 


TCTAGAAATG 


TAACCAAGTC 


CTTCTCAGAA 


AGTTGAGGCA 


1800 


AAACATCGAG 


TGAGAGAACC 


TCCCCACCTT 


TGACAAAGCT 


AGACATGCGT 


AGGGCAGCAG 


1860 


GACCTGACAA 


ACCAAAGTGG 


GTAAAGAGTA 


AATCATGAGT 


GATGACATGC 


TTACCATAAC 


1920 


TTAGGGTCAC 


ATCGTCCAGA 


GAAATACCTT 


GTAAGGCTTT 


ATGTGGAAAA 


TCTGTTAATA 


1980 


AAGGACTTTC 


AGCAGCCTCA 


AGATCGGTGA 


TGGTATGCTT 


AAAATGGCGA 


GCAATCTCGT 


2040 


GACCAAAACC 


AGTCGAACCA 


GTCGAAGGAT 


AAGACTTACC 


ACCTGTTGTG 


ACAATGAGTT 


2100 


TCTCACAAGT 


GAAGGTTTGA 


TCCGCTGACT 


TAAGGACAAA 


CTGGTCATCT 


ACTTTTTTAA 


2160 


CAGAAACGAT 


TTCTATTTGA 


GTAGCAACTT 


GACCACCTAG 


TTCGGTGATT 


TTCTTTTCCA 


2220 


AAGCTTCGAT 


AATAGTCCGA 


GACTTGTCAC 


TGGCTGGAAA 


GACGCGTCCG 


TGGTCTTCGA 


2280 


CCTTAAGTTT 


AACACCATTT 


TCTGTAAAAA 


AGTTGATGAT 


GTCATGATTA 


TCGAACTGGG 


2340 


AGAAAACACT 


GTAAAGAAAG 


CGTCCGTTTC 


CAGGAATTCC 


AGCTAGCAGG 


TTGTCTAAGC 


2400 


TACCATTGTT 


GGTCACATTG 


CAACGTCCCC 


CACCAGTCCC 


AGCTAATTTT 


TTTCCAAGTT 


2460 


TCCGATTTTT 


TTCGATGAGG 


AGGGTTTTCT 


GTCCATAAAA 


GCTACTGGAA 


ATCGTAGCCA 


2520 


TCATACCAGC 


AGGTCCCCCA 


CCGATGACAA 


TAGTATCAAA 


ATGTTTCATA 


GCTCTATTGT 


2580 


ACCACAAAAA 


AACAAGAGAT 


GATGGTCACC 


TCTTGTCAAG 


AATGCAATTA 


ATCAATTTCA 


2640 


TAGCCCATCA 


GCAAACCGCC 


CTCTTCTGCA 


TAGAAACTGC 


AGAGACCAGA 


GGTTGGTAGA 


2700 


ATTTTAATAT 


CCGCTTGTGG 


GAAGGTTTCA 


CGGATTCGCT 


CTGAGAGCTG 


TTGACAACAT 


2760 


TTTTCGTTAT 


TGCGTTGGGC 


CATGACAATA 


CGGCCACCAG 


CATATCCAGC 


TTTTACTAAC 


2820 


TCATCATAGG 


CAGCTTGAAC 


TGATTTCTTT 


GATCCCCTTG 


CTTTTTGTAG 


CAATTCGAGA 


2880 
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GTCCCAGTTT CACTAGCTTT TCCGACCATA CGAATGTTGA GAAGGCCAAC GACCGTACCG 2940 

ATAAGCTTGC TCAAACGGCC GTTCTTCACC AAGTTATCGA CTTTGGCTAG GACAAAGAGC 3000 

AACTTAGTTT TTTCTTGATA GGCGGTGATA GCTTCAACCA CTTCTTCAAA AGACAAGCCC 3060 

TGGTCAATCA AGTCATTCAA TTTTTCTACG AGTAGGTCAA CTTCACCACC AGCAGATAAA 3120 

CTATCAATCA CATGAATCTT AGTGTCAGGA TGGTCTTCCA GATAAATATT CTTTGCTAGT 3180 

TGAGCACTAT TGTGACTGCC AGAAAGGGTA CCTGTGATGG TTACTAGGAA AATGTTTTTG 3240 

GCACCTTCAA ATGCTCGCAA ATAGTCATCT GGGCTTGGAC AAGCCGATTT TGAAGCTTCT 3300 

GCAGTTGCAT ACATGGTTTC CATCATTTGG TCAATATCGA GACTGGCGTC ATCAACAAAG 3360 

ACCTGATCAG CTACTTGAAT GGTTAAGGGG ACACTTACAA AGGTTGTGTT AATAGCTGGT 3420 

GTTGGCAGTT GACGATAATC ACAACCAGAG TCAGCAATAA TCTTCCAAGT CATAGAAATT 3480 

CTCCATCTTT GTCAGGAACG AT 3502 



(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12665 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 

CGATTGATTT TTTTAAAGCG TTCGATAGAG AATGAGAAAC GAATCCTTAG CAATGGCGGG 60 

AAAGAATTTG GAGTTGAGAA TACAAAACGA TTAACTATGG CTCATATTGT TTTTTATCTC 120 

TCTTGCTTGG TTGAGGCAAT GGTGCACAAG ACAATTTTTG ATGGCATGGG CATGGTTGGT 180 

TTAGTCTTGC TTATTTTTTC TATGCTGATG TTGATGTTGG TGATTCACTT GTTGGGAGAT 240 

ATTTGGACAG TGAAGCTTAT GCTTGTCAAT AATCACAAAT ATGTAGATCA TATCTTGTTT 300 

AGGACAGTAA AACACCCTAA TTACTTTTTA AATATTCTTC CTGAGTTGAT TGGCTTGACC 360 

TTGTTGAGTC ATGCTTATGT GACTTTTGTT TTAGTTTTTC CAGTTTATGC AGTTATTTTG 420 

TATCGACGAA TAGCTGAAGA GGAAAAGCTA TTACATGAAG TTATAATCCC AAATGGAAGC 480 

ATAAAGAGAT AAATACAAAA TTCGATTTAT ATACAGTTCA TATTGAAGTG ATATAGTAAG 540 

GTTAAAGAAA AAATATAGAA GGAAATAAAC ATGTTTGCAT CAAAAAGCGA AAGAAAAGTA 600 

CATTATTCAA TTCGTAAATT TAGTGTTGGA GTAGCTAGTG TAGTTGTTGC CAGTCTTGTT 660 

ATGGGAAGTG TGGTTCATGC GACAGAGAAC GAGGGAGCTA CCCAAGTACC CACTTCTTCT 720 

AATAGGGCAA ATGAAAGTCA GGCAGAACAA GGAGAACAAC CTAAAAAACT CGATTCAGAA 780 
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CGAGATAAGG CAAGGAAAGA GGTCGAGGAA TATGTAAAAA AAATAGTGGG TGAGAGCTAT 840 

GCAAAATCAA CTAAAAAGCG ACATACAATT ACTGTAGCTC TAGTTAACGA GTTGAACAAC 900 

ATTAAGAACG AGTATTTGAA TAAAATAGTT GAATCAACCT CAGAAAGCCA ACTACAGATA 960 

CTGATGATGG AGAGTCGATC AAAAGTAGAT GAAGCTGTGT CTAAGTTTGA AAAGGACTCA 1020 

TCTTCTTCGT CAAGTTCAGA CTCTTCCACT AAACCGGAAG CTTCAGATAC AGCGAAGCCA 1080 

- AACAAGCCGA CAGAACCAGG AGAAAAGGTA GCAGAAGCTA AGAAGAAGGT TGAAGAAGCT 1140 

GAGAAAAAAG CCAAGGATCA AAAAGAAGAA GATCGTCGTA ACTACCCAAC CATTACTTAC 1200 

AAAACGCTTG AACTTGAAAT TGCTGAGTCC GATGTGGAAG TTAAAAAAGC GGAGCTTGAA 1260 

CTAGTAAAAG TGAAAGCTAA CGAACCTCGA GACGAGCAAA AAATTAAGCA AGCAGAAGCG 1320 

GAAGTTGAGA GTAAACAAGC TGAGGCTACA AGGTTAAAAA AAATCAAGAC AGATCGTGAA 1380 

GAAGCAGAAG AAGAAGCTAA ACGAAGAGCA GATGCTAAAG AGCAAGGTAA ACCAAAGGGG 1440 

CGGGCAAAAC GAGGAGTTCC TGGAGAGCTA GCAACACCTG ATAAAAAAGA AAATGATGCG 1500 

AAGTCTTCAG ATTCTAGCGT AGGTGAAGAA ACTCTTCCAA GCCCATCCCT GAAACCAGAA 1560 

AAAAAGGTAG CAGAAGCTGA GAAGAAGGTT GAAGAAGCTA AGAAAAAAGC CGAGGATCAA 1620 

AAAGAAGAAG ATCGCCGTAA CTACCCAACC AATACTTACA AAACGCTTGA ACTTGAAATT 1680 

GCTGAGTCCG ATGTGGAAGT TAAAAAAGCG GAGCTTGAAC TAGTAAAAGA GGAAGCTAAG 1740 

GAACCTCGAA ACGAGGAAAA AGTTAAGCAA GCAAAAGCGG AAGTTGAGAG TAAAAAAGCT 1800 

GAGGCTACAA GGTTAGAAAA AATCAAGACA GATCGTAAAA AAGCAGAAGA AGAAGCTAAA 1860 

CGAAAAGCAG CAGAAGAAGA TAAAGTTAAA GAAAAACCAG CTGAACAACC ACAACCAGCG 1920 

CCGGCTCCAA AAGCAGAAAA ACCAGCTCCA GCTCCAAAAC CAGAGAATCC AGCTGAACAA 1980 

CCAAAAGCAG AAAAACCAGC TGATCAACAA GCTGAAGAAG ACTATGCTCG TAGATCAGAA 2040 

GAAGAATATA ATCGCTTGAC TCAACAGCAA CCGCCAAAAA CTGAAAAACC AGCACAACCA 2100 

TCTACTCCAA AAACAGGCTG GAAACAAGAA AACGGTATGT GGTACTTCTA CAATACTGAT 2160 

GGTTCAATGG CGACAGGATG GCTCCAAAAC AATGGCTCAT GGTACTACCT CAACAGCAAT 2220 

GGCGCTATGG CGACAGGATG GCTCCAAAAC AATGGTTCAT GGTACTATCT AAACGCTAAT 2280 

GGTTCAATGG CAACAGGATG GCTCCAAAAC AATGGTTCAT GGTACTACCT AAACGCTAAT 2340 

GGTTCAATGG CGACAGGATG GCTCCAATAC AATGGCTCAT GGTACTACCT AAACGCTAAT 2400 

GGTTCAATGG CGACAGGATG GCTCCAATAC AATGGCTCAT GGTACTACCT AAACGCTAAT 2460 

GGTGATATGG CGACAGGTTG GGTGAAAGAT GGAGATACCT GGTACTATCT TGAAGCATCA 2520 
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GGTGCTATGA AAGCAAGCCA ATGGTTCAAA GTATCAGATA AATGGTACTA TGTCAATGGC 2580 

TCAGGTGCCC TTGCAGTCAA CACAACTGTA GATGGCTATG GAGTCAATGC CAATGGTGAA 2640 

TGGGTAAACT AAACCTAATA TAACTAGTTA ATACTGACTT CCTGTAAGAA CTCTTTAAAG 2700 

TATTCCCTAC AAATACCATA TCCTTTCAGT AGATAATATA CCCTTGTAGG AAGTTTAGAT 2760 

TAAAAAATAA CTCTGTAATC TCTAGCCGGA TTTATAGCGC TAGAGACTAC GGAGTTTTTT 2820 

TGATGAGGAA AGAATGGCGG CATTCAAGAG GCTCTTTAAG AGAGTTACGG GTTTTAAACT 2880 

ATTAAGCCTT CTCCAATTGC AAGAGGGTTT CAATCTCTGC CAGGGTGCTG GCTTGCGAAA 2940 

TGGCTCCACG GAGTTTGGCA GCGCCAGATG TTCCACGGAG ATAGTGAGGA GCGAGACCGC 3000 

GGAATTCACG AACTGCGACG TTTTCTCCTT TGAGGTTAAT CAATCGTTTC AAGTGTTCGT 3060 

AGGCGATCTT CATCTTGTCT TCAAAGGTCA AATCAGGTAG GATTTCTCCT GTTTCAAAGT 3120 

AATGGTTGAT TTGGTTGAAG AGGTAAGGAT TTCCCATGGC AGCTCGGCCA ATCATGACTG 3180 

CGTCAGCACC AACTTCTTCG ATGCGTTGCT TGGCTTCTTG GACAGTACGG ATATCACCGT 3240 

TGGCGATGAA TGGAATCTTG GTTAGAGCTT GGGCAACCTT GTAAAGGGTC TCAAGGTCTG 3300 

CGTGGCCAGT ATACATTTGT TCACGGGTAC GGCCATGCAT GGCGAGGGCA GAAACACCTG 3360 

CAGCTTCAGC AGCGAGAGCA TTTTCTACTG CAAGAGATGG GTCCGCCCAG CCGGTACGCA 3420 

TTTTGACAGT AAGTGGGATA TCAAGGACAG ACTGGACCTT GTTGATGATG GAGTAAATCT 3480 

TGTCTGGATC CTTGAGCCAC ATAGCACCAG CTTCGTTCTT CACGATTTTG TTGACAGGGC 3540 

AGCCCATGTT GATATCGACG ATATCGGTCT TGGTGTTTTC TTGGATGAAT TCTGCTGCGC 3600 

GTGCTAGGCT GTCTTCATCG CTACCAAAAA GTTGGATAGA GACAGGGTTT TCGCCCTCAT 3660 

CGATATGAAG CATGTGCAGG GTTTTTTCGT TGTTGTATTG GATTCCCTTG TCAGAGACCA 3720 

TTTCCATTAC AACGAGTCCA GCTCCGAGCT CCTTTGCGAT AGTACGAAAG GCTGAGTTGG 3780 

TCACGCCAGC CATAGGCGCT AAAACGGTAC GATTGGGAAT CTCAATATTG CCAATCATAA 3840 

AAGGTGTATT AAGATTTGTC ACGAATGAGT TCCTCCAGGT CCTTTTCATC AAAGTTGTAA 3900 

GTAGTTTGGC AGAATTGACA AGTGATTTCT GCCCCGTGGT CTTCCTCTTT CATTTCCTGT 3960 

AAGTCTGAGC TTGGAAGGCT GGCAAGAGCG TTCATAAAGC GTTCATGGCT ACAGTCACAT 4020 

TGGAAACGGA TTTCTTCTTC AGAAAGACGC TTGTAGGCTT CGTCCCCGTA GATAGCCTTG 4080 

AGGAGGGCTT CGATATGGTC GTCGCTTTCG AGAAGAGTAG AGATAGCTGG CATTTCTTGG 4140 

ATGCGTTTTT CAAAGCGAGC AATCTCTTCT TTCTTGGCTC CTGGCAAGAC TTGAACTAGG 4200 

AAACCACCTG CAACCTTGAC CTTGTCTTCC TCGTCCAAAA GGACATTGAG GCCGACCGCT 4260 

GAAGGCGTTT GTTGGCTTTC AGTAAGGTAA AAGGCAAGGT CTTCACCGAT TTCTCCAGAG 4320 
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ATGAGGGGAG TTATAGAGTT GTAAGGATTT CCAGTACCGT AGTCTGTGAT AACGAGGAAT 4380 

TGACCATTTC CAACAAAAGG TCCGACTAGG ACTTCACCAG TCGCAGTCTT TTTGATGTCA 4440 

ACACCAGGAT TTTGAACATA GCCTTTGACG TTCCCCTTGG TATCAGCGAC GGTGATAATA 4500 

GCACCTAGAG AGCTAGATCC CAACACCTTA ACTGTAAGTT TGGTATTTCC TTTTTCATTG 4560 

GCTGCGAGAA TCTGGCTAGC GATAAGAGTT CGACCAAGCG CTACAGTTGA GCTAGCTTGG 4620 

■GTTTGATGTT TTTCTTGAGC AGTGCGGACG GTTTCAGTGC TATCAAGGAC AAAAGCACGA 4680 

AAGGcTCCGC TTTCTGATAT AGTTTTAATA ATTTTATCCA TAGCTACTAT TTTAGCATAA 4740 

AAATGCCCAA AGGGGGAGCC GTGTGTTTAC TGATTTTCAG GATAATGGAC CAGGAAATCA 4800 

GCATGAAAAT AAAAAGAGAA ACAGATTATT TTAGCATTTG TCAGATTTAT GCTATGCTTA 4860 

AGGTAGAAAA TGAAAGGGAT AACAAATGTA TTTAGGAGAT TTGATGGAGA AAGCCGAGTG 4920 

TGGTCAATTT TCAATACTTT CCTTTCTATT ACAAGAGTCT CAGACGACCG TCAAGGCTGT 4 980 

AATGGAAGAA ACAGGATTTT CAAAAGCAAC CCTAACCAAA TATGTCACCC TGCTCAATGA 5040 

CAAGGCTTTG GATAGTGGCT TAGAGCTGGC TATTCACTCA GAAGATGAAA ATCTGCGTCT 5100 

GTCTATCGGT GCAGCTACCA AGGGGAGAGA TATTCGGAGC TTGTTTTTGG AGAGTGCTGT 5160 

TAAATACCAG ATTTTGGTTT ATCTTCTCTA CCACCAACAG TTTTTAGCCC ATCAGCTGGC 5220 

TCAAGAATTG GTGATTAGCG AGGCTACGCT TGGTCGTCAC TTGGCTGGTT TAAATCAGAT 5280 

TTTGTCAGAA TTTGATTTAT CCATCCAAAA TGGCCGTTGG CGAGGTCCAG AGCATCAGAT 5340 

TCACTATTTC TATTTCTGTC TTTTCCGAAA GGTCTGGTCG AGTCAGGAAT GGGAAGGTCA 5400 

CATGCAGAAA CCAGAGAGAA AACAGGAGAT TGCCAATTTA GAGGAAATCT GCGGTGCAAG 5460 

TTTGTCTGCG GGGCAGAAAT TGGACTTGGT TCTCTGGGCT CACATCAGTC AACAACGTCT 5520 

TCGGGTCAAT GCTTGTCAGT TTCAAGTCAT AGAAGAGAAA ATGCGAGGGT ATTTTGACAA 5580 

TATCTTTTAT CTTCGTTTGC TGAGAAAGGT TCCGTCCTTT TTTGCTGGGC AACATATTCC 5640 

ACTAGGAGTT GAGGATGGTG AGATGATGAT ATTCTTCTCT TTTCTCCTAT CTCATCGCAT 5700 

TCTTCCTCTT CATACTATGG AGTATATTCT TGGTTTTGGA GGGCAGTTGG CAGATTTACT 5760 

GACGCAATTG ATTCAAGAAA TGAAGAAGGA GGAACTATTG GGGGATTATA CAGAGGACCA 5820 

TGTCACCTAT GAACTCAGTC AGCTTTGTGC TCAAGTCTAT CTCTATAAGG GCTATATTTT 5880 

ACAGGATCGC TACAAGTACC AGTTAGAGAA TCGTCATCCA TATTTACTGA TGGAACATGA 5940 

TTTTAAAGAG ACAGCAGAGG AGATTTTTCA TGCTCTACCT GCTTTTCAAC AGGGGACAGA 6000 

TTTAGATAAG AAGATTCTCT GGGAATGGCT CCAGTTAATC GAATATATGG CTGAAAACGG 6060 
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TGGCCAGCAT 




olL luuA ill 


ljAV_ n 1 v_ IvjU i 


TTTCTTGTCT 


1 I lLAAu(jAi 


6120 


GGCAGCPATT 


thy: a & irprn^ 




CAATCGTTTT 


A II ACC ATTXj 


AAGCTTATGA 


6180 


CCCTAGTrfifi 

1 1 vUu 


r* a t t & Tr* & nrwp 

1 1 A 1 uA 1 I 


TGCTGGTTAC 


LAA1 AAL.CL.tj 


Ail CA I AAG A 


AGGAACAGAC 


6240 


ACCAGTPTAT 


l*ii X X AAAAA 


A 1 UA(_ 1 l\jtj A 


1 Al vivjAljOA I 


TTGGTAGCGA 


TTCGCCAGTT 


6300 


ATTATTCAPT 


TAAAAfi/^rTTi 


vjVj 1 1AA1\.LA 


GGTCTTTTTT 


f**fyf* JA 7i H TUT/"' 11 

U 1 XjAAA 1 I CA 


CACAATCTCC 


6360 






AAAAAAlj I i\j 


ATAAACAAGA 


AAGCGCTTTA 


TTTTGTATAC 


6420 


TAGTAAGTfiT 




L.Av_v_ 1 LAnuA 


T CTTT ATC AG 


/*> Jl /V^ Jl Jl r»n*R 

GAGGAL.AGTA 


CATGTCACAA 


6480 


G AA AA AT A PA 




luALLAuwA 


Jl /""Tl H /"* Tl 71 /Trim 
Au 1 At- AAG i X 


L. rCvjTGQ-CAT 


CATTTTCAAC 


6540 


AAA A A AdCiCifl 


AAA irriup IV r* 


L. 1 I.GAG 1 LAA 


AAAvjAG i™rA 


oon ji/-»ti mrnfiirii 
CL.CAGATTTT 


CCCTCAGGCA 


6600 


V3U 1 1 VJVJV3 1 1 \J 




L. AA I <jAAA Jl 1 


TGGAALTrCTG 


TTCAGTCAGT 


TATTGCGGGT 


6660 


111 unl 


IV IV iPIWIVfl 


CAAGCCAAAT 


CAAATCGAGG 


CAATCGGGAT 


TACCAACCAA 


6720 


1\jAAALAA 


CGGTTGTCTG 


GGATAAGAAA 


ACAGGACTTC 


CTATCTACAA 


TGCTATCGTT 


6780 


TGGCAGTCAC 


GCCAGACAGC 


ACCTTTGGCT 


GAGCAACTAA 


AAAGCCAAGG 


TTATGTGGAA 


6840 


AAATTCCATG 


AAAAGACTGG 


TTTGATTATT 


GATGCTTACT 


TCTCTGCTAC 


CAAGGTTCGT 


6900 


TGGATTTTGG 


ATCATGTAGA 


AGGTGCTCAA 


GAGCGAGCAG 


AAAAAGGGGA 


ATTGCTCTTT 


6960 


GGTACTATCG 


ATACTTGGTT 


GGTTTGGAAA 


TTGACTGACG 


GTGCGGCTCA 


CGTGACTGAC 


7020 


TACTCAAATG 


CAGCTCGTAC 


CATGCTTTAT 


AACATTAAAG 


AACTCAAATG 


GGATGATGAG 


7080 


ATTTTGGAAA 


TCCTTAACAT 


TCCGAAGgCT ATACTTCCAG 


AAGTTCGTTC 


TAACTCCGAA 


7140 


ATCTACGGCA 


AGACAGCTCC 


ATTCCATTTC 


TACGGTGGAG 


AGGTGCCAAT 


CTCAGGTATG 


7200 


GCTGGGGACC AACAAGCAGC 


CCTCTTTGGA 


CAGTTGGCTT 


TTGAGCCAGG 


TATGGTTAAG 


7260 


AATACTTATG GAACAGGCTC 


TTTCATCATC 


ATGAATACTG 


GGGAAGAGAT 


GCAGTTGTCT 


7320 


GAAAACAACC 


TCTTGACAAC 


CATTGGTTAC 


GGAATCAACG 


GTAAGGTTTA 


TT ATG C CTT G 




GAAGGTTCTA 


TCTTCATCGC 


AGGAAGTGCT 


ATTCAGTGGC 


TTCGTGACGG 


TCTTCGCATG 


7440 


GTTGAAAATT 


CACCAGAATC 


TGAAAAATAC 


GCTCGTGATT 


CTCACAACAA 


CGATGAAGTT 


7500 


TATGTCGTTC 


CAGCCTTTAC 


AGGTCTAGGC 


GCTCCATACT 


GGAACCAAAA 


TGCTCGTGGT 


7560 


TCCGTCTTTG 


GTTTGACTCG 


TGGAACAAGC 


AAAGAAGACT 


TTATCAAGGC 


GACTTTGCAA 


7620 


TCTATTGCTT 


ATCAAGTGCG 


TGATATCATC 


GACACCATGC 


AAGTGGATAC 


TCAGACCGCC 


7680 


ATTCAAGTAC 


TGAAGGTGGA 


TGGTGGTGCA 


GCCATGAACA 


ACTTCCTCAT 


GCAGTTCCAG 


7740 


GCGGATATTT 


TAGGCATTGA 


CATTGCACGT 


GCTAAAAACC 


TGGAAACAAC 


AGCTCTAGGA 


7800 


GCGGCCTTCC 


TAGCAGGTTT 


GTCAGTAGGG 


TACTGGAAAG 


ACTTGGACGA 


GTTGAAACTC 


7860 



WO 98/18931 



PCT/US97/19588 



913 



TTGAACGAGA 


CAGGAGAACT 


CTTTGAGCCA 


TCTATGAACG 


AATCTCGCAA 


GGAACAACTC 


7920 


TACAAGGGCT 


GGAAGAAGGC 


TGTGAAAGCA 


ACTCAAGTCT 


TTGCGGAAGT 


AGACGACTAA 


7980 


TACTGGCAGA 


ATAAAGCGAT 


TTATTTAGAA 


AGTGTGTAAA 


TATGGAATTT 


TCAAAGAAAA 


8040 


CACGTGAATT 


GTCAATTAAA AAAATGCAGG 


AACGTACCCT 


GGACCTCTTG 


ATTATCGGTG 


8100 


GAGGAATCAC 


AGGAGCTGGT 


GTAGCCTTGC 


AGGCGGCAGC 


TAGCGGTCTT 


GAGACTGGTT 


8160 


TGATTGAAAT 


GCAAGACTTT GCAGAAGGAA 


CATCTAGTCG 


TTCAACAAAA 


TTGGTTCACG 


8220 


GAGGACTTCG 


TTACCTCAAA 


CAATTTGACG 


TAGAAGTGGT 


CTCAGATACG 


GTTTCTGAAC 


8280 


GTGCAGTGGT 


TCAACAAATC 


GCTCCACACA 


TTCCAAAATC 


AGATCCAATG 


CTCTTACCAG 


8340 


TTTACGATGA 


AGATGGAGCA 


ACCTTTAGCC 


TCTTCCGTCT 


TAAAGTAGCC 


ATGGACTTGT 


8400 


ACGACCTCTT 


GGCAGGTGTT 


AGCAACACAC 


CAGCTGCGAA 


CAAGGTTTTG 


AGCAAGGATC 


8460 


AAGTCTTGGA 


ACGCCAGCCA 


AACTTGAAGA 


AGGAAGGCTT 


GGTAGGAGGT 


GGAGTGTATC 


8520 


TTGACTTCCG 


TAACAACGAT 


GCGCGTCTCG 


TGATTGAAAA 


CATCAAACGT 


GCCAACCAAG 


8580 


ACGGTGCCCT 


CATTGCCAAC 


CACGTGAAGG 


CAGAAGGCTT 


CCTCTTTGAC 


GAAAGTGGCA 


8640 


AGATTACAGG 


TGTTGTAGCT 


CGTGATCTCT 


TGACAGACCA 


AGTGTTTGAA 


ATCAAGGCCC 


8700 


GTCTGGTTAT 


TAATACAACA 


GGTCCTTGGA 


GTGATAAAGT 


ACGTAATTTG 


TCTAATAAGG 


8760 


GAACGCAATT 


CTCACAAATG 


CGCCCAACTA 


AGGGAGTTCA 


CTTGGTAGTA 


GATTCAAGCA 


8820 


AAATCAAGGT 


TTCACAGCCA 


GTTTACTTCG 


ACACAGGTTT 


GGGTGACGGT 


CGTATGGTCT 


8880 


TTGTTCTCCC 


ACGTGAAAAC 


AAGACTTACT 


TTGGTACAAC 


TGATACAGAC 


TACACAGGTG 


8940 


ATTTGGAGCA 


TCCAAAAGTA 


ACTCAAGAAG 


ATGTAGATTA 


TCTACTTGGC 


ATTGTCAACA 


9000 


ACCGCTTCCC 


AGAATCCAAC 


ATCACCATTG 


ATGATATCGA 


AAGCAGCTGG 


GCAGGTCTTC 


9060 


GTCCATTGAT 


TGCAGGGAAC 


AGTGCCTCTG 


ACTATAATGG 


TGGAAATAAC 


GGTACCATCA 


9120 


GTGATGAAAG 


CTTTGACAAC 


TTGATTGCGA 


CTGTTGAATC 


TTATCTCTCC 


AAAGAAAAAA 


9180 


CACGTGAAGA 


TGTTGAGTCT 


GCTGTCAGCA 


AGCTTGAAAG 


TAGCACATCT 


GAGAAACATT 


9240 


TGGATCCATC 


TGCAGTTTCT 


CGTGGGTCTA 


GCTTGGACCG 


TGATGACAAT 


GGTCTCTTGA 


9300 


CTCTTGCTGG 


TGGTAAAATC 


ACAGACTACC 


GTAAGATGGC 


TGAAGGAGCT 


ATGGAGCGCG 


9360 


TGGTTGACAT 


CCTCAAAGCA GAATTTGACC 


GTAGCTTTAA 


ATTGATCAAT 


TCTAAAACTT 


9420 


ACCCTGTTTC 


AGGTGGAGAA 


TTGAACCCAG 


CAAATGTGGA 


TTCAGAAATC 


GAAGCCTTTG 


9480 


CGCAACTTGG 


AGTATCACGT 


GGTTTGGATA 


GCAAGGAAGC 


TCACTATCTG 


GCAAATCTTT 


9540 


ACGGTTCAAA 


TGCACCGAAA 


GTCTTTGCAC 


TTGCTCACAG 


CTTGGAACAA 


GCGCCAGGAC 


9600 
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TCAGCTTGGC AGATACTTTG TCCCTTCACT ATGCAATGCG CAATGAGTTG ACTCTTAGCC 9660 

CAGTTGACTT CCTTCTTCGT CGTACCAATC ACATGCTCTT TATGCGTGAT AGCTTGGATA 9720 

GTATCGTTGA GCCAATTTTG GATGAAATGG GACGATTCTA TGACTGGACA GAAGAAGAAA 9780 

AAGCAACTTA CCGTGCTGAT GTCGAAGCAG CTCTCGCTAA CAACGATTTA GCAGAATTAA 9840 

AAAATTAAGA AAAAATAAAA GAGGTGGAGG GCAGCATTCC TTGTCGCCCG TCCCTTCTTT 9900 

TTAATGGAGA CAGAAAGATG ATGAATGAAT TATTTGGAGA ATTTCTAGGG ACTTTAATCC 9960 

TGATTCTTCT AGGAAATGGT GTTGTTGCAG GTGTGGTTCT TCCTAAAACC AAGAGCAATA 10020 

GCTCAGGTTG GATTGTGATT ACTATGGGTT GGGGGATTGC AGTTGCGGTT GCAGTCTTTG 10080 

TATCTGGCAA GCTCAGTCCA GCTTATTTAA ACCCAGCTGT GACCATCGGT GTGGCCTTAA 10140 

AAGGTGGTTT GCCTTGGGCT TCCGTTTTGC CTTATATCTT AGCCCAGTTC GCAGGGGCCA 10200 

TGCTGGGTCA GATTTTGGTT TGGTTGCAAT TCAAACCTCA CTATGAGGCA GAAGAAAATG 10260 

CAGGCAATAT CCTGGCAACC TTCAGTACTG GACCAGCCAT CAAGGATACT GTATCAAACT 10320 

TGATTAGCGA AATCCTTGGA ACTTTTGTTT TGGTGTTGAC AATCTTTGCT TTGGGTCTTT 10380 

ACGACTTTCA GGCAGGTATC GGAACCTTTG CAGTGGGAAC TTTGATTGTC GGTATCGGTC 10440 

TATCACTAGG TGGGACAACA GGTTATGCCT TGAACCCAGC TCGTGACCTT GGACCTCGTA 10500 

TCATGCACAG CATCTTGCCA ATTCCAAACA AGGGAGACGG AGACTGGTCT TACGCTTGGA 10560 

TTCCTGTTGT AGGCCCTGTT ATCGGAGCAG CCTTGGCAGT GCTTGTATTC TCACTTTTCT 10620 

AGTTTATACT CTTCGAAAAT CAAATTCAAA CCACGTCAGC GTCGCCTTAC CGTACTCAAG 10680 

TACAGCTTGC GGCTAGCTTC CTAGTTTGCT CTTTGATTTT CATTGAGTAT TAGAAAACAA 10740 

TTATGTTGAT AGAGCTTGGG CAAGAGCCCA ATTTCAGCAA AAAATGAAGT AAATCTTCTC 10800 

ATAATAAAAC GCATCATATC AAGCACGAAA ATTCCACGAG GTCAACTACA GTCAGAAAGC 10860 

TGAACAACAA GCCAAAACGC CCAAAAAAGG CGGCAAAAAG CAAGCACCTG CAAGCAACGT 10920 

GCCGAAATGG TCAAATCCTG ATTATGTCAA CGAATTAGAC CCAAAAATCG TTGATATGCT 10980 

AGTAGAATTT CACAAGTCAC AAGGCACTTT GGAAACTCCC GAGGCGCAAG CAGAAATCGC 11040 

CCAAAAACGT GAAGAAATCG AGCAAAGGAG AGCTGAGCTT GAGGGTAAAA AACAAGAGCT 11100 

TTTGAACCGC TTGAACAAAT AGAGTTTCGC AAGTATTATG CTTACAAATT ACTTGAGCAA 11160 

TTAACTAAAA TATAAACCCT GCCTTTATAT CTAGGCAGGG TTTATATTTT AGAAATTCAC 11220 

GTAGGTTGTT ACGGTTTTTA CATACCCAGT ATAGTTTGAG TTTCTATAGT ATTCAGTGAT 11280 

AAACTTCCAT TTTCTTTGAG CAACATGGAT ATAAGTACTT GTTATGTAGT ATGGATATGG 11340 

GCTTTGTGAA TCCAAGTAAG ACTGATAAGC TTGTATACCA AAATATGCTC CACCAATTAT 11400 
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TGCACCCCAT GGACCCCCCA ATAAAGCACC TATCCTACCA ATCATATAAC TGATTCCAGC 11460 

ACCAGTCATG AAGTTAGCGA ATGTGTTAGC TTGTTTATTC CCATGTATTG TGTTGACGTA 11520 

ATTCCAAACA TTAGGATCGT ATGATCTAAA AGATATATTT AGGTCGATTT CATTCTTTTG 11580 

ATAAGCCATA TAAAATGCCC CATTGATATA GACGCCGTCA GCACGTCGTT CAATAGTGTC 11640 

TACACTTCCA TCTGGATTGA CAACCTCAAG AACTTCATCG CTTAAAATAT TTACTTGCGT 11700 

ATCTCCGAAC CGCACTGATG AGCCATTCTC AAACTGAGCC TCACCAGATA CAACTTTAGA 11760 

GTTTGCCGAT AAGCTATCAT CAGCAAAAAC AAACAAGCGA CGGGGAAATG CTAGACATAC 11820 

AGAAAACAGA CATAACTAGC AAACACATGC ATTTAAACAT CTTAGACATA ACGGAAACTC 11880 

CTTTGTATTT TTGATTTTTT TCAACTTTTA TTATACAATA AAACCAAATA AAAAGAAAGC 11940 

GGTAACAATA TGCTTAATGC GAAAATTTTT TATATATTTT TATGTTTGAT CGTTATCGAA 12000 

ACTACAGGCT TGTTGTTGTT GAAAAGAGGT CTCGAAATGG GTTATTTAGA CACAGAAGCT 12060 

ATTATCCTCG CAGTTTTTTC ATTTGCTTTT TACAACCTAT GTTCATTCGC TTGGGTCTGC 12120 

TCTACAATAA AAAACAATAA AAAATAAATA GACGTATTTT CAAAAAAAAC maAATGCATA 12180 

TTTATATTAG CAAAACGACG ATTTAAATCG TCGTTTTTTT GTAGTACGAC GGGCATGTCG 12240 

TATATCTGAG GTGTAAGTCC TCAGCCTGAC TATCGTGAGG TAGCAGGGAG AGGAAGGGAT 12300 

AGCGAAATCG TGGCTCTACG AACAGGAACG TGATAGTAAG GCGTATATAG CGGATAAGGA 12360 

GGCTTCAAAC TCTAAAGTCC AAAAAGGTAG TCGTAACCTA TATGTGTAAA TCACGAGAGT 12420 

AATTGAATTC GGACTAAGGT TTGTGTGAAA AAGATAAATC TTTCTAGAGT CTAAAGACTC 12480 

TGCGTCAGAT TTCCTATTTT CACTGTAACC TTTTAACGTC CTCATATCTT GTATAAACGA 12540 

GGAAAGATGT ACGACTTATC CCGTGAGGTT TCATGAGCGT GAAAGCGTAG TAACAACGAA 12600 

TCATGAGAAG TCAGCCGAGC CCATAGTAGT GAGGAAACTT CCGTAATGGA AGTGGAGCGA 12660 

AGGGG 12665 
(2) INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5305 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 

CGCTAATCAC TACAATCATT TTATTGTACT TTTTCACTCT CAAGAAAAGC AAGAAGTATT 60 
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CATTTTAGTT TCATTTAGTA TTATTTTGCA TACCTAAAAT ACAGTAAAAA ATCAGTCATC 120 

TTGGTATGCT CCTGCTTTCA CTATTCAACA CGTTTTTGAC TTATACTAGG CTCATTTCCA 180 

AAAGCATTAT ATAATAGTGA TATGAAACCA ACTAAACTAA ACAAGAAATA TAAGCAATAA 240 

AAATTCGTTT AAAAGATCTT ACTAAAGCTA ATACTAAATA AAAATAAAAG AGTAAACTAG 300 

GAAGTTTATT TCAAACAACC TAAAATACTG ATTTTCGGCT GAAGATAATA CTGGAGTGCA 360 

AATTAATGGG GTTATAATAA ATAGCTGATA GCTTGTGTTG GTTTTGGATT TTTTAAGAGT 420 

AGATGAGTAT TAAAACTATA AGGAGGACGA AGGTGGCTAA AAATTTAAAA TTAAAATTAG 480 

CTCGGGTAGA GCGTGATTTA ACACAAGGTC AACTGGCAGA GGCTGTCGGG GTGACACGCC 540 

AGACTATTGG TTTAATAGAG GCGGGAAAAT ACAATCCCAG TCTCTCGCTC TGCCAGTCTA 600 

TTTGCAGATG TTTAGGGAAA ACCCTAGACC AACTATTTTG GGAGGAAGAA GATGAAAAAT 660 

AGATTTTATT ATTCTCAATT ACTAGACGAA AGAGAAGAAC AACTGTTCAA TAAAGCGGGC 720 

TCTGAAAGTT TCTATATCTG CATTGCTTTG TCGCTCCTAT CTTATATCAT TTCAGTATTA 780 

GCACCAAGCC TTTTTAATTC TAATATGCTG CTAATCGTTA TCATCATAGG GACATTTTAC 840 

TTTTTCAATC GTGCCCGTTA TCTGGGAGTG ACCTACTATG GTCGTTTTCA TTTTACGATT 900 

TTGGGTTGTT TTTTCCTAAC CTTGGCTATT ACGGCTCTTT TGATGTTGCA GAATTATCAA 960 

TTCAACATAG AAATTTATCA GC ACAATCCT TTGAATTTTA AATACCTGTC TGCTTGGGTC 1020 

ATTACTTATA TCATTTACCT TCCGTGGATC TTTATTGGCA ATCTTGGTCT TAAGAGCTAT 1080 

GGCGAATGGG CTCAGAAAAA ATTTGAACAA GATATGGATG AATTGGAGAG TGGAGAATAG 1140 

CTTGTTACTC TTTTCTCAAT CCAGCTAAAA TGTGATATAA TAGTACTAAT TTATTGGAAT 1200 

ACATGAAAGT TCTTGAAAAT TTTCATGGGT TTCTAGCTAA GGAAGTAGGA AAAGTATGTA 1260 

TCCAGATGAT AGTTTGACAT TGCACACGGA CTTGTACCAG ATCAACATGA TGCAGGTTTA 1320 

CTTTGACCAA GGGATTCACA ATAAGAAGGC GGTCTTTGAG GTGTATTTCC GCCAACAGCC 1380 

TTTTAAGAAC GGCTATGCGG TTTTTGCAGG TTTAGAAAGA ATTGTGAACT ATCTTGAAGA 1440 

CTTGCGTTTT TCAGATAGTG ATATAGCCTA TTTGGAGTCG CTTGGTTATC ATGGGGCGTT 1500 

CTTGGATTAC CTTCGCAATT TCAAGTTGGA GTTGACCGTT CGTTCTGCCC AAGAAGGGGA 1560 

TTTGGTTTTT GCTAATGAAC CGATTGTGCA GGTGGAAGGA CCTCTAGCCC AATGTCAGTT 1620 

GGTCGAAACG GCTCTTTTGA ACATCGTCAA CTACCAGACT TTGGTGGCGA CGAAGGCAGC 1680 

TCGTATTCGT TCGGTTATCG AAGATGAACC CTTGATGGAG TTTGGGACAC GTCGGGCTCA 1740 

AGAAATGGAT GCGGCCATCT GGGGAACACG CGCAGCTGTG ATTGGTGGCG CCAATGGAAC 1800 

CAGCAACGTG CGTGCGGGTA AGCTCTTTGA CATTCCTGTT TTGGGAACCC ATGCCCATGC 1860 
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CTTGGTACAG 


GTTTATGGCA 


ATGACTATGA 


AGCTTTCAAG 


GCTTACGCTG 


CGACCCACAA 


1920 


AAATTGTGTC 


TTTCTTGTGG 


ATACCTATGA 


CACCCTTCGC 


ATCGGTGTAC 


CAGCTGCCAT 


1980 


TCAGGTGGCG 


CGTGAGCTGG 


GTGATCAGAT 


TAACTTTATG 


GGTGTGCGGA 


TTGACTCTGG 


2040 


GGATATTGCC 


TACATTTCTA 


AGAAAGTCCG 


TCAGCAACTG 


GATGAGGCTG 


GATTTACAGA 


2100 


GGCTAAGATT 


TATGCTTCTA 


ATGATCTAGA 


TGAAAATACC 


ATCCTTAACC 


TCAAGATGCA 


2160 


AAAGGCCAAG 


ATTGATGTCT 


GGGGTGTGGG 


TACCAAGCTG 


ATTACAGCCT 


ATGACCAGCC 


2220 


GGCTCTTGGG 


GCGGTTTACA 


AGATTGTTGC 


AATCGAAGAT 


GAAACTGGTC 


AGATGCGCAA 


2280 


TACGATTAAG 


CTGTCTAATA 


ATGCTGAAAA 


AGTTTCTACG 


CCAGGTAAGA 


AGCAGGTGTG 


2340 


GCGCATTACC AGTCGTGAAA AAGGCAAGTC AGAAGGCGAC TATATCACTT 


ATGATGGTGT 


2400 


GGATATTAGC 


GACATGACAG 


AAATCAAGAT 


GTTCCATCCG 


ACCTATACAT 


ACATCAAGAA 


2460 


GACGGTTCGT 


AATTTTGATG 


CCGTTCCTCT 


CTTGGTGGAT 


ATCTTCAAAG 


AAGGAATATT 


2520 


AGTTTACAAC 


TTGCCTAGTT 


TGACTGACAT 


TCAGGATTAT 


GCCCGTAAGG 


AATTTGACAA 


2580 


GTTGTGGGAT 


GAGTATAAGC 


GTGTGCTCAA 


TCCGCAGCAC 


TATCCAGTGG 


ATTTGGCGCG 


2640 


TGATGTATGG 


CAAGATAAGA 


TGGACTTGAT 


TGATAAGATG 


CGC AAGGAAG 


CCCTTGGTGA 


2700 


AGGAGAAGAA 


GAATGAGTTT 


GCAAGAAACG 


ATTATCCAAG 


AGCTGGGTGT 


CAAACCAGTG 


2760 


ATTGATGCCC 


AGGAAGAAAT 


CCGTCGTTCT 


ATTGATTTCT 


TAAAAAGATA 


TCTGAAAAAA 


2820 


CATCCCTTCC 


TAAAAACCTT 


TGTACTAGGG 


ATTTCTGGGG 


GACAAGACTC 


AACCTTGGCA 


2880 


GGACGTTTGG 


CGCAATTAGC 


TATGGAAGAA 


CTGCGAGCTG 


AAACGGGAGA 


CGATAGCTAC 


2940 


AAATTTATCG 


CTGTCCGCCT 


GCCATACGGA 


GTGCAAGCTG 


ATGAAGCAGA 


TGCTCAAAAA 


3000 


GCCCTAGCCT 


TCATCCAGCC 


AGATGTCAGC 


TTGGTTGTGA 


ATATCAAGGA 


ATCAGCTGAT 


3060 


GCCATGACAG 


CTGCAGTTGA 


AGCGACAGGT 


AGTCCTGTTT 


CAGACTTCAA 


CAAGGGGAAT 


3120 


ATCAAGGCAC 


GTTGCCGTAT 


GATTGCTCAG 


TATGCCCTTG 


CTGGTTCCCA 


TAGCGGAGCG 


3180 


GTCATTGGAA 


CAGACCACGC 


CGCGGAAAAT 


ATCACAGGTT 


TCTTTACCAA 


GTTTGGTGAC 


3240 


GGCGGTGCGG 


ATATTCTCCC 


TCTTTACCGC 


CTCAATAAAC 


GCCAAGGAAA 


ACAGCTCTTG 


3300 


CAGAAACTTG 


GCGCAGAGCC 


AGCCCTTTAT 


GAAAAAATCC 


CAACGGCAGA 


CCTAGAAGAA 


3360 


GATAAACCAG 


GCCTAGCTGA 


CGAAGTCGCA 


CTTGGAGTCA 


CCTACGCAGA 


GATTGACGAC 


3420 


TACCTAGAAG 


GCAAAACAAT 


CAGCCCAGAA 


GCTCAAGCGA 


CCATTGAAAA 


CTGGTGGCAC 


3480 


AAAGGCCAAC 


ACAAACGCCA 


CTTACCCATC 


ACCGTATTTG 


ATGACTTTTG 


GGAGTAAAAA 


3540 


GGTCCGGGGG 


ACCTTTTTAG 


CTTCTTGCCC 


TGAAATTAAA 


AAGCAAGAAA 


AACCTCCACT 


3600 
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GGAGGTTTTC AGCCTCTCAT CTTGAAATAA GAAAGTGAGA GAAGGTCTGG GGGATCTTGA 3660 

ACCCCGAGTT TAGAAATAAG AAAATGAGGC AGATTCAGTA ACTCGAAGAG TTCGATTTCA 3720 

TCGTCTTACC CCTGCAACGA TGACTAGGTT TGAAAAAGCT TGCTAGAGCG CATTTCAAAC 3780 

CAGGCAGCAA CTGCGTCAAG AAATTAGAAG ACAAACTCGT TTTCTAGCTG TTACTGAGTT 3840 

GAGCCTTTTT ACTACGAGTA TAGAAATAAG GAAGTGAGGT AGCATCATGA AATCTATCGG 3900 

TACGCAAATA TTACAGACAG AACGTTTGAT TTTAAGAAGA TTTGTGGAGA GTGATGCAGA 3960 

AGCCATGTTT CAAAATTGGG CTTCATCCGC TGAGAATCTG ACCTATGTTA CCTGGGATCC 4020 

CCATCCTGAT GTCGAAATCA CTCGAAACTC GATTTGCAAT TGGGTTGCTT CCTATACTAA 4080 

TCTCAACTAT TATAAATGGG CCATTTGTCT AAAAGAAAAC CCAGAGCAAG TAATAGGAGA 4140 

TATCAGCATT GTTAAGATAG ACGAGGCTGA TTTAAGCTGT GAAATTGGCT ATGTGTTAGG 4200 

CAAGGCTTAC TGGGGAAATG GTATGATGAC AGAGACTTTG AAAGCTATCT TGGACTTTTG 4260 

TTTTACTCAA GCAGGTTTTC AAAAGGTCAG AGCACGTTAT GCCAGTCTCA ACCCAGCTTC 4320 

AGGTCGTGTC ATGGAAAAGG CTGGAATGTC CTATCTACAA ACCATTGTTA ATGGTGTAGA 4380 

GAGAAAAGGC TATCTTGCGG ATCTTATTTA TTATGGTATA AGTAGGGAAG AATGTTGAAT 4440 

TCTATTTTCT GTTTCTATCG AAGTCAACTA TTTATTGTAA ATATAATAAT TAGCATTCCA 4500 

AGTTTATTTG AAACTTTAAA ATAGCATATT GATTAGTACA AGACAGATGT TCTAGTTCCT 4560 

TCTTTAATCT GGTTTAGTGT TAGTTAAAAA ATCGCTTTAA GCTTGTAACT AAGAGGGAGC 4620 

TAATCGACTA GATTCTCCAG CCGAACAGG1 GGTAATGTAC TTTTTATAGT GTAATCCTAG 4680 

CTGTTGTTAA ATTTAAAATA GAATCCTCTA TCGAGTTAGG GAATTAAATT CAACCAATTT 4740 

TATTCATGTT TTTTCTATCA AATTATCTAA TATTAAAATA GTCTCATTCT GATGAGAAAA 4800 

CTATTCCCAA ATCATTCATA CCTCTCTCAA CTAGATGTAA CTTACAAAAC CCCTGACCTC 4860 

ATGAGCCACT TTCTTCCTCC TCATGAGGTC AGTTTTACTT TCTGCTGTTC CAGTATCGTT 4920 

TTTCCTCGCT AGATTTCCTC AAAAGGGCAG ACTCCTCCCT TGGTGCGTCA CACGATTTTT 4980 

TCATCTCGAC TGTTCTTTAA TG CATC ATT A ACGACGCTTT TCTTCTAGGT GGTTCATAAG 5040 

GAACAGGAAG ATTCAGGTTG ACTTTTCTAA TCCTAGAATA AAGTGCTGAA AACAATTCGG 5100 

AATAGGCATA GAGACTAGAC AATTTGAGGA GCTGCTTGCG TCCTGTTCGA ACACATTTTC 5160 

CCACCACGTG AAGAAAAAGA TGGCGGAAGC GTTTGATTGT TAAAGTTTGG AAGTCACCTC 5220 

CAGCTAGATG TTTGAGAAAA AGATAGAGAT TGTAGGCGAT ACAGCTCATC ATCATACGAA 5280 

CTTCGTTTTT GATTAAGGTT GAACT 5305 
(2) INFORMATION FOR SEQ ID NO: 136: 
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(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 3964 base pairs 
(5) TYPE; nucleic acid 
(C> STRANDEDNESS : double 
(D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 

-TGGCAGCTCG TCGTCGTAAA GGACGCAAAG TTTTGGCTGC ATAATCCAAA CGAATTCTAT 60 

CAAAAATCAG TAGGAACTCG AGTCTACTGA TTTTTATTTT TGTAAAAAAG TTCAGTAGAT 120 

GCAAATGGAT TCGGAAGCGA TGTTACAGTA GATTGAAACT AGAATAGTAC ACCTCTGTTT 180 

CTAAAACATT GTTAGAAATC GATTTGACTG TCCTGATCGA TTTGTCCTGT TATTATTTTA 240 

TTTTACTATA AAGTTGAAGT AGGTGGAGAT GGTACAGCAA CAATCGTCTT TAAAGATGGT 300 

TCAGCTATTA CAATTCCAGG AAATCAATTG GTAGCACAAG ATCCAAAAGC ACAAGATAGC 360 

ACTAAACTGA CTGCTGAAAA ATCAACTGTT AAAGCACCTG CTCAAAGAGT AGATGTAAAA 420 

GATATAACTC ATTTAACAGA TGAAGAAAAA GTTAAGGTTG CTATTTTACA AGCAAATGGT 480 

TCAGCATTAG ACGGAGCGAC AATCAATGTA GCTGGAGATG GTACAGCAAC AATCACATTC 540 

CCAGATGGTT CAGTAGTGAC GATTCTAGGA AAAGATACAG TTCAACAATC TGCGAAAGGT 600 

GAATCTGTAA CTCAAGAAGC TACACCAGAG TATAAGCTAG AAAATACACC AGGTGGAGAT 660 

AAGGGAGGCA ATACTGGAAG CTCAGATGCT AATGCGAATG AAGGCGGTGG TAGCCAGGCG 720 

GGTGGATCAG CTCACACAGG TTCACAAAAC TCAGCTCAAT CACAAGCTTC TAAGCAATTA 780 

GCTACTGAAA AAGAATCAGC TAAAAATGCC ATTGAAAAAG CAGCCAAGGA CAAGCAGGAT 840 

GAAATCAAAG GCGCACCGCT TTCTGATAAA GAAAAAGCAG AACTTTTAGC AAGAGTGGAA 900 

GCAGAAAAAC AAGCAGCTCT CAAAGAGATT GAAAATGCGA AAACTATGGA AGATGTGAAG 960 

GAAGCAGAAA CGATTGGAGT GCAAGCCATT GCCATGGTTA CAGTTCCTAA GAGACCAGTG 1020 

GCTCCTAATG CTGCTCCTAA GACAACAAGT GCACCGCAAG CAACTGCAGG AACAATGCAA 1080 

GATGTTACCT ACCAGTCACC TGCTGGCAAA CAATTACCTA ACACAGGTTC AGCATCAAGT 1140 

GCAGCACTTG CTAGTCTTGG TCTAGTGGTG GCAACAAGTG GTTTTGCTTT GCTAGGAAGA 1200 

AAGACTAGAC GTAGAAAATA GAACAGCTAG AAAATTCTAT TCTCTACTTA AAGTTAGATT 1260 

ATAAGGGGGA TTTTGAGAAG TCATCAATCC TAGTGATGGG TGAGAAAAGT GAGAACCCAA 1320 

GATAATCACA TACTTTAGCT GAATAGGAAT ATTCTATCAA TGTAGCCAAT CTCTTCTGTC 1380 

TCTAACTGTG GAATAGGAGA TGGGCAATAT CGGATAGAAA AGATAGCAGA ATAGCTCTCT 1440 
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ATTGAAGAGA 


GGAGGGGAAA 


CCGAAAAATT 


AGGTGCCCCT 


CCTCTTTTTT 


GGTATAATAG 


1500 


AAGATAGAAA 


ACGAGGTTAG 


AAGAGATGAT 


TTTTGATACA 


CATACACACT 


TGAATGTAGA 


1560 


AGAATTTGCA 


GGTCGTGAGG 


CAGAAGAAAT 


TGCCTTGGCT 


GCTGAGATGG 


GTGTGACACA 


1620 


GATGAATATT 


GTTGGTTTTG 


ATAAACCGAC 


GATTGAGCAT 


GCCTTGGAGT 


TGGTAGATGA 


1680 


GTATGAGCAG 


CTCTATGCGA 


CTATTGGTTG 


GCATCCTACA 


GAAGCTGGTA 


CTTATACAGA 


1740 


GGAAGTTGAG 


GCTTACTTGT 


TGGATAAGTT 


AAAACATTCC 


AAGGTTGTGG 


CTTTAGGTGA 


1800 


AATTGGCTTA GATTACCATT GGATGACAGC 


GCCCAAAGAG 


GTGCAGGAGC 


AGGTTTTTCG 


1860 


CCGTCAGATT 


CAGCTATCTA 


AGGACTTGGA 


TTTGCCTTTT 


GTTGTCCATA 


CCCGTGATGC 


1920 


GCTGGAAGAT 


ACCTATGAGA 


TTATCAAGAG 


TGAGGGCGTT 


GGTCCTCGTG 


GTGGTATCAT 


1980 


GCATTCATTT 


TCAGGGACGC 


TTGAGTGGGC 


AGAGAAGTTT 


GTGGATCTTG 


GTATGACCAT 


2040 


TTCCTTCTCA 


GGAGTGGTGA 


CTTTTAAGAA 


GGCAACTGAC 


CTCCAAGAAG 


CAGCTAAAGA 


2100 


GTTACCTTTG 


GACAAGATGT 


TGGTGGAAAC 


AGATGCGCCT 


TACTTAGCAC 


CTGTACCCAA 


2160 


GCGTGGTCGT 


GAAAATAAAA 


CAGCCTATAC 


TCGCTATGTG 


GTCGACTTTA 


TCGCTGACTT 


2220 


GCGTGGTATG 


ACGACAGAAG 


AGCTGGCGGT 


AGCAACGACT 


GCAAATGCAG 


AACGAATTTT 


2280 


TGGACTGGAC 


AGCAAGTAAT 


GAAAGAGAAA 


ATTTCTCAAG 


TTATCGTGGT 


TGAAGGGCGT 


2340 


GATGATACGG 


TCAATCTCAA 


ACGTTATTTC 


GATGTGGAGA 


CCTATGAGAC 


TCGAGGTTCT 


2400 


GCCATCAATG 


CTCAGGATAT 


AGAGCGGATT 


CAGCGCCTGC 


ACCAACGTCA 


TGGAGTCATT 


2460 


GTCTTTACAG 


ACCCAGATTT 


TAATGGGGAA 


CGGATTCGGC 


GCATGATCAT 


GATGGTCATT 


2520 


CCAACAGTTC 


AGCATGCCTT 


TCTCAAGCGA 


GATGAAGCTG 


TTCCCAAGTC 


CAAGACCAAG 


2580 


GGGCGTTCTC 


TGGGAATTGA GCATGCCAGC 


TATGAAGACC 


TGAAAACGGC 


TCTAGCTCAA 


2640 


GTGACAGAAC 


AATTTGAACA 


TGAGAGTCAG 


TTTGACATTA 


GTCGTAGCGA 


TTTGATTCGC 


2700 


CTTGGTTTTC 


TAGCAGGGGC 


AGACAGCCGT 


AAGCGTAGAG 


AATATCTCGG 


AGAGACTCTC 


2760 


CGAATCGGCT 


ATTCCAACGG 


CAAGCAACTC 


CTCAAACGCC 


TAGAGTTGTT 


TGGGGTTACT 


2820 


TTGGCAGAAG 


TGGAAGAAGC 


TATGAAATCT 


TATGAGTAGG 


AAAGATGTAG 


CCGTTACAAT 


2880 


TTTTTAAGTT 


TCACAGTATT 


TTTCGAAGCA 


GGTAGAAGAG 


GAGGCGTCTG 


ATGTTAATTG 


2940 


GTCAAAAAAT 


TAAAGAGATT 


CGGATAGAAA 


AAGGAATTAG 


TCGTCCAGAT 


TTTTGTGGAG 


3000 


ATGAGCAAGA 


ACTGACAGTT 


CGTCAACTGT 


CGCGAATTGA 


AAGTGGAGCT 


TCGCAACCGA 


3060 


GTTTGCCCAA 


GTTAGACTAT 


ATTGCTCGCC 


GGCTAGGAGT 


TCCAGTTTAT 


AGCCTTATGC 


3120 


CGGATTTTTC 


AGCTCTTCCT 


TCTGCTTATT 


TAGAATTGAA 


ATACCAGATT 


TTACGTGAAC 


3180 


CAATCTATGG 


TAAAGAAGAG 


GAGTACGATA 


AGAAGGAAGC 


GTGTTTGGAA 


GAGATTTATA 


3240 
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AAACATACTT TGATAATCTT CCTAAAGAAG AACAATTAGC ATGTGAAGTA TTGCAGGCGT 3300 

GTTTGGATAC TTCTAGAACT AGAAGGCCTG AATATGCAGA GTTAATACTT GAGGAACATA 3360 

TGCCTCAGAT TATAGAAAAA GAAGCTTATT CAATAAATGA TATGTTGTTG ATTCGTTTGT 3420 

TTTTTTATCA AATGCTCATT AGAAAAGATC TTGCCAAATT TATAAATCAA ATCGAAAAGC 3480 

TAATGCTCTT TCTTTTGGAA CAGAAGAAGG TAACTCAAAT AGAGAATTAC TTTATAATTA 3540 

-GAGATACTCT TATTTCAGGA ATGTGTTGTC TTGAAAAGGT AGGAGTAACT GATTGTTTTA 3600 

ATGATTATCT ATCGTGTTTA CAAGAAATTA TGGATAAAAC TCAAGATTAT CAAAAGAAAC 3660 

CTCTTGTATT TATGTTTTTG TGGAAGCAAG CATTAAGAGA AGAAAGAGAT TTTAGTTTAG 3720 

CTGAATCATT TTATCAGTCT TCTAAAACAT TTGCGCAGCT AATTGGAGAT GAATTTCTAG 3780 

TAAAGAAATT GACAGAGGAA TGGCAAGAGG ATGTCAAAAA ATATTTATAA ACATAGTGAA 3840 

TCAGTGACAA AGATGTCCTT GTCCTCGTAT CAAAACAGTT CTAAAGTTCG TCTTTAGGGA 3900 

TGTTTTTTTA GATATAAGCT AAAAATGACA CGAAATGGTT AGATTTTAAG GACATTGATG 3960 

TCCG 3964 
(2) INFORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12666 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 

TGAGACCGTT ATTTGTATTA GGGAAATGGG TATCTATTTT TAATGCTGTG GGGATTTTGA 60 

TTGTTTCTAT TATTCAAACC AAAAGCTTGT CAGGTATTGG AGCAGGATTG TTTAATCTAT 120 

ATAACATTTC ATCTTATATA GGTGATTTAG TTAGTTTCAC TCGATTGATG GCATTAGGAT 180 

TATCTGGAGC AAGTATAGCA TCAGCTTTCA ATTTAATTGT TGGTTTGTTT CCGGGAATAT 24 0 

TGGCTAAACT GACAATTGGA TTAGTATTAT TCATTCTTTT ACATGCGATC AATATTTTTC 300 

TATCGTTACT ATCAGGATAT GTTCATGGAG CACGTCTGAT ATTTGTTGAA TTTTTTGGTA 360 

AGTTTTATGA GGGTGGAGGA AAACCATTTC AACCTTTGAA GGCTTCTGAG AAATATATTA 420 

AGGTTATTAC AAAGAATTAA TGGAGGATAT ATATAATGGA ACATTTAGCA ACTTATTTTT 480 

CAACCTATGG AGGAGCTTTC TTCGCTGCAT TGGGAATTGT ATTGGCGGTT GGATTAAGCG 540 

GTATGGGGTC TGCTTATGGA GTTGGTAAGG CTGGGCAATC TGCCGCAGCT TTACTGAAAG 600 
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AACAGCCTGA AAAGTTTGCC TCAGCTTTGA TATTGCAATT ATTGCCCGGA ACACAAGGAT 660 

TATATGGTTT TGTTATTGGA ATTTTAATTT GGTTGCAATT AACTCCAGAA CTTCCTTTAG 720 

AAAAAGGCGT TGCTTATTTC TTTGTAGCTC TTCCAATTGC TATTGTAGGA TACTTTTCAG 780 

CTAAGCATCA AGGAAATGTA GCAGTAGCGG GAATGCAAAT CTTGGCTAAA AGACCAAAAG 840 

AATTCATGAA GGGAGCAATT TTAGCTGCCA TGGTAGAAAC CTATGCAATT CTTGCTTTTG 900 

TCGTATCATT CATTTTGACC CTTCGTGTAT AAGAAATAAA TTTGCAATTC AAAGGAGGTG 960 

TCTAAATGAG CAATTTAGAA AACTTACGAG AGTCTGTTAT TGAACAAGCT CATGAAAAAG 1020 

GGCGTATGAA ATTATTGGAT TCCAAAAAGA AGATTGATGA TGAATTTGAA ATGCAAAAGT 1080 

CGCTCATTAT AAAGAAAAAA GAAGCTGAAC ATGAACGAAA GTTAAAAGAA TTGCAACAGA 1140 

AATATCAAAT AATTTTTCAA CAATTAAAAA ATAAGGAACG CCAATCAACG TTAGTATCAA 1200 

AACAGAAAAT ATTAAAAGAA CTTTTTCAAT CTGCTTTACT AGAAATGGAA TCTTGGAGTG 1260 

CAGATAAAGA AATGGAGTTC ATCTATCGAA TTCTGGAACG ATATTCACAA CAAGAGGTCA 1320 

TAGTAACCTT TGGGGAACGG ACTTTAGCTA AATTCAATTT GGAACAATTA GAGAAATTGA 1380 

AATTCTCTTT TCCAAATTAT TTATTTAGTG AACAACCTAT CTCAAATGAA TCAGGCTTAC 1440 

TTATTTCAAT AGGTAAAATT GATGATAACT ATTTGTATAA AACATTAATT GGATCGATTT 1500 

CTAAGGAAGA AAGTTCAAGT ATCGCAAATC AAATTTTTAT CAATTAAGGA TGAAATTGGT 1560 

TAATCCTTCT TAGAAATTTG GAGTATTCCA ATAAAATTAG AAAGGTATTT TATGGATACT 1620 

AATCTTTTTT CAAAAATAAA TACGACGATT TCGGTAAAAG AAAACGATTT TATTACAGAA 1680 

GAAAAATTTC AAAAAATTAT ACAATCCAAA GATACGGAGA CATTGGCATT TATCTTAGAA 1740 

TCAACTCCCT ATCATTTATC GATTGACATC TTAGAAGATC CTAGTCAGAC AGAGATTTCG 1800 

CTAATGACAA AATTAGTCAA TGATTATAGA TGGGCCTATG CTGAAAGTCC GTCTGATATA 1860 

ATTGTGACTT TATTTGCTTT ACGATATGTT TATCATAATA TCAAAGTTTT ATTAAAATCT 1920 

AAGGCGG CAA TTAAGAAAGA TTTTTCTAAA TTATTAATTC CAATAGGGAT TTTTGATATA 1980 

GAAAGTTTAA AACATTTAGT TTCTTCCTTA CATTCAGATA CACTTCCTGA TTTTATGGTT 2040 

CGTGAAGTAG AATCAATTTG GAATGAGTAT GAAACTTTTA ATAATATTCG TGTACTTGAT 2100 

GTCGGAGCTG ATCTAGCATA TTTTAAACAT CTGAAACTTT TATCTAATGA GTTAGATGAG 2160 

GTACTGTCTC AGGTTATTGT CGAAATGATT GACTTTTATA ATATTATTAC TGTAAAACGT 2220 

GGTTTATCTC AAAATAAGAG TCATGGGGAT ATTTTACAAT TACTTTCAGA TGAAGGAAGT 2280 

ATTTCTGCTA AAGAATTTAT ATACATTGTA GAAAATCAAG AAATATTTGT GTGGTTCAAT 2340 

AAAATAAATC CAAGCTTAGA TTCAATCTTT TCAACTTATG AATTGAAGAT GCAGGACGCA 2400 
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ACAATTTCAT 


CTTCTGAGTT 


AGAATTTTTA 


TGTGATTTAC 


TATTGTATAA 


AACTTTAGAT 


2460 


CAAGGAAGGT 


ACAATGTAGA 


GGGGCCGTTA 


GTTCTTGCTA 


GATATTTATT 


GGGATGTGAG 


2520 


TTTGAAGTAA 


AGAATCTCAG 


AATGATCATA 


TCAGCTCTTC 


AAAATACAAT 


TCCCTTTGAA 


2580 


TCAATAAAAG 


AAAGGATACG 


CCCACATTAT 


GGAAGCTAAT 


AAGTATAAAA 


TTGGCATAAT 


2640 


TGGTAGCCGT 


GATATTATTT 


TACCATTTAG 


CATGATTGGG 


TTTGATATAT 


TTCCTGCCTA 


2700 


CCAAGAACAA 


GAAGCTATAA 


ATACACTAAG 


AAAATTAGCT 


CAATCTGATT 


ATGGTGTCAT 


2760 


TTATATCACT 


GAAGACATTG 


CTTCAATGAT 


ATTAGATACA 


ATTCGCCATT 


ATGATTCCCA 


2820 


AGTTGTGCCT 


GCTATTATTT 


TATTACCGAC 


TCATAAACAA 


GGTTTAAATT 


TAGGATTAAA 


2880 


ACGTATAGAG 


GATAATGTAG 


AGAAAGCAGT 


AGGACACAAT 


ATTTTATAAT 


AATGTACAAA 


2940 


ATTGTCTGTA ATATTATTCT ATAATTTTTG GACTTAGTAA GGAGAATAAC 


TTTGACTCAA 


3000 


GGGAAGATTA 


TAAAAGTATC 


GGGACCTCTA 


GTTATTGCAT 


CAGGTATGCA 


GGAGGCTAAT 


3060 


ATTCAAGATA 


TTTGCCGTGT 


AGGTAAGCTA 


GGGTTAATCG 


GTGAAATTAT 


TGAAATGAGA 


3120 


AGAGATCAGG 


CATCTATCCA 


AGTCTATGAA 


GAAACATCTG 


GTCTTGGTCC 


GGGAGAACCT 


3180 


GTTGTTACAA 


CTGGAGAACC 


TCTCTCGGTT 


GAATTAGGGC 


CAGGATTGAT 


TTCTCAAATG 


3240 


TTTGATGGCA 


TACAACGCCC 


ATTAGATCGA 


TTTAAATTGG 


CTACTCATAA 


TGATTTTCTA 


3300 


GTTCGTGGGG 


TAGAAGTTCC 


AAGTTTGGAT 


AGAGATATTA 


AGTGGCATTT 


TGATTCCACT 


3360 


ATAGCAATTG 


GTCAAAAAGT 


GAGTACGGGT 


GATATTCTTG 


GAACTGTCAA 


GGAAACCGAG 


3420 


GTAGTTAATC 


ATAAAATTAT 


GGTTCCTTAT 


GGAGTATCTG 


GAGAAGTCGT 


TTCTATTGCA 


3480 


TCTGGCGATT 


TTACAATTGA 


TGAAGTTGTA 


TATGAAATAA 


AAAAATTGGA 


CGGTAGTTTC 


3540 


TATAAAGGAA 


CGCTTATGCA 


AAAATGGCCT 


GTCCGCAAGG 


CGCGTCCTGT 


TTCTAAACGT 


3600 


TTAATTCCAG 


AAGAACCATT 


AATCACAGGT 


CAACGAGTTA 


TTGATGCATT 


CTTTCCAGTA 


3660 


ACCAAAGGGG 


GAGCTGCAGC 


AGTTCCTGGA 


CCGTTTGGAG 


CAGGAAAGAC 


AGTTGTACA* 


3720 


CACCAAGTAG 


CTAAATTTGC 


CAATGTTGAT 


ATTGTTATTT 


ATGTCGGTTG 


TGGAGAACGT 


3780 


GGAAATGAAA 


TGACGGATGT 


ACTGAATGAG 


TTTCCTGAGT 


TGATTGACCC 


TAATACCGGA 


3840 


CAATCAATTA 


TGCAACGGAC 


AGTTCTGATT 


GCTAATACTT 


CAAATATGCC 


TGTTGCTGCT 


3900 


CGTGAGGCTT 


CAATTTATAC 


AGGAATTACC 


ATGGCTGAGT 


ATTTTCGTGA 


TATGGGCTAC 


3960 


TCTGTCGCCA 


TTATGGCTGA 


TTCAACTTCA 


CGTTGGGCAG 


AAGCGCTACG 


TGAAATGTCA 


4020 


GGACGTCTAG 


AAGAAATGCC 


TGGTGATGAG 


GGTTATCCTG 


CTTATCTGGG 


AAGTCGTATC 


4080 


GCTGAATATT 


ATGAAAGAGC 


AGGACGTTCT 


CAGGTTCTAG 


GGCTTCCAGA 


ACGTGAAGGA 


4140 
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ACGATTACTG CTATTGGAGC TGTATCGCCA CCTGGTGGAG ATATTTCAGA ACCAGTTACT 4200 

CAAAACACTT TACGGATTGT GAAAGTTTTT TGGGGGCTTG ATGCTCCGTT GGCACAGCGA 4260 

CGTCATTTTC CTGCAATTAA CTGGCTTACA TCTTATTCAC TATATAAAGA CAGTGTGGGC 4320 

ACTTATATAG ATGGTAAAGA GAAGACAGAT TGGAATAGTA AAATAACTCG TGCGATGAAC 4380 

TACTTACAAC GGGAATCTAG TTTAGAGGAA ATTGTTCGTC TTGTTGGAAT TGATTCTCTG 4440 

TCTGATAATG AACGACTAAC GATGGAAATT GCTAAACAAA TTCGAGAAGA TTATTTGCAA 4500 

CAGAACGCTT TTGATTCGGT AGATACATTC ACTTCGTTTG CAAAACAAGA AGCAATGCTA 4560 

AGTAATATTC TCACTTTTGC TGATCAGGCA AATCATGCTT TAGAGTTGGG TTCTTACTTT 4620 

ACAGAGATTA TGGAAGGTAC CGTGGCAGTT CGAGACCGTA TGGCGAGAAG TAAATATGTT 4680 

TCAGAAGATA GATTAGATGA AATCAAAATT ATATCAAATG AGATTACACA TCAAATTCAT 4740 

TTGATATTAG AAACAGGAGG TCTATAAATG AGTGTTATAA AAGAATACAG AACTGCTAGT 4 800 

GAAGTTGTTG GGCCTCTTAT GATTGTTGAA CAAGTAAATA ATGTGTCTTA CAATGAGTTA 4 860 

GTTGAAATTC AACTTCATAA TGGAGAAATT CGTCGTGGAC AAGTTTTAGA GATCCACGAA 4 920 

GATAAAGCAA TGGTTCAGCT TTTTGAAGGA TCTAGTGGAA TAAATTTAGA AAAGTCTAAA 4980 

ATTCGTTTTG CTGGTCATGC ATTAGAATTG GCTGTATCTG AGGATATGGT TGGTCGTATT 5040 

TTTAATGGGA TGGGAAAACC AATTGATGGT GGACCAGATT TAATTCCAGA GAAATATTTA 5100 

GATATTGATG GTCAAGCTAT TAATCCTGTA TCTAGAGATT ATCCAGATGA ATTTATTCAG 5160 

ACAGGGATCT CCTCTATTGA TCATTTGAAT ACTCTTGTAC GTGGTCAAAA ATTACCAGTA 5220 

TTTTCAGGTT CGGGCTTACC TCATAATGAA TTAGCTGCTC AGATAGCAAG ACAAGCGACT 5280 

GTTTTAAATT CTGATGAAAA TTTTGCGGTT GTATTTGCAG CAATGGGTAT TACTTTTGAA 5340 

GAAGCTGAGT TTTTTATGGA AGAACTCAGA AAAACAGGAG CGATCGATCG TTCGGTTTTA 5400 

TTTATGAACT TGGCAAATGA TCCTGCAATT GAGCGTATTG CAACTCCCCG CATTGCTTTA 5460 

ACTGCGGCAG AGTATCTAGC TTTTGAAAAA GATATGCACG TTCTAGTTAT CATGACGGAT 5520 

ATGACTAACT ATTGTGAAGC GTTACGTGAA GTCTCGGCAG CTCGCCGTGA AGTTCCAGGG 5580 

AGACGAGGCT ATCCGGGATA TTTATATACA AATTTATCAA CTCTATACGA AAGGGCTGGT 5640 

CGCTTAGTTG GTAAAAAAGG TTCGGTGACA CAGATTCCTA TTTTAACAAT GCCAGAAGAT 5700 

GACATAACAC ATCCAATTCC TGATTTAACT GGATACATTA CTGAAGGGCA AATTATTTTG 5760 

TCGCATGAGT TGTATAATCA AGGTTATCGT CCACCAATCA ATGTTTTACC TTCTCTCTCT 5820 

CGATTAAAAG ATAAGGGATC TGGAGAAGGT AAAACTCGTG GAGATCATGC TCCAACTATG 5880 

AATCAACTGT TTGCAGCCTA TGCCCAAGGG AAAAAGGTTG AAGAGTTAGC AGTAGTATTA 5940 
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GGAGAATCGG 


CTTTATCTGA 


TGTAGATAAA 


TTGTATGTGA 


GGTTTACAAA 


GCGTTTTGAA 


6000 


GAAGAGTACA 


TAAACCAAGG 


ATTTTATAAA 


AATCGAAATA 


TAGAAGATAC 


GTTGAATCTT 


6060 


GGGTGGGAAT 


TACTATCAAT 


TCTTCCTAGA 


ACAGAGTTAA 


AACGTATCAA 


AGATGATTTG 


6120 


CTTGATAAAT 


ACTTACCTTT 


GGTAGAAGTT 


TAATCCGGAA 


ATGGAGTGAT 


TATCTATGGT 


6180 


ACGTTTGAAT 


GTAAAACCAA 


CTCGTATGGA 


ATTGAATAAC 


TTAAAGGAAC 


GTTTGACAAC 


6240 


AGCTGAACGT 


GGACATAAGT 


TATTAAAGGA 


TAAAAGAGAT 


GAATTGATGA 


GGCGATTTAT 


6300 


TTCTTTGATT 


CGTGAGAATA 


ATCAACTTCG 


GAAAGAAGTG 


GAAAGTTATC 


TAATTGATAA 


6360 


TCTAAAATCC 


TTTGCAGTTG 


CTAAATCATT 


AAAGAATTCT 


CAAATGGTGG 


AGGAATTATT 


6420 


TTCAATTCCA 


TCGAAAGAAA 


TTGAATTATT 


TGTTGAGAAA 


GAAAATATCA 


TGAGTGTAAC 


6480 


AGTTCCTAGA 


ATGCATATGA 


ATATTACTTC 


TCAAAATGAG 


AACAGTGAAT 


ACAGCTATTT 


6540 


ATCTTCTAAT 


AGTGAAATGG 


ATGATGTATT 


TGCTACAATG 


AATAGTTTAA 


TTTATAAATT 


6600 


ACTAAGACTG 


GCAGAAGTTG 


AAAAAACGTG 


TCAGTTAATG 


GCTGATGAAA 


TAGAAAAAAC 


6660 


ACGTAGACGT 


GTAAATGGTT 


TAGAATACTC 


GATTATTCCA 


AACTTGTCGG 


AAACTATTCA 


6720 


TTATATAGAA 


TTGAAACTAG 


AGGAGGCAGA 


AAGAGCCAAT 


TTAGTTCGTA 


TTATGAAAGT 


6780 


GAAGTAGATC 


CTTTATTTAG 


ATTATTAATT 


AGATGAACAA 


ATATCAGCTT 


GGATAAGGCT 


6840 


TTAAGCCTTT 


CTAAGCTTTT 


TTTATTGACA 


GTATCAGGAT 


ATCTTTTTCA 


AAATTTTGGT 


6900 


TTGTTAGATA ATGAAAATGT TTCTACTAAT CTAGATTTAG 


GATTAGTAAA 


TCGTAAATGT 


6960 


AATTATATAG 


AAAGTAAGCG 


CGTCATAACA 


AGGTATCTAT 


CATTCATGGA 


GCTCCTCCTG 


7020 


TATACTATTA 


GTAAAGTAAA 


ACTATTGGAG 


GATATTTTAA 


TGCCACAACC 


TATTGTTCCT 


7080 


GTAGAGATTC 


CACAATCTCG 


TCGTTTTGAT 


TCTAAAAAGA 


GAAATGATAT 


TCTGCTTAAA 


7140 


ATTCGTATTG 


GCAAGCTTGA 


AGTAAGTTTT 


TTTCAATCTC 


TCAATCTCGA 


AATGGTAGAA 


7200 


CAGCTTTTGG 


ATAAGGTGTT 


GCTCTATGAC 


AATTCATCTA 


TCTAGCCTAG 


GGGAGGTCTA 


7260 


TCTCGTGTGT 


GGGAAAACTG 


ATATGAGACA 


AGGAATCGAT 


TCACTGGCTT 


ATCTGGTTAA 


7320 


AACCCACTTT 


GAATTGGATC 


CTTTCTCCGG 


TCAAGTCTTT 


CTCTTTTGTG 


GTGGACGTAA 


7380 


AGACCGCTTT 


AAAGTCCTTT 


ACTGGGATGG 


TCAAGGATTT 


TGGCTACTAT 


ATAAACGCTT 


7440 


TGAGAACGGC 


AGATTGATTT 


GGCTAAGTAC 


AGAAAAGGAT 


GTCAAAGCTC 


TCACACCAGA 


7500 


ACAAGTAGAC 


TGGCTTATGA 


AGGGCTTTTC 


TATCACTCCA 


AAAATATAGT 


AGATTGAAAC 


7560 


TAGAATAGTA 


CACCTCTGCT 


TCTAAAACAT 


TGTTAGAAAT 


CGATTTTACT 


GTCCTGATCG 


7620 


ATTTGTCCTG 


TTCTTATTTC 


ATTTTACTAT 


AAATCCATCA 


GAAAGTCGTG 


ATTTCTATTG 


7680 
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AAATGAGGAC 


TTTCTTTTTA TACTCATCTG 


CTTTCAAAAA 


GCATTCTAGT 


v_v_*H i 1_ 1 i_L.vsA 


/ I H U 


TTAACGATGG 


ACTTTATCAC CTCCTTCTCC 


AGTCCTTGTA 


TAACATCTTG 


uAu 1 1 VjA 1 iL 


/oUU 


ATGACATCTT 


CCAAAGTTTA AAAGGCTTTA 


TTCTTAAATC 


CACGTTTACG 


AATCTCTTTC 


7860 


CACACTTGTT 


CAATGGGGTT CATCTCTGGT 


GTGTATGGAG 


GAATAAATGC 


AAAGCCAATA 


7920 


TTAGTCGGAA 


TCTTTAAGGT ACTTGATTTA 


TGCCATATAG 


CATTGTCCAT 


AACGAGTAAA 


7980 


AGATAATCAT 


CTGGATAAGC TTGTGAAATC 


TCCTATTCCT 


AAAGCCCCTT 


TAGCGCATAA 


8040 


CTTTGGCTCA 


GCTTCTATTA TCGCTCACAC CATCCATCAG AAGTTTAATC 


TGAAGGTACC 


8100 


CAATTATPfir 


CAAGAAGAAG ATTGGGCTAG 


GATGGGTTTA 




GTAAGGAAAT 


8160 




CATATCAAGG CGAGTCAATA 


CTATTTGGAG 


PP P f , T v PT A *P A 


ACCTCTTGCG 


8220 


AGAfiARAPTA 


TTGACTCAGC CCTTACTTCA 


TGCGGATGAA 


ACTTCTTATA 


GGGTGCTAGA 


8280 


GAGTY3ATAt"5T 


CAGCTGACTT ACTATTGGAC 


TTTTTTGTCA 


GGTAAAGCAG 


AGAAACAAGG 


8340 


GATTAPfiPTT 


TACCACCATG ATCAGTGTCG 


AAGTGGTTCA 


GTAGTACAAG 


AATTCCTAGG 


8400 


AGATTATTPT 


GGCTATGTGC ATTGTGATAT 


TTTGCGGCAG 


TAACTTAGGA 


CTTTAGTCCT 


8460 




CTATGCGATA GCAGTCCAAG 


GTTTAGGAGC 


AAGGCGACGC 


TAAGCTTGGT 


8520 


****** ^— A 1 V»VJ/V\ 


CCGCTCGTCT GCTTATCGTC 


AACTGGAAGA 


AGCTGAACTT 


GTTGGATGTT 


8580 


GGGCGCATGT 


UAuAAuunAu 1 1 1 1 1 i (j AA(j 


CGCCCCCCCA 


AGCAAGCGGA 


TAAATCATCC 


8640 


TTAGGAGCTA 


AAGGTTTAGC TTATTGTGAT 


CAGTTATTTT 


CCTTGGAAAG 


AGACTGGGAG 


8700 


GCTTTG CCAG 


CTGATGAACG ACTACAGAAA 


CGTCAAGAAC 


ATCTCCAGCC 


CTTAATGGAA 


8760 


GACTTCTTTG 


CTTAGTGCCG GCGTCAGTCA 


GTTTTAGCAG 


GTTCAAAACT 


AGGAAGGGCA 


8820 


ATTGAATACA 


GCCTCAAGTA TGAAGAAACC 


TTTAAGACCA 


TTTTGAAAGA 


t_GCjAL.ATCTG 


8880 


GTCCTTTCCA 


ATAATCTAGC TGAACGCGCC 


ATTAAATCAT 


TGGTTATGGG 


JV O/V X f*'!* TV IV IV 

ALtjOAvj 1 AAA 


8940 


AGAGTCCAGT 


GGACTCTTTT AGCCTAAGCT 


CAGTTTAAAA 


AAGCGAGGGT 


Vjo ilAlJliL 


O f\ Art 

y uuu 


TCAAAGTTTT 


GAAGGAGCTA AAGCAAGAGC 


TATTATTATG 


AGTTTGTTGG 


AAACAGCTAA 


9060 


ACGTCATCAA 


TTAAATAGCG AGAAATATCT 


ATCCTATCTT 


CTAGAATGTC 


TTCCAAACGA 


9120 


GGAAACTCTC 


GTAAACAAAG AGGTTTTAGA 


GGCTTATTTA 


CCATGGACTA 


AAGTTGTACA 


9180 


AGAAAAGTGC 


AAATAAGAAA TCTCCAGATT 


AGGAACTATC 


CGTGAGTTCT 


CCAGTCTGGA 


9240 


GATTTTTCAA 


TAGACTTCCT GCGAAACAAA 


ATATGGTATA 


ATAGTTCTAT 


GAATGATGAA 


9300 


GCAAGTAAAC 


AACTAACCGA TGCACGATTT 


AAGCGTCTTG 


TTGGTGTTCA 


ACGCACGACT 


9360 


TTTGAAGAGA 


TGTTAGCTGT ATTAAAAACA 


GCTTATCAAC 


TTAAACACGC 


AAAAGGTGGA 


9420 


CGAAAACCTA 


AATTAAGTCT AGAAGACCTT 


CTTATGGCCA 


CTCTTCAATA 


TGTGCGAGAA 


9480 
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TATCGAACTT ATGAACAAAT TGCGGCTGTT TTTGGTATTC ACGAAAGCAA CTTAATCCGT 9540 

CGGAGCCAAT GGGTTGAAGT AACTCTTGTT CAAAGTGGTG TTACGATTTC AAGAACTCCT 9600 

CTCAGTTCTG AGGACACGGT AATGATTGAT GCGACGGAAG TAAAAATCAA TCGCCCTAAA 9660 

AAAAGAATTA GCGAATTATT CTGGTAAAAA GAAATTTCAC GCTATGAAGG CTCAAGCGAT 9720 

TGTCACAAGT CAAGGGAGAA TTGTTTCTTT GGATATCACT GTGAACTATT GTCATGATAT 9780 

GAAGTTGTTC AAAATGAGTC GCAGAAATAT CAGACAAGCT GGTAAAATCT TGGCTGACAG 9840 

TGGTTATCAA GGGCTCATGA AGATATATCC TCAAGCACAA ACTTCACGTA AATCCAGCAA 9900 

ACTCAAACCG CTAACAATTG AAGATAAAGT CTATAACCAT GCGCTATCTA AGGAGAGAAG 9960 

CAAGGTTGAG AACATCTTTG CCAAAGTAAA AACGTTTAAA ATGATTTCAA CAACCTATCG 10020 

AAATCATCTA AACGCTTCGG ATTACGAATG AATTTGATTG CTGGTATTAT CAATCATGAA 10080 

CTAGGATTCT AGTTTTGCAG GAAGTCTATT ATCAAAAATA CCATCAAGAT TATATAAGAT 10140 

TGATACAGGA AAAGTTTTAT TTGATGGTGT AAATATTAAT CAAATAGATA AAAAAATATT 10200 

AAGTCAAAAT TTAGGAGTAG TTCCACAGGA TTCATTTTTA TTGAACCGAA GTATTCTTGA 10260 

TAATATAACT TTAAAGCACG AAGTTACTTC ACAAAAGATA GAGGAAGTTT GTAAAGCAGT 10320 

TCAAATCTAT GATGAAATCA TGGCTATGCC GATGAAATTT AATACTATCA TCTCAGAGAT 10380 

GGGGTCAAAT ATTTCAGGTG GGCAAAGGCA ACGGATAGCA CTGGCACGTG CATTAATAAA 10440 

TAATCCTAGT ATTGTAATTT TAGATGAAGC AACTAGTGCA TTAGACACTA TTAATGAGGA 10500 

AAGAATAACA AAGTATATAC AAAGTCAGGG CTGTACTCAA ATAATTGTAG CTCATAGATT 10560 

GTCAACGATT AAGGATGCGG ATGTTATTTT TGTAATGAAA GGTGGTAAGA TTGTTGAGTC 10620 

AGGAAATCAT AAGTACTTAA TGGATCTTGG TGGAGAGTAC TACAGCTTAT ATACAAAAAG 10680 

GAAATGAGGT GTAAAGAAAA TGAAGAAAGA AAATGAATAT GTAATTTTAA CAACAGCCTC 10740 

ACTAGGGGTG ATGATTGGAA TAGTGTTTGC AATTTTTTTA GATTTTCCAG TTGAATATGG 10800 

TATTTCTTTA GGCTTGTTGA ATGGAATAGT ATTGGGTTCG CTGATTGTTT ACAAAAACAA 10860 

TAAGAATTAA GCATAATTTT TTGCTGTAAA CTAAGGAGTA GAGATGGCTA TAGTTGAAAT 10920 

TATAAATCTA ACAAAAAGCT TTAAAGATAT TGAAGTTATT CATAACACTT AAATAATAGA 10980 

GCAACTACAG TAGTAGCTTA AAAACATGAT TAAATCGCTA TTCTTAGGAG TAGCGGTTTT 11040 

TCTTTTTGTT TAATACTCTT TGAAAATCTC TTCAAACCAC GTCAGCTTTG CTTTACCGTA 11100 

CTCAAGTACA GCCTGCGGCT CGCTTCCTAG TTTGCTCTTT GATTTTCATT GAGTATAAAA 11160 

AGGGTCAAGT AAGTATAGTA AATTGAAATA AGATATGAAC AAATCGATTA GAAAAGTCAA 11220 



WO 98/18931 



PCT/US97/19588 



928 



ATTAATTTCT 


AGAAATATGT 


TAGAAATTGG 


TTTGAATTCC 


GCAATCAATT 


ivjl l\.nu 111 


11280 


TTATTTCATT 


TCATTTTATT 


TAATTAGATT 


TTCCAATTTT 


TTAATTPAAP 


PTAAAAATPP 


11 JflU 


CCAATCGTAG 


TGATTGAGGA 


TTGAGTAAAT 


AAATPTTAAA 

****** a \* *. nAn 


PA ATAPPTTP 




XX4UU 


GCATTTGCTA 


CATTTTCAAA 


GGCAGCAATG 


TTAGCTCCTG 




T w 1*7^ TAT 1 /"' 7A & i"* 7A 
1 1 1 rt 1 LAnuA 


114 OU 


CCGTATGTTT 


CTGAAGTCGT 


TTTAGCTGTG 


TTGAAGATGT 




r2nir"T w T i T | p a r* a 

ulv.1 1 i uA^jA 




CGGCCATCAA 


CTTCTTCACG 


AGTCCATGAG 


AGGPfiAAfiAP 


ly 111 lAjtjC 1 


LA ill L. AAGA 


11580 


GCTGAAACGG 


CTACACCACC 


AGCCVPTClCiCA 


VjV_ ill llaLAu 


G 1 L.L.G 1 AGAA 


GATACCATTT 


11640 


TCTTTGTAAA 


CTTTGATGGC 


ATP A A CidWCCZ 
fx A utnuu i. 


^- 1 1 vj 1 


IGGGACL.! It, 


AGATACACAG 


11700 


ATAACGCCTT 


GAGCAACCAA 


t\\-\j 111 A<jL 1 


GCTTCACCGT 


XXaATTTCGTT 


TTGAGTGGCA 


11760 


CATGGAAGAG 


CAATGTCATA 




I AAG I CL AT A 


CAGTACCTTC 


GTGGTAGGTT 


11820 


GCAGTTGCTT TTTCAGCTGC 


e\ 1 1 1 


AAAv.GAGQ.AC 


GACGTTTTTC 


TTTAACATCA 


11880 


ACCAAAAGAT 


CGAAGTCGAT 


ftLLfti 1 1 ILpA 


1 L.GATGACAT 


AACCATTTGA 


GTCAGAAACA 


11940 


GAAATAACAG 


TTGCACCGAG 


1 lVAljl 1 VjV_ 1 


1111 GAAG AG 


CATATTGAGC 


AACGTTACCA 


12000 


GAACCTGAAA 


TAACGACTTT 


CTTACCAGCA 


AAGCTGTTAC 


CGTTAGCTTT 


GAGCATTTCT 


12060 


TCAGTATAGT 


AAACCAAACC 


GTAACCAGTT 


GCTTCTGGAC 


GAA1 CAAGL. r 


ACCACCAAAT 


12 120 


CCAAGAGGTT 


TACCAGTCAA 


GACACCAGCA 


TCAAATTGGT 


m a ja/iji /"v"i rprpm 
liSnunLul 1 1 


G 1 A I TGACCG 


12180 


TAAAGGTAAC 


CAATTTCACG 


TCCACCAACA 


CCGATATCAC 


GAGCAGGTAC 


GTCAAGTGAT 


12240 


GGTCCGATGT 


GTTTTTGCAA 


TTCAGTCATG 


AAGCTTTGGC 


AGAAGCGCAT 


CACTTCAGCA 


12300 


TCTGTTTTAC 


CTTTAGGATC 


GAAGTCTGAT 


CCACCTTTAC 


CTCCACCGAT 


AGGAAGTCCA 


12360 


GTCAAGACAT 


TTTTAAAGAT 


TTGTTCAAAT 


CCGAGGAATT 


TCAAGATCCC 


TTGGTTTACA 


12420 


GTTGGGTGGA AACGAAGTCC ACCTTTGTAT GGTCCAACAG 


CTGAGTTGAA 


TTGAACACGG 


12480 


TAACCACGGT 


TTACTTGAAT 


TTTTCCATCA 


CGGTCAACCC 


AAGGAACACG 


GAAAGAAACC 


12540 


ACGCGCTCAG 


GCTCAGTAAT 


ACGTGCCAAG 


ATATTTTCTT 


CGATATACTC 


AGGGTGTTTT 


12600 


TCAAATACAG 


GTTCTAAAGT 


GTTGAAAAAT 


TCTTCAACAG 


CTTGGAGGAA 


TTCAGCCTCG 


12660 


TGCCGG 












12666 



(2) INFORMATION FOR SEQ ID NO: 138: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3083 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 

AGCAACTGTT GTGAACCAAT TCCGATAAAT TCCAAGAATT GGTTAATAGA GCCATTTTGA 60 

CCAAAAATCC CGATAAAAGC ATAGGCTTTA AGGAGCAAAT TGATCCAGGT AGGAAGGATA 120 

ATCAGCATGA GCCAGAGTTG ACGGTGTTTG AGACGGGTCA AAAAGAGGGC CGTCGGATAA 180 

CTGATAAGCA GTGCCACAAA GGTCACAATG CCTGCATAAA GCACTGAGTT GAAACTCATT 240 

TTAAGATAGG TCAAGTTTTG TGACGCAAAG TAAGATTTGT AATTTTCTAA ACTGAACTGG 300 

CCTTCGATGT TGAAAAAGGA TTGACCGAAA ATCAAGACCA AGGGTGCCAA TACAAAGAGC 360 

GCAATCCAAA GCATGTAGGG TACTACAAAG AGTTTAGAGC TTGTTTTCTT CATCTCTTTC 420 

CTCCTCGATT GCATTGATCA AACCTGCTTC TTGCTCTTCG ATTTCTACGT ACTCCTCAAT 480 

ACGAGCATCG AACTCTTCTT CGGTTTCATT GAGACGCATG ATGTGGATGT CTTCTGGTTC 540 

AAAGTCCAGA CCGATTTCCT CACCCACGAT AGCCTTACGG GTTGAGTGGA TCATCCATTC 600 

ATTTCCAAGT TCGTCATAGG CGATAATTTC ATAATGAACT CCACGGAAAA GCTGGGTATC 660 

GACCTTAACT TGGAGCTTGC CTTCTTCAGG AAGGGTAATG CGCAAGTCCT CTGGACGAAT 720 

AACGACCTCA ACAGGTTCAT TTGGCTTCAT CCCACCATCA ACCGCTTCAA AGCGTTTGCC 780 

GTTAAATTCG ACCAAGTAGT CCTCAATCAT GGTACCTGGC AAGATGTTTG ACTCCCCGAT 840 

AAAGGTGGCA ACAAAGTGGT TGATTGGCTC ATCGTAGATG TCCACAGGGG TTCCAGACTG 900 

GACAATCTCG CCATCATTCA TAACGAAAAT CCAGTCACTC ATGGCAAGAG CTTCTTCCTG 960 

ATCGTGAGTG ACAAAGACAA AGGTAATGCC CAATCGTTGT TGTAATTCAC GCAATTCGTA 1020 

CTGCATGTCT GTTCTCAATT TCAAGTCCAG CGCTGATAAA GGCTCGTCCA ACAAGACCAC 1080 

ACGGGGTTGG TTGATGATAG CACGGGCGAT GGCCACACGC TGACGTTGTC CTCCAGAAAG 1140 

TTTGCGGATG GAACGTTTTT CATAACCTTC CAACTGAACC ATCTTGAGAA CTTCCGCTAC 1200 

ACGCTGCTCG ATTTCTTTCT TATCAATTTT ACGCAAGCGA AGTGGAAAGG CAACATTTTC 1260 

AAACACATTC ATATGTGGGA ACAAGGCATA GGATTGGAAG ACGGTATGTA CGTCGCGCTT 1320 

GTTGGTTGGA ATATCATTGA TACGAACACC GTCTAGCATG ATATCTCCTG TCGTCGCATC 1380 

CAGTAAACCT GCAATAATGT TTAGGATAGT TGATTTCCCC GAACCAGATG CACCTAGAAG 1440 

GGTGTAGAAT TTCCCTTCTT CCAACTCAAA GTTGATGTCT TTGAGAACCT TGGTGTTGCT 1500 

GTCTTCAAAA ACTTTAGAGA CGTTTTTGAA TTCGATAATT GGCTTTTTCA ATTGGCATAA 1560 

ATTCCTTCTT TTTCATAGAT TAACCGATCG GGGCTCTGTC AGGTCCCCAC TACCTCTTGC 1620 

AGGGAGTAAA ACCACCTGCA TACATCTTCG CTACCGATAG GCTTTCACCC AAGATCCGGA 1680 
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CTTCTCTTTC AAGCGTAATA CCTGAGTGTT CCTTGACTTT' TTCGATAACC GATTGGATCA 1740 

AGTCCTCGTA GTCTTTGGCC GTTCCATCTG CGACATTGAT CATAAATCCT GCATGCTTTT 1800 

CTGACACTTC TACGCCACCG ATACGATAGC CTTTCAAGCC AGCTTCTGAA ATTAACTGAC 1860 

CTGCAAAATG CCCGACTGGA CGCTTAAAGA CCGAGCCACA AGATGGGTAT TCCAAAGGTT 1920 

GCTTGAGTTC ACGTAGGTGC GTCAAGCGGT CCATTTCCTG CTTGATAACC TGATGGGTTC 1980 

GTGGAGCTAG GGCAAATTTA ACTGACAAGA CAACTGCACC AGACTCCTGA ATAGCTGAAT 2040 

GACGGTAACC AAAAGCCAAG TCTTTAGCAG ACAGGGTTTC GATTTCTCCA TCCTTGGTCA 2100 

AGACCTTACA AGACTGCAAG ATGTGAGCAA TCTCGCCACC ATAGGCACCC GCATTCATAA 2160 

AGACAGCACC GCCAACGCTT CCTGGAATAC CACAAGCAAA CTCAAAGCCA GTTAAACTAT 2220 

GACGGAGGGC AATGCGAGTT GTTTCAATCA AGTTAGCCCC AGCTTCTGCT TCAATGGTAT 2280 

AGCCATCAAC AGAAACGTTA TTGAGCTTGT CACACAAGAT GACAAATCCA CGAATCCCAC 2340 

CATCACGAAC GATGATATTG CTTGCATTGC CAAGAACCAT CCAAGGGATA TTTTCTTGGT 2400 

TGGCAAATTT CACAACGCGA GCCAACTCAA AACGATTTCG TGGAAAGACC AAATAATCAG 2460 

CCTCTCCACC TACTTTTGTA TAACTATAGC TATGCAAGGG TTCCTTAAAA CGGATATCAA 2520 

TTCCTTCTAA GATTTCAAGC ATTTTTTCTC TTACAGACAT GTCACTCTTC CTTTTACAAA 2580 

ATTCATTCCA TTATACCATT TTTAGAGACA TTTGACGACC ATAAAAATAC CTTGTTTGGA 2640 

TTTTGCATAA GAAAAAGAGG TTCCCCCCTT TTTATGATTT TTTACAAAAG ATTTCCTTGG 2700 

TTCCATAGGC GACCAGAACG AGCTCCAGTG CTAGAATCAC TTCAACCAAG ACTGGATTTG 2760 

TCAACCAGCC TACTTGGAAA AGAGATGGTG CCAGATCAAA GAAGGCATGC AAGCCATAGG 2820 

CTGCTAGGAG ATAAATCCAT TTCTTCTGGC GAACAGCTTG GTAAACCCAA ACTGTCAAAA 2830 

GTAATTGGAA ACCAAGCGCC AAGATTCGCT CAAAACCAAG CAAATAAATC TGCCAGACCG 2940 

AAAGTGACTG AATGGTTTTT AACATATTTT CAGACAGTAA TTGCATAACC TGTGGATTC;- "3 COO 

GAGTTTGAAC TGCCGAAAGA ACAATGTAAA GATTGAGTAA ACTAGTAAGG CCTAGAAAAA 3060 

TCAACTCCAA GCCACCATGC CCC 3083 
(2) INFORMATION FOR SEQ ID NO: 139: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15363 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 
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CCGGAGGATA 
TACTGCGACA 
TAAAGGTTTG 
TCTTATACTC 
ACTGTTTTGA 
GCTGACGTGG 
CCATATCATA 
CATTTTGTCG 
AAATCAACTT 
CACGATTGAT 
GAGACTCCTG 
TTTCAATAAT 
CACGGAGACG 
CTTCATGAAG 
AGGCAAGTTT 
AAGCACGATT 
TAAGAATCAA 
AAATGAGATT 
TTCTGGTGTC 
GCATAAGATA 
CTGTTAAAAA 
GTTCTAGATT 
AAAAGGTCAA 
TTCTATAGGC 
GGGCGATAAG 
CTGCTATCAT 
TCACTGTATG 
TCTTGAAGCG 
GAACTTCTTC 



TTGACCACCA CCAAAAGCAG 
CTGGTCAACT CACTATCTGA 
TCTATCAGCT CTTTCCACCA 
AATGAAAATC AAAGAGCAAA 
GGTTGTAGAT AAGACTGACG 
TTTGAAGAGA TTTTCGAAGA 
CGGTAGGGTA TTGGCAGCTT 
TAATTTCTTG GCATACTTAG 
GCGCTCCAGA TAATAGCCTC 
AACCCGTTCG ACAAAGGCAC 
CATAAAACTA ATCAAAGAGC 
CATCTCTGTA TTGGCAAGAT 
GCTTTGCTCC TTGTTCTTCC 
GAGAAAACGT AAAATCCTCA 
ACGATTGATT CCTTGTTCTA 
GCTAATGTTC CCCTCTTCAT 
GAGTTTCAAA GCAGCCCAGT 
TTCAATACGT CCATGATAAT 
TTCCAACTCT TTTTCCAACT 
ATCCTTGCTT TCTTCCTCTT 
GCTAACAAGC GTCACACCTG 
ACTTGGTATC AAGAGAATCG 
GAGAAACATG TCCTTCATAT 
ATAATAGCCA TAGATCATAA 
AGATAGCAAT AAAAGTAGAG 
TTCCAACTCC ATCCCTAAAA 
CCAGACCGTC TCGGTCACCG 
ACTTGCCTTT AAAATTCCAG 
TGCCAGAAAG AAGGTCACTA 



931 

GGGGAAAATC GAAATCAACC 
TGCTTGATAA TAATGCAAAA 
CTTTTTCATG TCATACTCCT 
CTAGAAAGCT AGCCGCAAGC 
AAGTCGATCA CATACATACG 
GTATTAACTA ATTTCTTCTT 
CCTTCAAGGA ATAGTTCTCT 
TCGTAATCAA TCGTTTTTCT 
TCAGCATTTC ATCGATATTG 
CACTGCTGAT AATAGCTGTT 
GTCTGTAGAC TCCCTTCAGG 
AGAGCTCTGC AATTTGGTCA 
AGCTACGGAA GGTCTTTCCG 
AGGAAACAAG AAAATAATAG 
TATTTTTCAG ATAACGTTGG 
AGGCCTGTTC CAAACCATCA 
CTTCTTGATC ATCCTGGTTT 
TGTCAATAGC CGCATAGAGG 
CTAGCGTTAC TTCATTCAAA 
CATCAGAAAG ATGAGGCAAG 
CAACAAGGAA AAGCAAAAGA 
TAGCAATCGA CACCGTTCCC 
ACTTATTTAG CTTTTTCTTG 
TAAAACGAAT GACAAAAAGG 
GATTATAGAT TGGATTGGTC 
TCACAAAGAC AGAACCGTTG 
TATCCACTTG GGCTTCGAGG 
CAACTACGAC GGCAATAATA 
GAGGCAAACT CAATTCTAAT 



AATAGTAGGC 60 

AAGCTTTTAA 120 

TCACTTATAA 180 

TGCTCAAAAC 240 

GTAAGGCGAC 300 

ACCAATTCCA 3 60 

AAGTTATTTA 420 

TCGTATTCGA 480 

TTGGGTTTGA 540 

TCTCGAAGAC 600 

TTTTCCAAAC 660 

TAATCAAGAG 720 

AGAGTAAAAA 780 

GTCAGTCTTG 840 

TAAACTCGGT 900 

CTTTCAATAC 960 

TCTTGGCTTA 1020 

GGAAGTTTAT 1080 

ATGGCGATAT 1140 

ACCAAGAGAC 1200 

GGATACTCCT 1260 

TTAACACCTG 1320 

AGGCGTCGGG 1380 

AC AAAGGTAA 1440 

AAGATAGGTT 1500 

AGCATAAAGG 1560 

AGCGTGATTT 1620 

CCTGAAACAT 1680 

AAAAGTTCAC 1740 
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TGGCAATATC 


CGTTGCGCGC 


ACACTTAGCA 


AGAAGGTATG 


GAGGAAGCGG 


TTGGTCATGG 


1800 


CTGTTAAAAA 


TCCAATTAAA 


AAACCGCCTA 


GGATTGAAAA 


GATGAGCGAA 


CTGCTAGCTT 


1860 


GCCCCAGAGA 


AAAAGCTCCA 


GTTGTCCAAG 


CTGTCAAAGC 


TACCTGAAAA 


GCCACCAAAC 


1920 


CAGAAGCATC 


ATTCAAGAGT 


CCTTCGCCCT 


TAAGAATATT 


GGACACGCGC 


TTAGGAAAGC 


1980 


TAAAACGCTC 


CGAAAGAGAG 


GCAAAGGCCA 


CCAAGTCCGT 


AGGACCAAGG 


GCTGCCCCAA 


2040 


CAGCCAAGCA AGCTGCCAAG 


GGAAGGCTGA 


ACCAAAGAAG 


ATGGGCCAAG 


CCACCCAAAC 


2100 


TCAGGGTCGA 


GATAAAAATC 


ACTGGAAATA 


TGAGATAAAC 


AATGATTCGC 


CAGTGTTTTA 


2160 


AAATAGCCGT 


AACATCTGCT 


TCTTCAGCCT 


CTCGGAAAAG 


CAAGGGTCCG 


ATAACCAGTG 


2220 


CCAAAAACAA 


CTCCGTATTA 


AGGTGAAAGT 


CAGTATTGGG 


TAAAAAGAGA 


CCAATCACAA 


2280 


TTCCCAAAAG 


AATTTGCACC 


AAAGGGAGAG 


GCAAAAAGGG 


CAGGAGCTTA 


TTGGTTGTAC 


2340 


TTGAGACAAT 


CAAAACCAGT 


AAAAATAGGA 


TGAGGTAAAT 


CAGTAATTCC 


ACGCACGTCC 


2400 


TCCTTAATCT 


TTTTTACAAC 


AGGATTCAAA 


TATCTCCTTC 


TGCTCTTTGA 


TTTTTTGGTC 


2460 


AATCTTGGAA 


CAGTCTTTGT 


GCTCAATTTT 


TCTCTGGCAC 


CGTTCCATTT 


CAAGAGCAAC 


2520 


TAATTTTTTC 


TTGATTTTAA 


GCATTTTTTT 


GCTCATATGC 


GCTTGGTCTA 


GCACGCCCAT 


2580 


CGCTCGTTCG 


TGGTGGGTTG 


ATTCAACAAA 


ATTCTGGCGC 


ATGGCATCCA 


GCTTTTCGTG 


2640 


TAAGTATTGT 


TTATCCATGT 


CTGTATCTCT 


CTAATTTTTC 


AATCATCACT 


AAAAACGGCG 


2700 


GGTTGTTGAC 


TTGGTTTAAA 


GTTCGGTAAA 


TGGCAGCTGT 


GTACTCTTGT 


TGGTTCAACT 


2760 


GGATCACAAA 


ATCCAAGACA 


GCATCTCTCT 


CGAGATCGCC 


TCCTTCATGA 


CCATAGTAAA 


2820 


TCATAATAGC 


AATTCGTCCA 


CCTTTGACAA 


GTAAGCCACA 


TAGCTTTTCT 


AATGCCTCAA 


2880 


TCGTTGTCTG 


CGGTCGGGTG 


ATGACAGACT 


TATCAGCTGC 


CGGCAAATAG 


CCCAGATTAA 


2940 


AAATCCCTGC 


CTTAGCTTTT 


ATCACAAACT 


GGTCCAGTGT 


CTCATGGCCT 


TGCAAGATTA 


3000 


ACTGGGCATT 


TGTCAAGTCA 


GCCTGATGCA 


AACGCTCTTG 


GGTCTTTTCC 


AAGGCTTGCT 


3060 


TCTGAATATC 


AAAGGCATAG 


ACTTGCTTGG 




TAAAAAAAfiP 






CATTTCCCAT 


AGTCGCATCC 


ACTACGACAT 


CCTCTTTTGT 


CACGACCTCA 


GCCAAAAAAT 


3180 


CATGTGCCAT 


CTCAAGTGGT 


CTTTTCATTT 


TCAAACTCCT 


GTTTTACAGC 


CTTGCATCCT 


3240 


TGAACACTTC 


CACGACGTCG 


CATCTCCATC 


TCAATGCTGT 


TGAGGACTTC 


CCATTTATTG 


3300 


AGGCTCCACA 


TAGGACCAAG 


CAGCATATCC 


CTAGGCGCAT 


CTCCTGTAAT 


TCGATGGATG 


3360 


ACGATATGTT 


TGGGAATAAT 


TTCCAGTTGG 


TCACAGATGA 


CCCTGACATA 


TTCGTCCTGA 


3420 


CTCATCAATT 


GTAAACGCCC 


CTCATGGTAA 


TCTCGTTGCA 


TACGAGTATT 


TGTCATAAGA 


3480 


TGGAGCAAAT 


GCAGTTTAAT 


CCCTTGAATA 


TCGTTATCCG 


TGACACAACG 


GCGGACATTT 


3540 
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TCAACCATCA 


TCTCATGGGT TTCACCAGGC AAACCATTGA 


TCAAATGGGA 


AACAATCTCA 


3600 


ATTTTTGGAT 


ACTTTCTCAA 


ACGCTTGACC 


GTTTCCACCT 


ACAATTCATA 


AGAATGCGCA 


3660 


CGGTTAATCA 


GGTCAGAGGT 


TGCTTCATAA 


GTAGTTTGCA 


AGCCCAATTC 


AACCGTCACA 


3720 


TGCATGCACT 


CCGATAACTC 


AGCCAAATAT 


TCGATGGTTT 


CGTCTGGTAA 


ACAGTCTGGG 


3780 


CGCGTTCCAA 


TATTGATTCC 


TACCACACCT 


GGCTCATTGA 


TAGCCTGTTC 


ATAACGCTCT 


3840 


CGAATAACTT 


CCACCTTTTC 


ATGGGTGTTG 


GTAAAATTTT 


GAAAATAAAC 


CAGATACTTC 


3900 


CGAACATCCG 


GCCACTTGCG 


GTGCATAAAG 


TCAATTTCCT 


TATAAAATTG 


CTCACGGATA 


3960 


GGCGCATCCG 


GTGCCACAAT 


GGCATCTCCA 


GAACCAGAAA 


CCGTACAAAA 


AGTACAGCCC 


4020 


CCATGAGCCA 


CAGTCCCATC 


ACGATTGGGA 


CAATCAAATC 


CCGCATCAAT 


AGGGACTTTA 


4080 


AAAGTCTTTT 


CTCCAAAGAG 


TTTTCGATAA 


TAATCATTCA 


AGGTATTATA 


AGATTTCATG 


4140 


ACTTTCATTA 


TAACAAAAAT 


CACCCACAAT 


CTCAAAAGCC 


TGACTTTCCT 


ATAAATTCCT 


4200 


CTGTTTCTCG 


TTTCCATTAG 


CCTTTTTTTA 


TGATACAATA 


TGGGTATGAT 


TTTAATGAAA 


4260 


TTAGCATCTA 


TTTTATTATT 


GATACTGACC 


TTAGTCGTCT 


GCATTATCCT 


AACCAAACTT 


4320 


TTTAGATTAA 


AAAAACTAGG 


ACGAAACTTT 


GCGGATTTGG 


CTTTTCCAGT 


CTTGGTATTT 


4380 


GAGTATTACT 


TGATTACAGC 


TAAAACCTTT 


ACCCATAATT 


TCCTCCCTAG 


ACTGGGGCTA 


4440 


GCCCTCTCGA 


TCCTAGCCAT 


TATTCTCGTC 


TTTTTCTTCC 


TTTTGAAAAA 


ACGCAGCTTT 


4500 


TACTACCCTA 


AATTTATCAA 


ATTCTTCTGG 


CGTGCAGGAT 


TCTTATTAAC 


CCTTATCATG 


4560 


TATATAGAAA 


TGATTGTTGA 


ATTGTTCTTA 


ATGAAATAGT 


CGAATCCCTA 


AGCATTTTCT 


4620 


AGGGATTTTT 


GCTTTCTCTA 


CAAAATAGTA 


TAGACAATAA 


CACTATACAA 


TTTTATACAA 


4680 


AGAAAAGAGT 


CTGGGACAAT 


AGTCTCTTAT 


ATCCAAAAAG 


GCAACGGATT 


TGCCGTTGCT 


4740 


TTTTTGGATG 


GTTACGATAG 


TCTTGGTAAA ATAGAATTGC 


CCAATAAACC 


ATTTAGAAAG 


4800 


GCTATCCCAT 


GCATATTCAC 


TATAACACAA 


ATCAAACAAC 


TTTACCACTA 


GAAATCAGTT 


4860 


CCTTCTTACC 


ACAAGATCAT 


CTCGTTTTTA 


CTATTGAAAA 


AGTGGTGAAT 


ACCTTGGAGG 


4920 


AACGTCACTT 


CTACACCTCC 


TATCATGCCT 


TTGATCGCCC 


GTCTTATCAC 


CCTAAAATGC 


4980 


TTGTATCTAC 


TCTTCTATTT 


GCCTATTCAC 


AAGGGATTTT 


CTCTGGTCGA AAAATTGAAA 


5040 


AATGGAAGAG 


TTAGTGACCT 


TAGATTGTTT 


GTTTATTGAC 


AGAACTAAGA 


TTGAAGCCAA 


5100 


TGCCAACAAG 


TATAGTTTTG 


TGTGGAAGAA 


AACGACAGAG 


AAATTCTCCG 


CCAAACTTCA 


5160 


AGAACAGATA 


CAGGTCTATT 


TTCAAGAAGA 


AATCACTCCC 


CTTCTGATTA 


AATATGCCAT 


5220 


GTTTGATAAG 


AAACAAAAGA 


GAGGGTATAA 


AGAGTCAGCT 


AAAAACTTAG 


CGAATTGGCA 


5280 
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CTATAATGAC AAGGAGGATA GCTACACACA TCCTGATGGC TGGTATTATC GTTTTCACCA 5340 

TACCAAATAT CAGAAAACAC AGACAGACTT TCAACAAGAA ATCAAGGTTT ACTACGCCGA 5400 

CGAACCTGAA TCAGCCCCTC AAAAGGGACT GTATATGAAC GAACGCTATC AAAACTTGAA 5460 

AGCTAAAGAA TGTCAGGCGC TTTTATCTCC CCAAGGTAGA CAGATTTTCG CTCAACGCAA 5520 

GATTGATGTG GAACCTGTCT TTGGGCAGAT AAAGGCTTCT TTGGGTTACA AGAGATGTAA 5580 

TCTGAGAGGG AAGCGTCAAG TGAGAATTGA CATGGGATTG GTACTTATGG CCAATAACCT 5640 

CCTAAAATAT AGTAAAATGA AATAAGAACA GGACAAATCG ATAAGGACAA TCAAATCGAT 5700 

TTCTAACAAT GTTTTAGAAG TAAAAGTGTA CTATTCTAGT TTCAATCTAC TATACAATAA 5760 

GAGAATGACT CAAAATTAAA AAGCTAGAGT TCCACAATTG GAAATATCTA GCTTTTTTGT 5820 

GGTTGAGAAC TATTTTGTCT CAGGCTCTTT ATCTTCTATT TAGGACAAGA GTTTTTCTTT 5880 

GGTCTTTAAT GATAAAGAAG GTATCAAAAT TTCTAGTCTT CTTTTTTACC TTTAGTAACT 5940 

ACTAATCCTG CACTCAAACC TAGAAGAGTT AAACCTGCTG CTACTGCTGC TTGGCTTGCC 6000 

GCACTACCTG TACTTGGTAA CTGGGCTTTA TTAGTTTGAC TAGCTTCACT TGAATCAATT 6060 

GGTTTTGTAT CTGCTTTTTC TGACACTTGT GGTTTTTTAG CTTCTTGAGC TACTGGTTTG 6120 

GTTCCAACCA AGACGATGCG GTCTGTCGGA ACTTCTACCA CTTCACGGAG TTTTTCTTCC 6180 

TTACTTCCAT CAGGATTAAT CGCTGTAAAG ATACGTTCTT TTCCAACTTT TCCTTCTTGT 624 0 

TCTACACGAG TTTCACCTAG ATACAGTGTT GAATCTTTTT TCTCAACTGT CTTGTATGCC 6300 

AAATCTTTTT CAACAAATTC GATTTTTGGA AGATCTTCTT GTACAGCAGC AACTGTCTTC 63 60 

TCAGAAACTG GTTTTTCCTT AGTCAAGTGG ATACGGTATT CCTTGACTTG TTTTCCACTT 6420 

TCTGAAACGA GGCGAACAAG TACTGGAAAG CTATCTTCTC CACTATCTAC CACAGTTGAA 6480 

GCTACTTGAT TGTTTTCTTC AACTGAGACT TTTGGCCGTT GACCTTTATA GGTAATTTGA 6540 

TAGTCTTGAC GATTTTCAGC GAAATCAGCA AGTTCTTTTC CATCTACAAG AATCTTTGAT 6600 

TGAGTGCTTT CTTGAGGCAA TTCACTTGGT GCAAGGAAGG TCATCTCAAT CATCGCAACA 6660 

CCGCTCTTAT CTGCTTTACG CTCCATACGC CATCTCATAG CTTTGGCTTT GATAGCTTTA 6720 

AATGTTACGT TGATTTCATC ACCAGCTGCA ATGTCTTTAT CCGCACGATA AGGAACAGCT 6780 

TCCCAATTTT CTGGATTGTT GAATGGATGG TCTGCGTCGT AGGCTTGGTA GTTTGAATAG 6840 

TAGGTTGGCA CTTCAAACTC TGGACCGACA TAGCGTTCTA AAACGAGTTT AGATGGTGCA 6900 

TCCGTACCAC TATCTGCAAA GAACTGAACT TTTCCTTGTG TAACAGTCCG TTCTACAATC 6960 

TTACCATTTT CACGGAAAAT CACACCCGCT GATACTTCTG GATTAGAAGA TGGTGTTGGT 7020 

GACCAGTTTG TCCAACGACG ATTTTCTGAA TGATCTCCGT CATTGAGATA GTCAACGCGG 7080 
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TCATGAGAGT TTTTGTCAAT ATCATTGGTT GCTGAAGCAA AGGCCTGGTT ACTGTTTTCA 7140 

TCATAGTTAG GGTTATCTGA AAGAGTCTCA CCAAGTTTGT CTGTCACTCG TACAGTGATC 7200 

TCAGCAACAA GGTTACTACC AAGGACACGG CCTCGAACAG TAAATTGACC TGCTTTTGTC 7260 

AGATTTTCCG CTGGAACTTC TTCCCATTCA ACTGTCAGGT CTTTTGTTTC GTAGCCGTCT 7320 

TTACCTGTGA AGTAAACTGG AACCTTAGTC GGCAATTCAA GTGCTTGACC TACTTGTAGC 7380 

AAGCGAGCTT GTTTAACCGC AGCAACTGGT TTATGAGAAA GTAAGCTCTT ATCCTTAGTG 7440 

AAGTGCAGAC GGTATTCTCC TAAGATGTCG CCATTTTCAG CTTTCGCGAT GACACGAACT 7500 

GGCTCACCTT CACGAACGCT TGGAACGACG GTAGCGAGAC CATTGTTGCT AACACTTGCT 7560 

GTGACTGCCG GAACTTTTCC ATCTACAGAC TCAAGGTAGT AGTCTGTCAA ATCAGGGTTG 7620 

AAGTTTGCTA AGTCTTTGCC GTCAACTTGG ATTCTTGTTT GTCCTTGCTT GGCTGCCGCA 7680 

ACTTGTTTCG CAAAGATTTG TACCTCTGTG ATAGACGTTC CACGCTTGTT ATCTGCTTTA 7740 

ACCATGCGAA TACGAACAGC ATAGGTTTCA ACTTTATCAA AGCTAAAGTG GTTCATTTCT 7800 

CCAGCCTTGA GTTGAGCAGG GGCTTTTAGA TTAGTAACTG GTTTCCAGTT GGCAGAATCA 7860 

TTAAAGACAT GGTCCTCATT ACCAACAAAA CTAGGGTTTT TAGGAGCTGT TGGGACAGTC 7920 

TTACCAACAT AATACTCAAT CACATAAGAC TTCGGTACAC CAACTCCATG GTCTTCATGG 7980 

AATCCGACAC TTAGATTATC AACGGAGCGT TTGCTCAAGA TACCTGAATC TCCAAACAGA 8040 

ACACCGACTG AAGCTTCTGG ATTAGTACGA TTCCAGTTTG TCCAACGATT GGCTGGTTGG 8100 

TTATTGTAGG AAATGAGCTT GTCATTAACA TTTGAAACTG GGTCGCTTGG ATTTGAGTCT 8160 

GAAGCAAAGG CAAGTGGCAA TTCTGAACCG GTCCATTGGT CAGAAATGTT TGCACCTTGC 8220 

TCAGTTTGAG CAGATACGCG AACATGAAGT TTAGTTGTTA ATTGCGTACC TTCTAAGCGA 8280 

CCATTAACTG TAAAGACACC TTCCTTAGCG TATTGCTCTG GACGAATCGC ATCCCATGCA 8340 

ACCTTAGCTG ATGAAACGTG ACCATTTGAA TCATATGTCC GAACACTTTC TGGTAATTGT 8400 

GGTGCTTCTG CGATTGGAGT TGTCACACTG ACTTCTTCAA CTGAAACGAT ACCTTCTACA 8460 

GAGACTTTTG CACGCGCTTC AAGGTCAATT CCTTCAACTT TACCTAGTAC TTCAAATGTT 8520 

TGATAGGAGT CTAGTTTTTC TTTCGGAATA GCTTGCCAAG TGACTTTATG AGTTTTAGGG 8580 

AAACCTTTGT CATACTCAAC TGTTACTGTT GCTGGAAGAC TTGGTTCCTG ATGCAAATCT 8640 

GTCACTACAT TTACAGGACG GATGGATTGC GCAATCTTCT TCTCAGTATT GGCTTGGATA 8700 

GTGAGTTCAA CTTGGTCTTT AGCTCCCTCA TATTCAGCGT TCAGAGTGAC TGCTCCTGGC 8760 

TTATGCAACT CAAGCATTCC TTTACGAATT GCGACTTCCC CTTCACCACT TGTAGAGAAG 8820 
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GTTACTTTAT 


CAGCTGGTAA 


TACAGCTTGC 


GTTCCATCTT 


GATAGTGAGC 


TCGAACCGAC 


8880 


AATTTGACAG 


TTTGGTCTTC 


TTTGAGACTG 


TCAGCTTTTT 


CCACTTGCAA 


GCTCAAGTGA 


8940 


GCAATTTTTG 


GCGCTTCTTC 


AAGGAATTGA 


ATTGCATAGG 


TTTGAAGAGG 


GCCACCATCT 


9000 


TTAGGCTGAA 


TAAAGATGCT 


CGCACGCATG 


CCGTTTGCTG 


CGCTTGCTTG 


AAGAACTGTA 


9060 


ACAGCTGCAT 


TTTTAGCACT 


TGCTGTGACT 


TCTGGCAACT 


TAGCTCCATA 


AGCAAGAGTG 


9120 


CGGTATTGCA TTGGTTTTTG ACTAGTAAGA CCTGTTACTG 


CCTCACCACC 


AACCGTTACA 


9180 


GTTGGTACTG 


CAGGTGCCGC 


AGGATTGCCT 


TCTTCTACCA 


CAAGGGTTGC 


ATGAATTGGT 


9240 


TGACCTTCTA AATAACCGGT CGCTTGAATA CGAGAACCTG 


GAATTGCTAA 


CTTAGCTTTA 


9300 


TCTTCTTCGG 


CAATCTCCCA 


CTTGTCCACT 


TCATACTCTT 


CAACACTTCC 


ATCAATCAAA 


9360 


ACATAGGAAA 


CAGATTTGTC 


TACAGAATTC 


AAGTCAGTAT 


TTGGAGCAAT 


ACGTTTCACA 


9420 


ACTGGTAGCT 


CTGATTTAAG 


AGCAATCACT 


TCTACACGAG 


CTTCTACTTC 


TCGTCCGTCA 


9480 


GCCATACCTT 


TCACCGTTAC 


AATACCAGGC 


TTGCTCACAT 


CTACTGAAGA 


CCAGGTTACA 


9540 


GGACGTTCTG 


CACGGCTACC 


ATCACTGTAT 


ACAAACGGAA 


CAGTGGTAGG 


CATTTCAGGT 


9600 


GCCTCTCCAA TAATGGTCTG 


TACTTTTGGC 


ACTTCTGTCC 


CCAAAACAGT 


CTTCTCTTGT 


9660 


CCTTCTTTCT 


TACCAGTAAA 


GACAGTGACT 


TGGTTCGATT 


TCAAGAGATC 


AGAGTGGGCA 


9720 


GTCAGGGTGA 


ATTTCCCTGC 


TTGTTCAGTT 


GATTTGACAA 


TGGCAACACC 


TTTACCATTA 


9780 


AATGCTTTAC 


GAATCCAAGA 


ACCATCTGCT 


TGCGCCTTAT 


AGCGTTCACG 


GCTGGCTTGT 


9840 


TCTCCGTTAT 


CTACACCGAC 


CAGTTGACCT 


TGGCCATGCA 


ATTGGAAGCG 


AACCAGATTA 


9900 


TTAGCAGTTG 


GAACCACATT 


CCCCTGGCTG 


TCAACAATTT 


CATAGTAGAT 


GTAAGTCAAG 


9960 


TCTTTTCCAT 


CTGCTGCAAT 


CGCATGGTCT 


TCCTTAATAA 


GACGAACTGC 


CGCTGGCTTA 


10020 


CCAGCAGTCG 


TAATCTTATC 


TCGAGCAATT 


TCCTTGCCAG 


ATTCATCACG 


AGCAATTGCT 


10080 


TCCAAGGTAC 


CTGGTTGATA 


GGCAACTTTC 


CATTCAAGAT 


AAAGTTCATT 


AGCATTTGCA 


10140 


CCTTCTTGGT 


AAGTCCGCCC 


ATCGCTGGTT 


TGTTTTTTAT 








GATTTTCCAT 


TCAAGAACAA 


TTCTACACTA 


GAAGCATTCG 


AATAAGCACG 


AACTGGAATC 


10260 


TTACCTTCTG 


AGTCAGCTAC 


TTTGGATGCT 


AATTCTTTGT 


TTTCCCAGTT 


CCAGTGAGGA 


10320 


AGAAGGTGTA 


CCATCGGTTT 


CTTCTTAACA 


GAAACCCATT 


GGCTTTGGTA 


GAGATAGAAG 


10380 


TCATGTTTTG 


GAATGCCGGC 


TGTATCTACG 


ATACCAAAGT 


AAGAGCTCTT 


AACAGGAGTT 


10440 


TGATTTTGGT 


TGTGCCATGG 


TGTAGGTTCA 


CCAATATAGT 


CCGTACCTGT 


CCAGATAAAC 


10500 


TGTCCAGCAT 


AGCCAGCGTT 


GTCACGGTCA 


AAAGTCCATG 


AAGCGGTTGC 


TGTTTTCCCC 


10560 


CAACCCACAC 


GATCATTTCC 


ATAATCTGAC 


TGTTCATAAT 


TACGCTCAGG 


TCCATTGCTA 


10620 
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TGTTTCAATT 


CACGTTCAGG 


GCGATAGTAA 


CTTCCACGTG 


TACGGGTAGC 


TGAAGATGTT 


10680 


TCTGATCCAT 


AAATCAACCA 


TTTTGGATGC 


TTAGCTCTAA 


GGGCTTTGTA 


ATTATCTTCA 


10740 


GAATAGTTAA ATCCAACAGC ATCGAGTTCA 


TCAGCAATTT 


TCTCATGCCC 


TCCGCTACCA 


10800 




^% jk jk tnj\u n % iris** 

GGAATTTATC 


TGCTCCCATG 


GTAACATAGC 


GAGTCTTATC 


AACATCCTTG 


10860 




t-CAAACGTTT 


AACAGTTGCT 


AAAGAGTGGG 


CATCACCATT 


AGCTTCACCT 


10920 


Ai I 1CATTAC 


CAATTGACCA 


CATGAAGATA 


GCAGGGTTGT 


TTTTGCCTCT 


TTCGACCATG 


10980 


GTACGTAGGT 


CAAAATCAGA 


CCATTTTTCA 


CCTTTTCGAG 


CTTCTGGGTG 


AGTGGCATCT 


11040 


TTTTCAAAGA 


AACGTCCATA 


GTCATAAGGT 


TTCTTGCCAC 


CATACCACGT 


ATCAAAGGCC 


11100 


TCTTCCTGAA 


CGAGTAAACC 


TAGTTCTGCT 


GCGATTTGCA 


AGGTTTGCTC 


ACTAGCAGGG 


11160 


TTGTGGGTTG 


TACGGATGGA 


GTTAACTCCC 


ATCTCCTTCA 


TTTGTTTGAG 


ACGGCGATAT 


11220 


TCTGCTTTAT 


AGTTTTCTTC 


TGCTCCAAGC 


GCCCCATGGT 


CGTGGTGCAA GGATACTCCA 


11280 


TGGAATTTAA 


TACGTTCACC 


ATTCAAAGAG 


AAACCTTCAT 


TTGGAGTCCA 


GTGATAGTAA 


11340 


CGGTAACCAA 


ACAAATCCTT 


CTTAGCATCA 


ACCAATTGAC 


CGTCAOGGTA 


AACACGCGTA 


11400 


ATCAATTCGT 


ACAAGGCAGG 


TTTGTCATTT 


AAAACAGTCC 


AGAGTTTTGG 


TCTTTCAACT 


11460 


TCTAAAATCG 


CATCTAGGCT 


TGTTGATTCA 


TGTGCTTTTA 


AGGTACGACT 


CGCTGTACGA 


11520 


ACTAAGCCTG 


TTACAGCATG 


ACCACCTCGT 


TCAACGATTT 


GATATTCGGC 


TACAAGTTCA 


11580 


TGGTCTTTGT 


CGTCCGTATT 


GACGATTTTG 


CTGGTCACAT 


GAGTTTCAAC 


CTTGCCATGT 


11640 




CAAGTTTTGG 


TGTTAAAATA 


GTTGTCCCAT 


TTTTCTCAAC 


ATGCACCTTA 


11700 


TCTGTCACTT 


GTAAAGTCAC 


ATCACGATAG 


ATACCACTTC 


CTGAATACCA 


ACGGCTACTT 


11760 


GGCTGTTTGT 


TGACTGCATG 


GACAGCAATC 


ACATTCTCAC 


GACCATCTTT 


TTGAAGGTAT 


11820 


TTGGTGATAT 


CATATGAGAA 


CTGGTTATAA 


CCATTTGGAT 


AATGCCCCAC 


TAACTGACCA 


11880 


TTGACATAAA 


CTTGAGAATC 


CATGTAGACG 


CCATCAAAAG 


TAAGGCGAAC 


ATTTTTCTTG 


11940 


AGGTCTTTTT 


CATCTAGTTT 


GAAAGTCTTG 


CGATACCAAG 


CTTCCCCACC 


GTTGAGCTGT 


12000 


CCACCTTCAT 


TTTGTGCAGG 


AGATTCATGA 


TCGAAATCGT 


TAAAGATACT 


CCAGTCATAC 


12060 


GGTAAATCTA 


ATTTTTTCCA 


CGTAGATACG 


TCTGCATCAG 


GTTTAATGGC 


TTCCTTAGAA 


12120 


TTTGCATTGA 


GTTTAAAGTA 


CCAATTTTGA 


TTAAAATCCA 


CTTTCCTGTC 


TTCAATCATT 


12180 


TGATTCACTT 


CTTCATTTGT 


TACAGCTTTA 


GCATCTTCCT 


TGAGCGGTTT 


TTCTTGATTT 


12240 


GAAGCTTGTG 


ATTCTATCCT 


TGGAGCTTTT 


TCTTCCGGTT 


TAGCAGACAC 


TTTTTCCTCT 


12300 


TTTGGAGTTA 


CGGCTTCATC 


TTCTTTCTTC 


TCAGATGCAA 


TAGCCTCAGT 


TGAACTAGGT 


12360 
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TCACTTTGTT 


CTGTCCTTTC 


AACTATATTT 


TTAGTTTCCA 


AAGCTTTATC 


AGCCTTTTCT 


12420 


TCTACTATCA 


TTTTTTCCTC 


TTTAGGTTTC 


TCAGCAGTAT 


GAGTAATAAG 


TGTTTCATCC 


12480 


GCATAAACTA 


CAGATTCTCC 


AGCTATATTT 


CCTCCTAATA 


AAACTGCACA 


AGTCCCAATC 


12540 


ATTACTGAGC 


AAGCTCCCAC 


AGCAAACTTA 


CGAATGCTAT 


AAACTCTTTT 


CCGATTCCAA 


12600 


TGGCCTTTCC 


CCATAAAACC 


CTCCTTATAT 


TATATTTAGT 


GCAGTTAGCT 


ACTACCAAAG 


12660 


CCCAAGTGGT 


ATACATGGTA 


TGACAACCTA 


GTTTCAACAA 


TTTACACTCT 


GCGAAAATCC 


12720 


AATTCAAACT 


TCGTCAGTGT 


CGCCTTGCCG 


TAGATATGAT 


TACTGACTTC 


GTCAGTTTCA 


12780 


TCTACAACCT 


CAAAACCATG 


TTTTGAGCTG 


ACTTCGTCAG 


TTTCATCTAC 


AACCTCAAAA 


12840 


CCATGTTTTG 


AGCTGACTTC 


GTCAGTTTCA 


TCTACAACCT 


CAAAACCATG 


TTTTGAGCTG 


12900 


ACTTCGTCAG 


TCTTATCTAC 


AACCTCAAAA 


CTGTGTTTTG 


AGCAACCTGC 


GGCTAGCTTC 


12960 


CTAGTTTGCT 


CTTTGATTTT 


CATTGAGTTT 


ATATTTTATA 


GGAGCGCATT 


ATTTTGCTTT 


13020 


TGCTGCGTAC 


TCTTCGTTAC 


GTTTGATCAT 


TTGTTTTCTG 


TACCAAGCAA 


AGATACCGAT 


13080 


ATAGAATACA 


AGGAAGACTA 


CTGCACCAAG 


GATTGCTTTG 


ATATCACCAG 


TTGTAGTGTT 


13140 


ACCAATTGTC 


CAACCAAGAA 


GTTTTTCGAT 


TGGTCCTTCA 


AGAGTAGAGT 


GAGTAATCAA 


13200 


TTGAGTTTGG 


CTCACACCTT 


CTGGGAAGGC 


ACCTACACCT 


TTAGCAAGTT 


CTGTTGCAAA 


13260 


TGGTGCAATA 


AGTGTACCTG 


AAAGAAGGAA 


GAGTGGCAAC 


AAGAGTGTTC 


CGAAGATAAT 


13320 


CATACGGAGC 


AATTTACCAC 


GAGTTACAAC 


CAAGAGAGCT 


GGAGTAACAC 


CCATAGCGAT 


13380 


GATACCTGCA 


AGTGGCAAGA 


TACCATTTCC 


AACTTTTGAA 


AGAAGCACTG 


CTTCAATCAA 


13440 


CATGATTGGT 


GCAAGTACGT 


TGGCACAAGC 


CCAGATTTCA 


GCACGACCAG 


CGATGAATGG 


13500 


CCAGTCAAGA 


CCGATATTGA 


ATTTACGTCC 


TTGAAGACGT 


TTAGTAGCAA 


CGTTTGTAAT 


13560 


ACCTTGTGAT 


AGTGGTTCTA 


CGGCTGCGAT 


GAACCATGAA 


CCGATAAGTG 


AGAAGAGTTC 


13620 


CAAAGATACA 


CCGGCAGTCA AACCAAGAGA 


CAACCATCCT 


TTGATAACAA 


GACGCCATTT 


13680 


ATCTGCATCT 


GCAACACCTG 


CAATTGGATG 


TGGAGTTCCC 


ATAATACCGA 




13740 


AAGGATGAAA 


CCGATGAAGA 


ATTTAGATCC 


CCAGAAACCG 


ATTTTCTTGT 


TCAATTTAGC 


13800 


AGCATCAAAG 


TCATATTTAT 


CAAGGCCTGG 


GAAGAATTTT 


TCAAAAATCT 


TATCCAAAAC 


13860 


CATGATAACT 


GGGTTCATCA 


TGTAGTTCAT 


GTGAGTTGAT 


GTCATTGGTG 


ATGAACTTGG 


13920 


GGCGTTAAGA 


AGGTCATCAA 


ATGTAGGTTT 


CATCAAGTCA 


GAGTTGATAA 


TTTTCAACAC 


13980 


ACCGACAAGG 


ACGATAGCTG 


CTGTAGCAAT 


AAAGAGTGAA 


ACCCCTTGAC 


TCACACCATT 


14040 


GTTATCAGCA 


TACCATTTAA TCAAGAGACC 


TGTGATAGAC 


AAGTGCCAGA 


TATCAAAGAT 


14100 


ATCGACATCA 


AGTGTATCTG 


TTTTCTTCAT 


AGCTAGCATC 


ACTATGTTGA 


CAATCAACAT 


14160 



WO 98/18931 



PCT/US97/19588 



939 

GATGAGCAAG AAGTATAGTG TCCAAGCAGA ACCCCAAGTG ATTGTAGCAA GTGGTGCCCA 
ACCAACGTCG GTAATACTCA ATTGGATACC AGTGTTTTCA ACGAATTTTG CTAGTGATGC 
TGAGAAAGCA GTGTTTAGCA TACCGATGAT AGCACCGATA CCTGTAAGAG CGATGGCAAG 
TTTGATACCA CCTTCAAGCG CTTTGGAGAA TTTCACTCCA AAAAGTAAAG CCAATACTGT 
CAAAATGATT AACATGATGA CAGGTCCACC CATTTCTAAG ATGGGATTGA AAACCTTTCC 
GATTAGGTCA AAGATTGCAT CCATAACAGT TCCTCCCTTT TTGATGTTAT ATGAATGTTA 
ACAAATTAGA ATTAGCTTAA TCCGTGTTCT TTAATAG CTG CTTCAATATT GTCAAATACT 
GGAGCGCTCA TTGCTGGGAT ACGGAATAAG ATTGGCCCAG CTTCGATAAC TGGGATACCT 
GGTTCAAAAC CAAGGTCTGT TGCAGCGATT GGTGTAAAGA TATCGTAACC TTTCATAAGG 
TCTTCGTTTA CATCTTTCAC CATGACTGCA TCACAGTGAA CATCATAACC ACGGTTTGAA 
AGTTCTTCTT CTAGAGCACT TTTAATTTGG TGACTTGAGT TAACACCTGC ACCGCAGGCA 
GCAAGAATTT TAATCATTTA GATTTCCTCC GATTTTATTT TTTAATAGAC AAGATTAAGC 
GGTTGCTTCA GCAATGTAAG TATAAAGGGC TTCTGGTTCA GAAATTTTTG ATAGGTCTTC 
AAGATGACCA TTTCCTGTGA AGAAGTCCAT TAACTGAGCA AGAATGTTCG TTTGACTTGA 
ACTTGAATTA TTAATGATAA AGAAGAGTAG GGATACTTCT ACTTCCTTAT CAGGAGCTAT 
CATATTGTGA AAAGTTATTG GTTTTTCTAA TCGAACAACC ACCACTTTCT CAGCTAGATT 
ATGAACAATA TCTGTGTGAG GAATCGCTAC ATTTGGCAAG TCCTTTCCTA GAAATTCCAT 
ATCTAAACCA GTTGGAAATG ACTTTTCACG CGTGATCAAG GCTTCACGAT AAGTTGGAGT 
GACAATTTCT CGTTCTTCCA ATAAAGTTGC AACCTGATCA AAGAGTTGTT CTTGACTATC 
CGCTTCTAAG CAAAACACAA GGTTTTTGTC AAAGAAATAA TCTAATACCA TAAGTTTTTC 
CGG 

(2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28882 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY : linear 



14220 

14280 

14340 

14400 

14460 

14520 

14580 

14640 

14700 

14760 

14820 

14880 

14940 

15000 

15060 

15120 

15180 

15240 

15300 

15360 

15363 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 
TAAGACTATT TAATAGTGGA GTGAAATAGG ATACGAACAA ATTGATTAGG AAAATCAAAT 
GAATTTATAG AAATCTTTTA GCAGTTATGT TATCCTATTC TAGTTTCAAA ACGCTATAGA 



60 



120 
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AGCAGCATTG TGCTAGTCkA 


GATTCAGTTT 


ACTATACTAA 


AACGAGTAGC 


TTGAAATCAA 


180 


AAAACCCACC CTCACAGGCA 


GGTTTTATCT 


GTATTATTCA 


GCTAGATTAT 


GCTTTACCTT 


240 


CTGAACCGAA TACGTCGATA 


CGTTCTTCAA 


CCGATGCTTG 


GATAGCTTTT 


ACACCGTCAG 


300 


CCAAGAATTT ACGTGGGTCG 


AAGAGTTTTT 


TCTTGTCGTA 


TTCTGCTTCG 


TTTGCTTCGT 


360 


AGTCACGAGC AAATTTACGA GTTGCGTTAG 


CGAATGCGAT 


TTGGCATTCT 


GTGTTAACGT 


420 


TAACTTTGGC AACACCAAGT 


TTGATAGCTG 


CTTGGATTTG 


CTCATCAGGA 


ATACCTGATC 


480 


CACCGTGCAA TACGATTGGG AATCCTGGAA GAGCTTCTGT 


CAATTTTTGC 


AAGTGGTCAA 


540 


GGTCAAGACC TTCCCAGTTT 


ACTGGGTAAG 


GACCGTGGAT 


GTTACCGATA 


CCAGCTGCCA 


600 


AGAAGTCGAT ACCAGTTTCA 


ACCATTGCTT 


TAGCGTCTTC 


GATTGGAGCC 


AATTCACCTT 


660 


TACCGATGAT TCCATCTTCT 


TCACCACCGA 


TAGTACCAAC 


TTCAGCTTCT 


ACTGAGATAC 


720 


CTTTAGCGTG TGCTTTTTCA 


ACAACTTCTT 


TAGCCAATTT 


AAGGTTTTCT 


TCAACTGGAA 


780 


GGTGTGAACC GTCAAACATG 


ATTGAAGTAT 


AACCAACTTC 


GATACACTCA 


AGTGCATCTT 


840 


CGTAGTGACC GTGGTCAAGG 


TGGATAGCTA 


CTGGTACAGT 


GATACCCATT 


GATTCAACAA 


900 


GGTTAGCGAT CAAGTTGCGA 


GCAACTTTGT 


AACCACCCAT 


GTATTTAGCA 


GCACCCATTG 


960 


AAGTTTGGAT CAAAACTGGA 


GCTTTTTTAG 


CTTCTGCTGC 


GCGCAAGATA 


GCTTGAGTCC 


1020 


ACTCAAGGTT GTTTGTGTTA 


AATCCACCAA 


CTGCATAACC 


GTTGTCACGG 


GCTGCTTGGA 


1080 


CAAATTTTTC TGCTGAAACG 


ATTGCCATTT 


TATCAGGCCT 


CCTGTATATT 


TTTATGGGTC 


1140 


ATCCCATTTA CATTGTTCAT 


TTTATCACTT 


TTTGCCAAAA 


AAATCTAGTT 


TTTCCCGCAG 


1200 


TTTCGATTGA TTTTCTTCTA 


ACTCCATCTA 


TGTAAACCCT 


TTCTCTCCCT 


AGTCTTGGAC 


1260 


GACTTTTGGA AAATCTATAA 


AGAAGGTTAA 


ACTATTCTCC 


TCCATCTCGA 


AACGATAAGC 


1320 


TAATTTTTCA TGTTCTAATA 


GACTCTTAAC 


CACAAAGAGC 


CCCATACCAG 


ACCCCTTGAC 


1380 


CTTGCGACTG GCATTGTCAG 


AAAAAGACTG 


GGCTAGTTTT 


TCTTGTTCCT 


CTGAGCTACA 


1440 


GCTATTTTCG ATAAAAAGTT 


CTCCTTCTCT 


TTCTCCAATT 


CGAACTAAGC 


CACCTGGAAC 


1500 


AGAGTGCTTA ATGGCATTGC 


TGATGAGATT 


AGAAAGAATC 


AACTTCATAA 


CTGATGGGTT 


1560 


TAGATAAGCC TGCTGATGGG 


TCAAACTATT 


GTCTATCTGG 


AGCTCTCTTT 


CCTTGGCTAG 


1620 


CAAGGCATAA TCTTTGACCA 


GATTTTGCGT 


CATCTGGAGG 


AGGTCAATTG 


TTTCCCTATC 


1680 


ATCTCGCAAT TCCTGCACAG 


AAGAGAGGGA AAGTATCTGC 


AGAACATGGT 


GATTGAGTTC 


1740 


ATCCACAATC CCCAAGGCAA 


CTCCCAGATA 


CTGGTCTCTA 


TCCTTATAAC 


GACCGATATT 


1800 


CTCTCTCATA TTTTCGATTA 


GGATTTTCAA 


ACTAGCCAGC 


GGTGTTTTCA 


ATTCATGAGA 


1860 


AGCTCCTCGT AGGAATTCGA 


CCTTCATCTT 


CTCCAGCTGG 


AGAATGGCTT 


CATTCTTTTC 


1920 
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ATGCAAGTCC 


GCAATAACAG 


TCAAGAGATG 


CTGGTAGAGG 


CTATTGATTT 


GTTCCTTGAG 


1980 


ATTACCTATC 


TCATCCTTAG 


AATCCACGCG 


CAATCGCACT 


TGGGAATCCA 


GGTCCATCAT 


2040 


CCGACGGGTC 


ACCCGCTTGA 


TTTCCAAAAT 


CGGTGCAACA 


ATAGTCCGAG 


CGTAGATGTA 


2100 


GGCCACCAAA AGGGAAATCA GAAAGGAGGC CAGCAAGGTA 


TAGGGAAGAA 


ACTGGAGACT 


2160 


GATTTGCTCC 


GCTTCCTTTT 


GTAAATCCAT 


GGAAGCTAGA 


AACTGGAGAA 


TCATAGTACC 


2220 


ACCGTCTTGC 


GTTTTCACCT 


CGCGCTCCTC 


AATAAAGAGA 


GAGGTTGTCT 


GGCGGTCTGT 


2280 


GTCCAGAGGA 


AGACTGTCCT 


TGACTTCTAA 


CTTGTCCTCG 


GTCATCTCAC 


CTTTGACGGT 


2340 


CCCCTTGATA 


TCACTAGTCT 


GGGAATACAA 


GTCTAACACT 


TGCTCGATAC 


TCTGCCTATC 


2400 


TTTCCCTTCT 


AGGGACTGGG 


CAATGGCTGT 


TGCCTTTTGA 


CCAATGGTTT 


CCTGACGATG 


2460 


ACTCAGATAA GTCGAAGGAA AAAGAAAATA AATAGCTAAA TGAAGGCAGA TAACCAGAAC 


2520 


ACTAAATATC 


GAGAAGGTAT 


AGATAAATAT 


CTTTGCAAAT 


AAACCTGTTC 


GTTTCATTTT 


2580 


CGCTCCAATT 


TATAACCAAC 


ATTGCGCACA 


GTGAGGATAC 


AATCCAAGTC 


TAGCTTTTTC 


2640 


CGCAATTCCT 


TGATATAAAC 


ATCAATAACA 


CGGTCAAAGG 


GAACCTCATC 


TGTCGCTTTC 


2700 


CAGACGGCAT 


CGATAATCTG 


AGATCGAGTC 


AAGGCCCGGC 


CTTCATTTTT 


CACTAGATAG 


2760 


TCCAGAATTT 


CCAACTCTTT 


GGCATTGATA 


GGCACTTCTT 


GACCTGCGAG 


GCTTGCACTG 


2820 


TAGCTTTCAA 


AGTCCACCTT 


GGTATCCTTG 


TAAGAAAAGA 


TTCGTCCTGT 


ATCGTAGTAG 


2880 


CGCTTGAAAA 


TCGCGTCCAC 


CCTCACTTTT 


AAAAGGGAGA 


GGGAGAAAGG 


TTTTTCCAGA 


2940 


TAGCCATCTG 


CCAAAGAGGC 


AAAGGCACTC 


ATCTTGTATT 


CCTCATCTTG 


AAAAGCTGTC 


3000 


AACATCAAGA 


CAGGAACCTG 


ACTGGTTTTA 


CGAATCTCAG 


CTAGGACTTC 


TAAGCCGTTG 


3060 


AGCTTGGGCA 


TCTGGATATC 


CAGTAAAACC 


AGGGCCACCT 


CATAGCTAGA 


AAATTGCTCC 


3120 


AGAGCTTCCT 


GACCGTCCGC 


TGCCTCAATA 


GTTTCATAGC 


CACAATCCGT 


CAAATAATCA 


3180 


CTGACCCCCT 


CACGGATCAT 


CTCTTCATCT 


TCTACAATTA 


AAATTTTCAT 


ACTTTAACTG 


3240 


CTCTCTATTT 


TTTATTTTTC 


TTAGAATAAA 


TACCTACCCT 


ATTTTCTATT 


ATAGTCTCTT 


3300 


GCTGGCCTTT 


TGTCTGCAAG 


CAACTGACCA 


CTAGATAAAA 


CGTTGTGAAA 


TTCCTTTCTC 


3360 


ATAAATTCCA 


TAACTTTAGT 


ATATTATATT 


TAAGCACTAA 


AGTACAAAGA 


AAGCAACTGA 


3420 


AAGCAATGAT 


TTTCACCACT 


GCTTTCGGAT 


TTATTTTGAA 


TTGTTAAATA 


GCCATTCCTA 


3480 


TCCACTATTC 


TTGAATAGAA 


ACACAAGATG 


CAATCTTTAT 


TCTAGACTCA 


TTTTTTCAAA 


3540 


TTTATTCACC 


ATCCAGCAAG 


AGCTCTTTTG 


GTTGTTTTCT 


AAGGAGATTG 


CTTGAAGCAA 


3600 


GCGCCATAAC 


GAGAACCACT 


AGAACCAAGG 


CAAGGACAAA 


AATGATGATA 


AAGTCTGATG 


3660 
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TCTGAATGGA 


AATGTCTAGG 


CTCGACAAGG TCTTGCTAAA 


GCCATCTACT 


TCTGCACCAC 


3720 


CACCAAGGTT 


AGAGGCTTGA 


GCCGCCTTAC 


TAGCCTGTTT 


GGCAACACCT 


GAAGTCACAT 


3780 


TGGCAAGGAC 


AGTGTTTCCA 


ATTGCACGGG 


CAGTGTAATT 


AGCTAGGAAG 


TAAGCAGAAA 


3840 


CTAGAGCAGG 


GATAGCAATC 


AAGATAGATT 


CGGTGATGAA 


TTGACCCAAG 


ATACTTGCCT 


3900 


GCTTGAGGCC 


GATAGAGAGG 


AGAATTCCCA 


CTTCCTTGCG 


ACGGGCGTTG 


ATCCAAAGGC 


3960 


TGAGCAAGAG 


GGCAAGGAGG 


AGAACTGAGA AGCTCAAGCT 


ACCCCAGAAG 


AGGAGGTTGG 


4020 


CCATCTTGTA 


CATACCAGAG 


ATAGATTGCT 


CAAGAGCTGG 


GTAGTTAGAG 


GAGCTCTTGA 


4080 


CGAGTGTGTA 


GCTCTTCCAG 


TTGATACCAC 


TGATGCCATT 


CAACTCTTTC 


AT AAP ATP AT 


a 1 in 


CCAAGTTCTT 


GTCTGCTGTT 


ACAAAGAAGG 


TTGCGTCCCC 


ATAAATGGCT 


GTGT PTT C TVl 




TGTATCCATA 


AAGTTTTGCA GCAGTGTGAA 


TGTCTGTAAT 


AGCTGTGTTT 






CTTGTGAGTA 


GGTTACTGCT 


GACTTATTAT 


GACCATCAAA 


GAGTfVPTTG 




A 1 0 ft 


CAACTGTTTC 


CTTGGCTCCT 


TTTTCATTAT 


CTGCATCGTA 






/i 1 Q A 


CCTTGTCCCC 


TACTTTCCAG 


CCGTGTTTGG 


CTGCCAAGTC 


CTTGTGCAAG 


AGGATTTTAT 


4440 


CCTTGTCGTC 


GTTGGTTAAG 


TGCTCTCCTT 


CGACTAGTTT 


ATAAGAACCA 


G AG AC AAA PT 


4500 


TGTCTTCTTT 


AGAGGAGTCA 


TTGACACCTG 


TAATCATCAA 




A A&C'Cir*T kr vrzn 

/UV\\.v>\. X X v»0 




CACGATCAGC 


AGTGAGATTC 


TTCTTGGTTT 


CTGGCGTTTC 


AATCAGGTCA 


TATCCAGTCA 


4620 


AATCTCCGAT 


AGCGTTGATA 


CGTTTGACAT 


AAGACTCAAT 


GGCCTTGTTT 


TCGGTGATTT 


4680 


TTTTGATGTC 


TTCACCCTTG 


ATATTCCCAG 


CACCACGAGG 


CGTTCCTTGG 


TTGACGCGAC 


4740 


GATTGATTTG 


CATGGAGAAG 


CTATTGGTGA 


TATTTTTAAA 


GGTCTCCTGA 


GAAGCCTTGG 


4800 


CAGTAGCTCC 


CTTGATTGAC 


AAGCCGACCA 


AACTCAAGCT 


CGCCATGAGG 


AGAATAATCA 


4860 


GGAAGATGAC 


AATCGATTTG 


AAAAACTTCC 


TTGTAACATA 


GGCAAATGCG 


TTGTGTAACA 


4920 


TAGATTCCCT 


TTCTAGATTT 


TGTTTTAATC 


ATTCTATTAA 


AATAAGCTCA 


AATTATTTAC 


4980 


TAGTATTGCG 


CGTTTCAGTC 


AGTTTCTTAT 


CCTTTAATTC 


AAGTGTAATA 


TCTGACGCTT 


5040 


GTGCCACTTC 


TTTACTGTGA 


GTTACGACAA 


TCACACATTT 


ACCTGTTTTC 


TGGGCAAGTG 


5100 


ATTTGAGTAG 


TTCGACAATA TCTCCAGCAG 


TTTTAGGATC 


CAGATTTCCT 


GTTGGCTCAT 


5160 


CAGCTAGAAT 


AACTGGAGCT 


TCTGAGACCA 


AACTGCGAGC 


AATGGCAACA 


CGTTGCTGTT 


5220 


GACCACCTGA 


TAACTGGAGA 


ACATTCCGCT 


TGATCTGGCT 


TTCATCCAAA 


CCAAGCTCAA 


5280 


GAAGTGTATT 


CTTGCTTGCC 


TTTTTGTTGA 


CCAATCGGAT 


ATTTTCCAGC 


GGAGAAAGAT 


5340 


AATCTATCAA 


GTTATAATTT 


TGAAAGACCA 


GGGAAATATG 


GTGCATGCGA 


TGGTAAGAAT 


5400 


AGCCCTTCTT 


ACGAATATCC 


TCTCCTTGAA 


AAAGGATAGA 


ACCTTCAACA 


GGACTATCTA 


5460 
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GACCAGCAAG 


TAGGGACAAG 


AGTGTGGATT 


TTCCTGCTCC 


TGACTCCCCA 


ATAATACTGT 


5520 


AAAATTTTCC 


GGGTTCAAAA 


TTATAATTGA 


TCTGATATAG 


GACTGCTTCA 


GCAGTATTCT 


5580 


TATAACGGTA 


GGTAACATCT 


TGTAATTGTA 


ATAAAGTCAT 


GATTTCTCCT 


TCTTAACTAA 


5640 


TAGATGATAA 


AATTTCTTTC 


GGTGATTTTC 


TAAATAAGAA 


TAGGAAACAA 


AGGGCTACAG 


5700 


ATAAGCAACT 


AAGCAGAACT 


AGAAAAACAT 


AGGATTCTGC 


AAAAGATAAG 


ATGCTAGTTG 


5760 


. ATAAACTGCT 


TGCTTTGGCT 


AGTGTATCTT 


GTAAGCTTGC 


CTGATCTCCA 


CTTGCTAGTA 


5820 


GAGTTTGGAG 


TAGGTAAGTT 


GTGATTGCGT 


TTCCTGCAAC 


AAATGCTGGA 


AGCAAAGCTC 


5880 


CAAGAGATAC 


CAAAACTACC 


TCTAAACAGA 


ATTGTAGGAA 


GATCGAGCTC 


TTGCCTTTTC 


5940 


CAAGTGCAAG 


TAAAATCCCC 


ACTTCATAGA 


CCCGTTCTCT 


CAACCAGAGA 


GACAAAACCA 


6000 


GAATTAAGGC 


TCCAGCTCCT 


GCTATCAACA 


TCCCATAAAG 


GAAGATGGTC 


AGGAAGGTTT 


6060 


GGAAAGTTGC 


AACTGAGTCT 


TTGATTTGTT 


CAAAAGCCTT 


GTTTTCCTTT 


TCGACTTGGT 


6120 


AGCCTTGATT 


TTCCAAGGCC 


AAGTTTTCTA 


CCTGCTTCAT 


GAGTCCGTCC 


ATTTCCTTAG 


6180 


GATTTTCTAC 


ATAGAAGCGT 


GCTGCACTGA 


CTTGAGCTTC 


ACTATTGCCC 


AAAAGGGTTT 


6240 


GGCTACTTTC 


ATAGTCTGTA 


AAGACTTGAT 


TTTCACTGAA 


GTCAGAAGAC 


AAGCCTGTGA 


6300 


ATTTCTCTTG 


TTTTTTACCA 


GAAAAGATGC 


CGATAATCTC 


AAACTCTACT 


GTTTGTCCTT 


6360 


TTCCAGATTC 


AGACTGACCA 


GCATCCAAGC 


CAATCTTGTC 


ATGAAGCGAA 


AGACCGTTCT 


6420 


TCTTAGCCAA 


TTCTTCGTGG 


ATAAGGATTT 


TCTTGGAATC 


CCCTTTTTGA 


AGGTGTCGCC 


6480 


CTTCTTTTAG 


ATTGAAAGCC 


GAACTGGTAA 


AGGTTACATC 


CTTGGATGAA 


TCCTCAAGAG 


6540 


CCGTTAAGCT 


AACCAAGTTA 


TTGTCTGCAG 


CTGATAAATC 


ATCACGCTCC 


ACGCTCTGCT 


6600 


CGCCAGTCAC 


TGCTTCCTTG 


TCTTTTAGTT 


TTGCGACCGT 


CTCAAGTTCA 


GGAGAGACAT 


6660 


TTTCCAGCCC 


CTTAATCTTG 


CTTACAGATG 


CTAGGTCTGA 


CAACTTGAAT 


GTCTGACCAT 


6720 


TCTCTATCTT 


CTTAATAGAA 


AAAGATGTAT 


TGAGTGATTT 


ATAAAGATTG 


CTTTCTACTG 


6780 


TTTTGTTGGA 


CTTCATCAGA 


GTCAAACAGG 


CTGAAATTCC 


GGCCAATAAG 


ACCAATAAAA 


6840 


TCAGAAATAA 


AATAAAACTT 


CTCAGTCGCT 


TTCTGCTGAC 


ATAAGCCCAA 


GATCTTTGGA 


6900 


TTGGATTCAT 


TTGTCACCTC 


CATATTTGTA AGACTATTAT 


AAAACCCAAA 


TATGAAATAT 


6960 


TTATGAAATA 


CGAAAAAAAA 


ATATCGAGTA 


GGGGATAATC 


TCTAGCCCCT 


CTCACACCAC 


7020 


CATACGTGCC GTTCGGCATA CGGCGGTTCA ACTAACTTTT 


AACGCATGTC 


GTTCAAGGTA 


7080 


ATAATCCAAA CACGAAACCA GTCCACGTTT 


TTCAAGGACT 


GGTTTTGATA 


TAGCACGTTT 


7140 


AAGTACCGAC 


TTCTGAGCTA 


CTATAGTAGA 


TTGAAACTAG 


AATAGTACAC 


CTCTACTTCT 


7200 
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AAAATATTGT TAGAAATCGA TTTGACTGTC CTGAACAATT 


CGTCCTATTC 


TTATTTCATT 


7260 


TTACTATAAT TGATAGTGGT CGCCCCAGCC AGATACCTTA 


TCTGCTATCC 


ATTTAGGAAC 


7320 


CCCTAACTTA AGCAATCCCC ATAATCGTCT CGATTTCTTC 


TTCCATTGCT 


TCCAGATAAT 


7380 


CACTCGTAGG CGAGTACGCA AGCGCTCATC TATGCTAGTG 


ACTATACTTT 


TCATATTTAT 


7440 


AATTCATTCC TTTCGTTTCA CTCAAGGCAC AACACAGAAT 


GAAAAAGTGT 


TGTGATCTTT 


7500 


ATTTTGTTTT ATAATAATAG TGAGAAAACC TATCACTACT 


ACAAATCACG 


GGGAGGTGAA 


7560 


TAAGTGAGTG GTACAGCCAC TACCTCGCAT ATTTTGTCAC 


ATCATTTAAC 


GGTAf*ATAAT 
VJVa 1 nun X nn 1 




AAGTTGTACC ATCTGAATAA GTTGCTACAA TATCATTTGC 


ATGCTCTCCT 


1 LAI. 1, 111 AO 


/ ooU 


CAAAGGTTGG AGCTCCTGCT GGATGATTTT TATTTGCCTC TTTCAATTTT 


1 Uift 1 An 1 ulj 


HI A ft 

7740 


CATTTTTTCT GTATCTTTTA TATTATCAGG ATTTTTCACT 


AAGATTTTGT 


CVff* ft I'll TV T> 
1 \?L»A 1 A l\j 1 


7800 


CGGTTTAGCA GAAACAATTT TTACTGTTAC TTCTTTTTTA 


TTCGAAGCAC 


TTGTCCAGTT 


7860 


TCCAGCATTA TCTTTAGCAT TTAATTTTAC AGTAATTCCT 


GAACTAGGAA 




7920 


AGGTTGATTA TCAACATTAT TCAACTTTAA TTTCAAAAGA 


GCTGTTGCAT 


LnvanLo 1 111 


noon 


ATCAATCGTT ATATATAATG ATGAATTGTT ATTATAAACA 


GTTCCTTCAT 


nHl AuL 1 Vi 1 


D H/1 A 

oU4U 


TTGTGAGCTA CTTGAAACAG AACTGAAATT ATACCCACTA 


CCTCCCTGAT 


1 A 1 V- 1 1 trtrt 1 


pi nn 


GCTTACGTCT AAATGAACTT CCCCACTATT ATTTGGCTTA 


GCAACAACTG 


1 1 A 1 nu 1 AAA 


Q 1 C ft 


ATAACATAAA ATTTGCATAA ATAGATTAGG GAAATCAAAG 


CAGCTTCTAG 


GAATGTTTTA 




GCAGTCACAG TGTACTTTCC CAGCATCAAG CCACTATAAC 


TCTGCACATA 


AAAATfTGAnA 


OaDU 


AGATGGCAAT CCTCTTCTCC AAATATTAAC TTCTTTACAA 


ACCAACTATA 


GTTGACAAAG 


8340 


AACCTAAAAT CAATTGATAA CACAAGGTCA GGTCGGTCAA 


CTCTTTCAAC 


TGAAGCCCTG 


8400 


TCAACTCTTC CCATTTATCA ATCTTGTATT GGAGAGAATT 


GCGGTGCAGA 


TAGAGTTGCT 


8460 


GGGCTGTTTT AGTGAGAACA GCACTATTTT CCCAAAGAGA 


GAGAATGATT 


TCCTGAATCT 


8520 


GATCTTGATC CAAAATCATC TGGTGTAGAC ATTCCTTGAT 


TGGCTTCAAG 


TCCACGAGTC 


8580 


TTTCTCCCAT ACTCCAAAGA TAGAGCTGAG AAAAAGTATG 


AACACCTTGG 


TGACCCTGAC 


8640 


GCCACCATGT CTTGAACAAA TCCCGCTCAG CTTTGATTAA 


GTCTGATAGG 


GCTTGATGTC 


8700 


CCGTCTGAGA CCAAACCTGA CCCAACATGA TAGAAAGACG AAGTCCAAAG TCATACTCAA 


8760 


CCGCTTCAAT CGTATCACTT AAAATATCTC TTACAGAAGT 


GTATTTGTCT 


TGTTGAAGCA 


8820 


CGAAAACATA ATCCTGAGCT CCGACCTGTA GCACTGTCTG 


ACAATTCGGA 


AAAAGAGTCC 


8880 


GCATCATATC TAGCCAAGAA GCCAGATTTT CCTGCTGAAA 


ATAAGAAAGA 


TGGCAATAAA 


8940 


CCAACTGAAT CTTTTTAAAA ACTTGCGGTG CCTGTCCCTT 


GCCCTCAACC 


AGATAGGAAT 


9000 
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ACCAAGGGTT 


TAGCGAACGA 


GCCTGCTCCT 


GCTGGGTCAA 


AAGGGCAACC 


AACTGCTTTT 


9060 


CACGCTCGCT 


GAGCCCAGCT 


TCCTCCAGCA 


AAATCCACTG 


CTGAGAAGCT 


AAAGGGAGCG 


9120 


TGAGATAGCC 


CTCTTTCTCT 


ACTGGTTGGT 


CTGAAATCCG 


AGCCTCAGGA 


AACCAGTCTT 


9180 


GTAGTTCTTT 


TGCCCTCATG 


TTCTAGCCCT 


CCACTTTTTG 


GATGCACCAT 


GAAACCAAAC 


9240 


TCTCAAGACG 


TTCCAGATTC 


TCAGTCATAT 


GGAGATAGCC 


CATAACCGCT 


TCAAATCCCG 


9300 


. TGGACATACG 


ATAAGTCACG 


ACATCTGCAT 


TTTTAGCCTT 


TGTGTGGCTA 


TTGGTATTGC 


9360 


GGCCACGTTT 


GTAGATTTCT 


TCTTCTTTTT 


CCGTTAGGAC 


CTGCTCCTCC 


AACATGAGAG 


9420 


CAATCAGGCG 


AGCCTGAGCC 


TTGGCTGACA 


CGTACTTAGT 


TGCTTCTTGA 


TGGAGTTTAT 


9480 


TGGGTTTGGT 


CATACCTTTG 


AGGATGAGGT 


GACGGCGAAT 


ATACATAGAA 


TACACCGCAT 


9540 


CCCCCTCAAA 


GGCTAGCGCA 


ATCCCGTTAA 


TGAGATTGAC 


ATCAATCACG 


TGTCCACCTC 


9600 


ACTCCATCCT 


TGGTATCAAG 


GAGCTTAATT 


CCTTGAGTAA 


CCAATTGGTC 


ACGGATTTGG 


9660 


TCTGCTGTCG 


CAAAGTCACG 


ATTGGCACGC 


GCCTCTTGGC 


GTTTTTGAAT 


CAAGTCTTCA 


9720 


ATCTCTGCAT 


CCAAAACTTC 


CTCAACAAAG 


ACAATTCCAA 


AAATTTCTAA 


CATATCTGCA 


9780 


AGAGCTTGCT 


TGACACTTGC 


ATCATAGTTC 


CCTGAGTTGA 


TCCATTTGGC 


CATTTCAAAG 


9840 


ACAACTGTGA 


TACCGTTGGC 


AGCATTAAAA 


TCTTCATCCA 


TAGCTGCTAC 


AAACTTATCT 


9900 


TTAAAGTTTT 


GTAACTCTTG 


GGCATCCACA 


TTTCCTGTAA 


ATGGTTGTTC 


GTAAGTATTC 


9960 


TTGAGATACT 


TGAGATTGGT 


CTCGGCATCG 


CGAACTGCCT 


TTTCCGTGAA 


GTTGATAGGC 


10020 


TTACGGTAGT 


GCTGGGTCGC 


AAAGAAGAAA 


CGAAGTACTT 


GCCCATCAAG 


AGTTTTAAGG 


10080 


GCATCGTGTA 


CCGTAATGAA 


GTTACCCAAG 


GACTTAGACA 


TTTTGACATT 


GTCGATATTG 


10140 


ACAAAGCCAT 


TGTGCATCCA 


GTAGTTAGCA 


AAAGCCTTGC 


CTGTTTTAGC 


TTCAGACTGG 


10200 


GCAATTTCAT 


TGGTGTGGTG 


TGGAAACTCT 


AGGTCAGCTC 


CACCACCGTG 


GATATCAATG 


10260 


GTATCACCTA AAATCTCTGT 


CGACATGACT 


GAACACTCAA 


TATGCCAACC 


CGGACGTCCA 


10320 


GGTCCCCAAG GACTATCCCA AGAAATCTCA CCTGGTTTGG AAGATTTCCA TAGAGCAAAG 


10380 


TCTACAGGAT 


TTTCCTTACG 


AGCCGTTTCT 


TCATCGGTAC 


GACCTGAAGC 


ACCTAGCTCC 


10440 


AAATCTTCCA AGGTTTTATT 


AGCCAATTTA 


GCATAGTTGT 


GGGATTTTTC 


TACACGGAAA 


10500 


TAGACATCCC 


CTTGACTCTC 


ATAGGCAAAG 


CCTTTCTCGA 


TCAAGTCTTC 


CACAAAACGG 


10560 


ATGATGTCTG 


CCATAAACTC 


CACTACACGC 


GGATGGCGAG 


TCGCAGGTTT 


CACGCCCAAT 


10620 


GCCGTCACAT 


CCTCACGAAA 


GGCAGCGATG 


TACTTATCCG 


CAACCTCCTG 


AGGCGTGATA 


10680 


CCTTCTTCCC 


TGGCACGGTT 


GATAATCTTA TCATCCACAT 


CTGTAAAATT 


GGAAATATAG 


10740 
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GCAACCTTAT 


ACCCACGGTA 


CTCAAAATAG 


CGACGAATCG 


TATCAAAAGC 


TACCGTCGAA 


10800 


CGGGCGTTTC 


CTACGTGGAT 


ATAGTTGTAC 


ACCGTTGGCC 


CACAAACATA 


CATCTTGATC 


10860 


TTGCCGTCCT 


CAATCGGGAC 


AAATTCTCGC 


AAATCACGAG 


ACATGGTGTC 


ATAGATTTTA 


10920 


ATCATAAATC 


ATAATCAGGA 


AAGCTGAAAT 


CCAAGAACAA 


TTAGTTTCAT 


CACTAAAAGT 


10980 


TCAAGTAAAT 


TTCAGTCCGA 


ATATCTCTAC 


ACTTCGGAAT 


CCCTTGCTCC 


TTTCTCATTC 


11040 


AGATAAACCA 


CCTGAGTCTG 


TTTGACAAAG 


CCAATTTTTT 


CATACAAACG 


TTTGGCACCT 


11100 


ACATTGCTAT 


CTTCCACTGC 


AATCTGAAAT 


TCCTTGTCAT 


TTTGCTCAAT 


TAGTTGGTTG 


11160 


ACGAGGGATT 


TTGCTAAGTA 


GCTTCCATAG 


CCTTTTCCAC 


GTTCAGGTTC 


CAATATTGCT 


11220 


AAACCGTAGA 


GGTAATTCGT 


ATTAGTCGAT 


AAATCAACCG 


TACAAGTTCC 


AATAACCTGA 


11280 


CCAGCTTTTA 


ATAAAATATA 


TAGTCGGCTT 


TCTGGATCTT 


TCAGAGCTTC 


AGCGACATAT 


11340 


CTATCCACAA 


CTTCTCTCGA 


TTCATGTTCC 


TCTGAAAATG 


CCTGAAATTT 


TAATTGACTA 


11400 


ATTTGATCCT 


GATACGAACT 


ATCTGCTAAC 


AAAACTTCAA 


GATGGGAAAC 


ATTTGCTAAC 


11460 


GGATAAGGTC 


TTCTATCCTT ACCTAACCAA 


GTTTCTGTCT 


CTTCATCCTC 


GATTAGTCCC 


11520 


CAGTTACTGG 


CAAAGTCAGG 


ATGATTCTCT 


AAAAAAATAC 


GTTCTGTCTG 


AAAAGTGACT 


11580 


GACCGAATGG 


GGAAAGAAGC 


TGTTTCTCTC 


TCAAAACTAG 


TAAACAATGC 


ACGCGCAATC 


11640 


CCCTGACGGC 


GATGACCTGG 


ATGAACCAGT 


ATCGTCACTT 


CTACATCTTG 


GTCATCTGCA 


11700 


TAGACAGTTA 


ATAAACCAAC 


AAGTTCGCCT 


TTTTCATAAT 


AAAGGAAAAA 


GGCGGGCATG 


11760 


TTTGGGTCAA 


AATTAAGCAT 


GTTAGAGAGA 


TAGGGATCGC 


GATAGGTACC 


GTCATAGTTT 


11820 


TGGCAACAGT 


TAATTACTTT 


TTTCGCCTCA 


GATAGCTCCT 


CTTGGCTTAA 


CTTGTTTCTT 


11880 


GCTTGAATCA 


TATAGGTATC 


CTCTACAAAC 


CAGACGATCT 


GTGACTGGCA 


TCTTTAGCCT 


11940 


GCTCGAGTTT 


ATTGACATAA 


TACTCTCGTT 


TTTCTTCGAC 


TTCGTGAATG 


ACAGGCTCAT 


12000 


CTTTCTTACC 


ATGAAGACGG 


ACAATCTTGG 


CCGGAATACC 


GACAACCGTC 


ACGTCACTAG 


12060 


GTACATCTGC 


TACGACAACT 


GCTGCAGCAC 


CGACCTTGGC 






12120 


GCCCGATAAC 


TTGGGCATGG 


GCTGATATGA 


GGGCTCCCTT 


TCGTACAGTC 


GGATGGCGTT 


12180 


TGCCACAGTC 


TTTCCCTGTT 


CCCCCGAGAG 


TCACTCCGTG 


ATAGAGAAGA 


ACGCCTTTTT 


12240 


CAACAATCGC 


TGTCTCTCCA 


ATCACCAGAC 


CAGAACCATG 


GTCAATAAAA 


ACACCTGAAT 


12300 


CAATCTGGGC 


TCCTGGATGA 


ATCTCAATCT 


GAGTCCAAAA 


GCGCCAAAAC 


TGACTGTACA 


12360 


TACGAGCTAA 


TAGTTTGAAG 


CCGTGCTTCC 


AGAGAAAATG 


CGAGAGACGG 


TGGGCCGCCA 


12420 


AGGCCTTGAC 


ACCTGGATAA GTCAGCAAAA 


CCTCCAAAGT 


GGTGCGGGCC 


GCTGGATCAT 


12480 


TTTCTTTTAC 


AATATCAATG 


GTTTCGCGCC 


ACCACCCCAT 


ACATTTCTCC 


TTTTCTTATT 


12540 
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CTGAATCTTT 
GGCGGTGAGG 
GTAGAAGAGC 
CATCAACTTC 
TCTCAGAGAT 
TCTCGATACG 
CAGCAATAAT 
TTCCTTCTTC 
CTCCACCCTT 
TCGGAGCAGT 
GGATTTCAAA 
TCCCTTGAAT 
TGAAGTCCAT 
TATTTCCATC 
CACCAGCCAT 
ATTCCAAAAC 
CTTGAGCAAG 
CGTAACGACC 
TCTTGTACTC 
AGACTGAAAG 
GGAAGTCAAC 
CCTTGTCTTC 
CCACATCACG 
CAGTCACTTG 
GAACTGCCTT 
CCACGTGAAG 
GGAAGGCAAT 
CTTCTGACAA 
CTGTCAATTC 



TGATGTTTCT 
GCGCTCAGAC 
CTTCATAGAG 
ATCCCCGATT 
ATGAACAAGG 
AACGACTTTA 
TTCTTTGGCA 
GTCTATATCA 
ACCGATGACA 
TGGAGCCAAT 
ACGCGCTTTC 
CTTGATATCC 
ATCTCCAAAG 
TGAGATAAGC 
AAGGGCAAGA 
TTCTGCTACT 
AGCACGCTCA 
TGTTTCCCCT 
TGGATCCAAA 
AGCTTGAGTT 
AACCGCATCC 
TGTAATTAAA 
CATAATACGG 
GTCTTTCACT 
TTGGAGGTCA 
CAATTCCACT 
CAATTCTTTG 
TTCTTTGGCA 
AAGAAGAGAT 



GTAAATTCTT 
TTTTCACCTT 
GCATCGATAC 
TCTACCAAAT 
GCATCTGTCT 
GCACGGTAAA 
CGGTTAATAG 
ATCTTAACAC 
ATCTTAATCT 
TCTGGACGAA 
TTGGCTTGAG 
ATTTGAAGGG 
TGATCTTCCA 
CCCATAGCAA 
GTTCCCGCAC 
AGACGGATAG 
CCAAGGGCAC 
ACAGAATATT 
CCATCAATGA 
TGCCCACGAG 
AAAGGACGGA 
CGTCGCACTT 
TCAAATTCTT 
ACTTGAGTCG 
CTGTTGTAGG 

TCTGCTTTTT 
ACAGCTTCGT 
CCAGACTCTA 
TGCTCTGCTT 



947 
TCTTAGGTTT 
TTTCATCATG 
GGCCTTTTTC 
CCTCTACACG 
TATCAAAGAG 
CTTCATCCAC 
CATCTTGGTC 
CTGTTTCAGC 
TGTCCACATC 
CTTCTGGAAT 
CAAGAGCCTC 
CTGTAATCCC 
AACCTTGGAT 
TACCAGCTAC 
AGATAGAAGC 
CGTAGGGGAA 
CGTGACCGAT 
GAGGGAAGTT 
TTTGAGTTTC 
TAAAGAGACC 
TTTCATCGAC 
CTGCGTGTTC 
CGTGGTCCGC 
CAGCTTCACG 
CTGCAATGAT 
CTTTACCGAC 
GCCCTTTAAG 
CCATGTTGAT 
GTTCTTGACT 



GTAATCCTTT 
CTCAGGTTTT 
ATCAATTTTG 
ATTGGTACGA 
GTTAACAAAG 
TTTGGCTTCA 
ACTAGAGTAG 
GATAATCTTG 
AATCTTGATC 
GGTTGCTTCA 
CGTCAAGATT 
ATCACGAGTA 
ATCTGTCAAT 
TGGCGCCTTG 
TTGAGATGAA 
TTCTTCCAAG 
TTCACGACGA 
ATAGTGGTGC 
TCCCATCGGA 
TGAACCATGT 
CTTACGACCA 
CATTTGTTCC 
ATATTTTTCT 
GGCCAATTTC 
TTCAGCTTGC 
AGCAGCAACG 
GAGCGCTTCC 
AGCGTGCTTG 
TGGGTTGATG 



TGATGACGTG 
GGCGGACGAG 
ATAACCTTAA 
GTCCAAGCCA 
GCACCAAATT 
CGAACCAAAC 
ATAGACACAT 
TCGATGGTTT 
GTATCAATTT 
ATGACATCAA 
TCTGCAGTAA 
CCTGCAACCT 
ACTGTGTAGT 
ATTGGCACAC 
GAACCGTTTG 
CTTGGCAAGA 
CCTGGCGCAC 
ATAAAGCGTT 
GCCAAGGTCA 
ACACGAGGAA 
TCAGGACGCA 
AAGATTTCAG 
TCGTAAACGG 
TCTTCTACTT 
AATTCAGCAT 
ATTTCTTCTT 
AACATGATTT 
GTTCCAGCTA 
ATGATTTGGC 



12600 

12660 

12720 

12780 

12840 

12900 

12960 

13020 

13080 

13140 

13200 

13260 

13320 

13380 

13440 

13500 

13560 

13620 

13680 

13740 

13800 

13860 

13920 

13980 

14040 

14100 

14160 

14220 

14280 
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CATCTACATA TCCCACTTGT ACCCCAGCAA TTGGTCCGTC AAATGGAATA TCTGAAATAG 14340 

ACAGTGCCAA AGATGAACCA AACATAGCAG CCATTGGTGC AGATGCATTT TCATCATAAG 14400 

AAAGCACTGT ATTGATGACT TGGACTTCAT TACGGAAACC TTCCGCAAAC ATAGGACGAA 14460 

TCGGACGGTC AATCAAACGC GCTGTCAAGG TCGCATCTGT TGAAGGACGT CCTTCACGTT 14520 

TCATAAAGCC ACCAGGAAAC TTCCCAGCCG CATACATTTT TTCTTCGTAG TTGACTTGGA 14580 

GTGGGAAGAA ATCCCCAGTT GCCATTTTCT TAGACATAAC GGCAGCAGTC AAGACAGTTG 14 640 

ACTCACCGTA ACGTACGACA ACAGATCCAT TTGCTTGCTT AGCAACCTGA CCAGTCTCTA 14700 

CAATTAACTC ACGACCCGCA AAAGTCGTTT GAAACACTTG TTTTGCCATT TTAATCCCCT 14760 

TTGGATTGAT GAAATTATAC GCCTTGCCTA CAAAGATCAA GATACCAAGG ACGTCAAAAG 14 820 

CAAAGTAAAA ATAGGAAACT GACGAAGTCT TCGATGAAGA CAAGACAGTT TATCTTTTTT 14880 

ACACAGCTTT TCGGCCGTGT TCAATTACAC AAGATATTTT GGACGGTTCG GCTTGCCGAA 14940 

CATTTCTGTA GAAAAATAGG AAGGTGACGT CGCACTCGAC GAGTGCTAGG AAGCTTATCT 15000 

TTTTTCCTAA GAAATGAGAC CAAAATTCAA GTCATCAAGA TACCAAGCCG TCAAGCAACT 15060 

CAAAGGAAGA TAGGAAATCG AACGACGGAG CGACTACTCC TAGGGAGATT TATCTTTTTC 15120 

CACAGAGTTG TAGGCAAGTT CAGTTTTCAA GATACATCAT TAGAAAGGTT TAATACTAAA 15180 

GTATCTAAAG CTTTCACGCT AATCGCTATC GGGCGATTAG CTAAATGCTT TACTAACTCT 15240 

CTCGTCAAAT AACATCGATT TGACTCACTC GTGTCGTTAA ATCTTACAGT TTAAATGCAT 15300 

TGTATTATTT AATACCTTCA TCTTTGTATC AAGTACGTAC AGAATTTATT TTATCATATT 15360 

TTTCTTAAAA AGTGAGGTCT TTACCATTAA AAAGGAACCA TTCCCCTCAC CTGAGAAGAA 15420 

TGGTTTGCTT TTATTATCCT AGAGACTGGT GATTAAACAA GGCATGGGTT GCTTGATGGA 15480 

TGTATTTTGC TGTATCAGCA TTATTCATCG TATAGAGATG CACACCGGCA ACATCCTGAG 15540 

TTACCAAGTC CACGATTTGG TCCACTGCAT AGGCAAGTCC TGCTGCTCTG AGCGACTCAG 15600 

GGTCATGCTC ATACTTGTCT AAGATGGCTT TAAATTTGCG TGGAAGATGG ATATTCTCAC 15660 

AAGTCTTCAA GAGTCGGAGA GCCTGATTTC GATTCAGAAT TGGCATAATT CCTGCATGAA 15720 

TGGGAACATC AATCCCAGCC AAGATACACT TGTCCTGAAA ATCATAGAAG CGCTCATTGT 15780 

CAAAGAAGAG CTGAGTTACG AGGCTCGAAC AGCCTGCATC CACTTTCTTC TTAAGATTTT 15840 

GAATATCTGA AATCTGATTT GGCGAATCTG GATGCCCTTC TGGATAGCAA GCTCCAATAA 15900 

TATCAAAGTG AGGGGTTTGT TCCTTGATAA ACTCAATCAA GTCGGTTGCA TAGCGGAAAT 15960 

CCTTTTGTGG TTCCACGTCT GGAATAATAT CCCCACGAAG AGCCAAGATT TTCTGCACCC 16020 

CAACTTTGTC CAAGTCAGCA ATAGTTTCAG CAACCTTGTC CTTAGTTAGA TAAATAGCTG 16080 
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GCAAGTGGGC 


AATGGTCGGA 


ATCGCCAAAT 


CATTTTGGAT 


AAAGTCAGCC 


AAACGAACCG 


16140 


TCGTTTCCTT 


GATATTAAAT 


TTATTATTGC 


TGGCAGTTAC 


ACTGATAAAA 


TGGGGAGCCA 


16200 


ACTCCTGCAT 


ATCCTGCAAG 


GCTGAAATAA 


TGTTATCATT 


ACCCACGGCT 


GGGTTTGGAG 


16260 


GGAACACTTC 


AAATGAGAGT 


GACGGTGTTT 


GGCGTGACAT 


ATGTAATAAC 


CTTTTCTAGT 


16320 


TGATTTCTTT 


TTGAACAACC 


ACTGTATGGA 


GAGAAATCCA 


ATCTTACAAT 


TTCTCACGCG 


16380 


.CAGCTTTAGC 


TGCTTCAACA 


AGGCGGATCA 


AGCTTTCTTT 


TGTTTCTGGG 


ATACCACGTG 


16440 


TTTTCAAACC 


ACAGTCAGGG 


TTGATCCAAA 


CTTTCTTGCT 


TGGCACTTTA 


GCAAGGATGG 


16500 


CTTCGATTGT GTTGTCGATT 


TCGCCTTCAT 


TTGGTACACG 


AGGTGAGTGG 


ATATCGTAAA 


16560 


CCCCAGGTCC 


CACTTCTGTT 


TGGAAGTTTT 


TCGCTTTGAG 


TTCGTCCAAG 


ATTTCAAGGT 


16620 


TTGAACGGTT 


AGCTTCAAAG 


GAAATAACGT 


CTGCATCCAT 


GTTATCGATA 


GCTGGGATGA 


16680 


TATCTGTAAA 


TTCTGAGTAA 


CACATGTGAG 


TGTGGATTTG 


TGTGTCTGGC 


GCTACTGTTG 


16740 


AGTGTACCAA 


GCGGAAGGCA 


GGAATAGCCC 


AGTCAAGGTA 


GTCTTCGTAC 


CAGTCGCTAC 


16800 


GGCGGAGTGG 


CAATTTTTCA 


CGAAGAGCAG 


CCTCGTCGAT 


TTGGATGATT 


TTCACACCAG 


16860 


CAGCTTCAAG 


GTCAAGTACT 


TCATCCTTGA 


TAGCAAGGGC 


GATTTGGAGA 


GTTGAATCCT 


16920 


TGATAGAGAT 


GTCTTCACGT 


GGGAATGACC 


AGTTAAGGAT 


GGTAACAGGT 


CCAGTCAACA 


16980 


TACCTTTAAC 


AGGTTTGTTT 


GTACGACTTT 


GTGCATAGCT 


AGACCATTTA ACAGTGATAG 


17040 


GGTTAAGACG AGTGACATCA CCCCAGATGA TTGGTGGTTT 


TACCCCACGC 


ATACCGTATG 


17100 


ATTGTACCCA 


TCCATTTTTA 


GAGAAGAGGT 


ATCCTGACAA 


GTTTTGACCG 


AAGTACTCAA 


17160 


CCATGTCATT 


ACGCTCAAAT 


TCACCGTGAA 


CAAGGACATC 


AAAGTCAATA 


TCTTCTTGCC 


17220 


ACTTGATCCA 


TTCGTCAATC 


GTTTCAGCAA GGAAAGCGTC 


GTACTCTTTT 


TGAGACAATT 


17280 


CACCTTTACG 


GTAAGCCAAA 


CGTTTGGCAC 


GAACTTCTTT 


TGTTTGAGGG 


AATGAACCAA 


17340 


TCGTTGTTGT 


TGGAAGAGCT 


GGAAGTTTGA 


AAGCTTCTTC 


TTGGATAGCT 


TCACGTTCTG 


17400 


CAAAGGCTGG 


CAAACGAGTG 


TAGTCTGCGT 


CTGTCAAGCC 


AGCGATACGC 


GCACGAAGTT 


17460 


CAGCATTTTC 


ACCAACACGC 


TCAGTCGCAA AGAGTTCTTT 


GTTGGCTGCA 


AGAGCTTCTG 


17520 


AACCTTGACC 


ATTTCGGATA 


GCATCCAAGT 


CACGGATTTC 


ATCCAATTTT 


TCAACTGCAA 


17580 


AGGCAAAGTG GTTCAAGAGT 


GCTGGTTCAA 


ATTCTTCATT 


AGCAGTTGTA 


AATGGCACAT 


17640 


GAAGAAGTGA GCAAGAGCTT 


GTCAAAACGA 


TGTTTTCAGC 


TGGAATTTGC 


TCAAGAACAG 


17700 


CCAAGCTCTT 


TTCGTAGTTG 


TTGCGCCAGA 


TGTTTTTACC 


ATTGACAATA 


CCTACATAGA 


17760 


GAGTCTTGTC 


AGCTGGGAAG 


CCACCTTTAA 


CGAGTTCAAG 


AGTTTTCTTA 


CCTTCAACAA 


17820 
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AGTCAAGACC GATAGCATCT ACTGGTAAGT 


TTACAAGGTC 


AGCGTATACG 


TCACGAACAT 


17880 


CACCGAAATA AGTTTGAAGC AAGACTTCAA 


GACCTTTTTT 


GTCAGCCAAG 


AGTTTGTTGT 


17940 


AAAGGTTCAA GAAGAGAGCT TTTTCTTCAG 


CTGTCAAGTC 


TTTTACAAGA 


GCCGCTTCAT 


18000 


CCAATTGGAT GCGAGTCGCA CCAAGTTCAG 


CCAATTTAGC 


AAAAACTTCT 


TGGTAAGCAG 


18060 


CCACTAAGCT ATCTACGAAG TCGTCTGCTT 


TCACGCCTTC 


TTCAAAGTCT 


GACAATTGAA 


18120 


GGAAAGTGAA GGGACCTACA AGAACAGGAC GAGTGTTCAA TCCAAGTTCT 


TTGGCTTCTT 


18180 


GGAACTCATC GAAAATCTTG TGACCAGCCA 


ATTTTACTTG 


AGTGTCTTTT 


TCAAATTTAG 


18240 


GAACGATGTA GTGGTAGTTA GTGTTGAACC 


ATTTCTTCAT 


TGGAAGGGCG 


CGAACGTCCC 


18300 


CTTTTTCTCC CTGGTAACCA CGTCCCAAAG 


CGAAGTAGCG 


CTCAAGGTCA 


GACAAGTCCA 


18360 


AGTTTTGAAC GGATGCAGGC ACCACGTTGA 


AAAGGAAAGC 


CGCATCTAGG 


AAGTTATCAT 


18420 


AGTGAGAAAA GTCATTTGAT GGAATTTCAG TGATGCCTTT TTCTTTGACA 


ATGTTCCAGT 


18480 


GTTTAGCACG CAAGTCTTTT GCTGCTGCTA 


AAAGTTCTTC 


TTCTGAGATT 


TCTTTTCTAA 


18540 


AGTATTTTTC AGTTGTAAAT TTTAATTCAC 


GGAATTCGCC 


CAAACGAGGG 


AAACCGATGA 


18600 


TTGTAGTTGA CATGATGTGT CCTCCAAAAT 


TTGTTGTTGA 


AACTATCTTA 


ACAGAAAAGA 


18660 


AAGCGTCTGT ATAATTGTAA AAAATTAGGG 


TTTGATATAG 


TTTGAAACTA 


TATATCTGTT 


18720 


TCGGACAAAA GAAAAAGACT TGAAGCAAAC 


GTCTCAAATC 


CTTTGTAATT 


CTTACTTTAC 


18780 


AGCTATATTC CAATTAGAAT ACTAAAACAT 


GTTATTAGTA 


ATTCTTATAA 


GTGACTATGA 


18840 


CCTGTTATTA GAAAAGACTA TAACTGATTC 


TAGTCAACTT 


TTTCCCTGTT 


CAAGTGGGAC 


18900 


GATTGCTAGT GTCTTTCCTA AACTGGCTAG 


GACTTTTAAG 


ACTGTATCCA 


ACTGAGGACT 


18960 


AGTCTTTCCT GTCTCCATCC TAGCTATGAC 


AGGCTGGCTT 


ATTCCACTGA 


CTTCTTCCAG 


19020 


CTTTTTCTGA CTGATTCCTT GTTCATACCT 


AGCCTCAATC 


AACTCGCTCA 


TGATAGCCAC 


19080 


TCGCATATCA CTTTCAAGGA TTTCCTCCTT 


GCTAAAGAGC 


TCAGATGGAC 


ATCCTTCCAA 


19140 


TTACTCCCAA TAGCACTATT CTTCATCACT TAACCCTCTT TTTTTTACGT CTATGTATTT 


19200 


TTAAAAAAAT GAGCGAATTA TGATTCGATA 


GATTGACCAG 


TGGGTTTAAA 


GTTGGTGCTA 


19260 


GCCTATTTCT TAAGCGATTT TCCTTTTCTA 


GGATAAAGCA 


GTTCCTGCTT 


GCTTAACCCC 


19320 


AATTTTCCAC GATGAATCCA ATAGTAAATG 


GTTGAAATTC 


CCACGTTAAC 


CCCTTTAGCC 


19380 


ATCACCATCA TTTCAGGCGA AAATTTTTGG 


TTATGTTTTT 


GGTTATGTAT 


AGTGGAGAAT 


19440 


CTTTTCCTTT AGTTTCTTAA GACTGTTGAG 


CGTAGTCGGC 


AGAATAAATC 


TCTTTGAAGC 


19500 


GCCCTTTTCC AAGACATTGT CGGACTGTCC 


CACGCTTGAT 


TTCAGTGTGG 


ATAGTTTGAG 


19560 


GAGCTTTTCC AAGTAGAGAG GCAATTTCTC 


TATTTGATTT 


TCCTTCTTTT 


TTCCATCGTT 


19620 
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CGATTAAGCG 


ACGGCTATCG 


ATTGTCAAAT 


GTTTGCCTTT 


TGTAGTATAA 


TTGTCTTGCA 


19680 


TTTCTGTGCC 


TTTTAATCAT TTCAATCTTA AATTGGACTT 


TTTTTACTTG 


GGTTGTACTT 


19740 


AATCTATGAG 


GAAGACAAGA 


AAAAGAATAT 


CAATCAAGTA 


AAGTCACAAA 


GTCACATTAG 


19800 


CTCCGAGCAA 


CCATTGCAAA TTGAGGTACT 


CACACAATGA 


TTAAAACATT 


TCTCTCTGCC 


19860 


CTTTCGGTCA 


TTCTCTTTTC 


TATCCCTATC 


ATAACTTATT 


CTTTTTTCCC 


ATCTTCTAAT 


19920 


CTTAAC ATTT 


ulaL-lAlClAL 


CCAACCTATC 


TTGGCACAGA 


TTTATGCCTT 


CCCCTTAGCT 


19980 


ACTGCAACTA 


TGGCTGCTAT 


TTTAAGTTTC 


TTATTTTTTT 


TCCTATCTTT 


TTACAAGAAA 


20040 


AATAAACAAA 


TACGGTTTTA 


CTCTGGCATT 


TTGCTCTTAC 


TATCGCTCAT 


ATTACTATTA 


20100 


TTCGGAACAG 


ATAAAACCCT 


TTCTTCTGCA 


TCAAATAAGA 


CTAAAACCTT 


AAAATTAGTA 


20160 


ACTTGGAACG 


TCGCTAATCA 


AATAGAAGCA 


CAACATATTG 


AGCGAATTTT 


TAGCCATTTT 


20220 


GACGCCGATA 


TGGCTATATT 


CCCTGAACTA 


GCTACCAATA 


TCAGAGGTGA 


GCAAGAAAAC 


20280 


CAGAGAATCA 


AACTATTGTT 


TCATCAAGTT 


GGACTTTCTA 


TGGCCAACTA 


TGATATTTTC 


20340 


ACTTCTCCAC 


CTACCAATAG 


TGGAATAGCT 


CCTGTGACTG 


TGATTGTCAA 


GAAAAGTTAT 


20400 


GGTTTCTATA 


CAGAAGCTAA 


AACTTTTCAT 


ACAACACGGT 


TCGGGACAAT 


TGTATTACAT 


20460 


TCGAGAAAAC 


AAAATATACC 


AGATATCATT 


GCCTTGCATA 


CTGCGCCTCC 


TCTGCCAGGT 


20520 


TTAATGGAAA 


TCTGGAAGCA 


AGACTTAAAC 


ATCATTCATA 


ATCAATTGGC 


TTCAAAATAT 


20580 


CCAAAGGCTA 


TTATTGCAGG 


TGATTTTAAT 


GCAACTATGC 


GTCATGGAGC 


ACTTGCAAAA 


20640 


ATAAGCTCTC 


ATAGGGACGC 


ATTAAATGCA 


CTGCCACCTT 


TTGAAAGAGG 


AACTTGGAAT 


20700 


AGCCAAAGTC 


CAAAACTTTT 


TAATGCAACA 


ATAGATCATA 


TTTTATTGCC 


TAAAAACCAC 


20760 


TACTATGTTA 


AAGATTTAGA 


CATTGTAAGT 


TTTCAAAACT 


CTGATCATAG 


ATGTATTTTT 


20820 


ACAGAAATCA 


CATTTTAATT 


ATTTTATATA 


AAATCACCCC 


TCTAATGTTC 


ATAAACTAGA 


20880 


GGGGGAATTT 


GTATCCTACT 


ATCGTTTAAC 


GCACTTCTGC 


ATTGACTTTT 


TCTTCGAGAG 


20940 


ACGCTTGGAT 


TTTTTCCATA 


TAGCGTGCGA 


CTTCTTCGTC 


CGTTAAGCTG 


TCTTCTGGAT 


21000 


TTTGGAAGGT 


CAAGCTATAA 


GCCATTGACT 


TCATACCAAG 


TCCCAGTTTT 


TCACCTGAGA 


21060 


AGACGTCAAA 


GAGTTTGATA 


TCTGTCAAAC 


GTTTCACGCC 


GGCAGCTTGG 


ATAGCATCTA 


21120 


CAACTTCTTG 


GTGAGTCACT 


TCTGCCTTGA 


GGAGAAGGGC 


AACGTCACGG 


CTGACTGCTG 


21180 


GGAATTTGGT 


GATTTCCACA 


AATGGAACAG 


CAGGTTGGAG 


CGCCCCTTCG 


ATGGCTGAAA 


21240 


GGTTAAGCTC 


AGCTACATAC 


GTTTCTGGAA 


TATCGTAAGC 


CTTGGCAGTG 


ACTGGATGCA 


21300 


CTTGGCCAAG 


GAAACCAAGA ACTTGGTCAC 


CGAGTGAAAT 


CACGGCTGTA 


CGACCTGGAT 


21360 
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GAAGGCTAAC 


GATTTCAGAT GTTGCTGTAT 


AGGTTACTTG 


GAGTCCCAAA 


CGAGTAAATA 


21420 


GGGCTTCAAG 


GATTCCCTTA GCATAGAAGA 


AATCAACTGG 


AACTGCTGCT 


GTTTGGAAAT 


21480 


CTTTTTCAGC 


AACCAAGCCT GTCAAGGCAA 


AGGCAAAGCT 


GTTGATCTCA 


TTTGGAAGTT 


21540 


CTTCTTTTGG 


ATTACCTGTT TGTTCAAAGA 


CTTTTCCAAT 


CTCATAAAGG 


GCCAAGTTTT 


21600 


TATTCTTACG 


AGCCACGTTG TAGGCAACGG 


TATCAAGGAT 


CCCTGAAATC 


ATATTTTGAC 


21660 


GGAGGACTGA ACGATCCACA GTCATTGGCC ACATGAGTTC AGTAAGGTTA 


CTTGGTTGAG 


21720 


CTGTGAACTC 


AACTGCTTTT TCAGGAGTTG 


TCAGAGCATA 


GGTGATGATT 


TCTGTCAAAC 


21780 


CTGCTCCTTC 


AGCAATGGTA CGAACTTGAC 


GGCGGAGTTT 


TTGTATCACA 


GTCAATTCAC 


21840 


CAGCTGTACC 


ATCGTCTTTT GGAAGGCTGG 


TTGGCAAGCG 


GTCATATCCA 


TAGATACGAG 


21900 


CGATTTCTTC 


AAAGAGATCA GCTTCGATTG 


TGATATCCCA 


ACGACGACGT 


GGTACGCTGA 


21960 


CTGTAAAGCT 


GTCTGCATTT CCAGAAAGAC 


CAAAGCCAAG 


ACGACGGAAG 


ACGTCTTCTA 


22020 


CATCAGCATA AGACAGCTCA GTTCCGAGGA CACGGTTAAC ATCAGCAAGG GTTGAAGAAA 


22080 


CTTCCACATC 


AGAGGTATCA AGCTCACCCG 


CTGAAACGAT 


ACCCTTACGC 


ACCGTCGCGC 


22140 


CTGCAAGCTC 


TGCAATCATG CTAGCTGCCG 


CATCAAGGGC 


TTCATTAACT 


GTTGCCACAT 




TAATTCCTTT 


TTCAAAGCGA GAAGATGACT 


CAGAACGAAG 


GTTCAGGCGA 


CCACTTGTCT 




TACGGATAGA 


TTTGCCATTA AAAACAGCAG 


CTTCAAGGAT 


AACACGACTA 


GATTTTTCAG 


22320 


AAATTTCTGT 


AGCCTGACCA CCCATAACAC 


CGGCAAGGGC 


TACTGGTTTG 


TCAGCAACTG 


22380 


TAATCACGAG 


GTCTGTCTCA GCCAAGTCTC 


GTTCTTCACC 


GTCCAGGGTC 


ACTAATTTTT 


22440 


CACCATCACG 


CGCTTCACGC ACACGGATGT 


CAGTCCCTTC 


AAATGTGTCC 


AAGTCAAAAG 


22500 


CATGCATAGG 


TTGACCAAAG TAGAGCAGGA 


TGTAGTTTGT 


CACGTCTACA 


ACGTTATTGA 


22560 


TGGGACGGAT 


GCCTTCGTTC ATGAGAAGGT 


TTTGCAACCA 


TTGTGGACTT 


GGTGCGATAG 


22620 


TCACATTGTC 


CAAGATACGA GCTGCATAGT 


AAGGCGCCTT 


GTCTGTCTCA 


ATGCTGACAG 


22680 


AAAGGGCATC 


TGCCGCAGCT TCATTAGTTT 


CTGTTAGAGT 


AAATTTTTTA 


AAGTTGACTG 


22740 


CCTTGTCATA 


GATGGCTGCC ACTTCGTGAG 


CCACTCCACA 


CATAGAAAGG 


GCATCTGCAC 


22800 


GGTTTGGTGT 


GATGGAAAGT TCGATGATTT 


CATCATCCAA 


GTCTAGGTAA 


GAAAAGACTT 


22860 


CCTCACCTGG 


CACGGCATCT TCAGGCAAGA 


TTTGGATGCC 


ATCTGCGAAT 


TCCTTAGGCA 


22920 


CAACTGAGTC 


AGAAATTCCC AATTCACCAA 


GTGAACAGAT 


CATTCCAAGT 


GACTCCAAAC 


22980 


CACGGATTTT 


TCCTTTTTTG ATTTTGTAGT 


TATCAGCGAT 


ACGAGCTCCT 


GGAAGAGCCA 


23040 


CCATGACCTT 


GATCCCAGCA CGCACATTTG 


GGGCACCACA 


AACGATCTGA 


CGCTCTTCTT 


23100 


CTTCGCCAAC 


GTTAATCTGA CAAACATGGA 


GGTGAGTCTC 


TGGCACATCT 


TCGCAAGACA 


23160 
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AGACCTCACC 


GACGACAATT 


TTTGAGAGAC 


CAGCAGCTGG 


TGATTCGACA 


CCCTCTACCT 


23220 


CGATCCCTGT AGTTGACATT TTTTCAGCCA ACTCTTGTGA TGGCACATCA ATGTCCACCA 


23280 


ATTCTTTTAA 


CCATTTATAA 


GATACAAGCA 


TAATTTAGTT 


CTCCAGAATG 


ACAGTTGTCA 


23340 


UlQ-iAVjlTCT 


TTTCCTTTCC 


TATCATTTCA ATAGAAGAAT 


CCTCTTCTTA 


CCTTAATTTC 


23400 


TTTCTCAGTA 


ACCAATCCGT 


ATCTACTTTT 


TGACCAACCA 


TAAAATGATG 


TTGGCTAAAT 


23460 


TTTTCAAAAC 


CATATCGGTT 


ATAAAACGCT 


TGAGCTTTTG 


TATTATGCTC 


CCAAACACCT 


23520 


AGCCAAGCCC 


AAGAAAAACT 


ATTTTTTGTA 


GCAAGTTCAA 


GTGCGAATTC 


AAACAGTTGC 


23580 


TTACCTAGTC 


CAAATCCTTG 


GAATTTTTGT 


AGCACATAGA 


GACGTTGAAT 


TTCAAAAGCG 


23640 


TCCTCTAATT 


CTCTCTCAGT 


TTGAGCACTT 


CCCCAGTTGA 


CTTTGAGAAA 


ACCAGCTATC 


23700 


TCCTCCTCAT 


GCATAATGAA 


ATAGGTTTCA 


GAGTCAGGAT 


TTCCCAACTC 


AGTTGACAAA 


23760 


GTTTTCAGAC 


TATAAGCCTC 


TTCAAAGTAT 


TCCTGTAACT 


GCTCTTCCGT 


ATTATCATAC 


23820 


GCAAAGGTTT 


CACGAAAGGT 


TTGTTTGGCA 


ATTTTAGCCA 


ACACCTCAAC 


ATCTGCCATT 


23880 


TCTACTTTTC 


TAATCATTAT 


TTAAACTGTT 


CTGAGAAGCG 


GACATCTCCT 


TGGTAGAATC 


23940 


CACGGATATC 


GTTGATTCCA 


TAACGGAGCA 


TAGCTACACG 


CTCTTGTCCA 


AGACCAAAGG 


24000 


CAAAGCCAGA 


GTATACAGTC 


GCATCGATAC 


CACTCATTTC 


AAGGACACGT 


GGGTGAACCA 


24060 


TACCGGCCCC 


CATAATTTCG 


ATCCAACCTG 


TTTTCTTACA 


TACATTACAG 


CCTTCTCCAC 


24120 


CACACTTGAA 


GCAAGAAACA 


TCCACCTCAA 


CAGATGGCTC 


TGTGAATGGG 


AAGTAAGATG 


24180 


GACGCAAACG 


AATTTGACGC 


TCTTCACCAA 


ACATTTTTTG 


GACAATCAAC 


TGAAGCGTTC 


24240 


CTTGAAGATC 


AGCCATAGAG 


ATATTTTTCC 


CAACTACCAA 


GCCTTCGATT 


TGGTGGAATT 


24300 


GGTGACTGTG 


GGTCGCATCG 


TCCGTATCGC 


GACGGAAGAC 


ACGCCCTGGC 


GAGATCATCT 


24360 


TCAAAGGACC 


TTTAGAAAAA 


TCATGGGCAT 


CCATAGCACG 


CGCCTGAACT 


GGAGACGTGT 


24420 


GGGTACGGAG 


CAAGATTTCT 


TCAGTGATAT 


AGAAAGTATC 


CTGCATATCA 


CGAGCTGGGT 


24480 


GGTCTTTTGG 


AAGGTTCATA 


CGTTCAAAGT 


TATAGTAGTC 


TTGCTCCACT 


TCAAAACCAT 


24540 


CCACGACTTG 


ATAACCCATA 


CCGATGAAGA 


TATCTTCGAT 


TTCTTCACTG 


GTTTGTGTCA 


24600 


AAACGTGACG 


GTGACCAGTC 


GCAACTGGAC 


GACCTGGAAG 


CGTCACATCT 


ATACTCTCGC 


24660 


TAGCCAGTTG 


AGCCGCGACT 


TTCTTTTCTT 


CCAAGAGCTT 


AGCTGTTTCT 


TGAAAAGCAG 


24720 


CAGTCAAGAC 


ATCACGAGCT 


TCATTGACGT 


GTTTCCCGAT 


GATTGGACGC 


ATCTCAGCAG 


24780 


AAACATCTTT 


CATCCCTTTG 


AGGATTTCAG 


TGAGCGAACC 


CTTTTTACCA 


AGGACAGAGA 


24840 


CACGCAAATC 


TTGCATCTCT 


TTTTCATTTC 


CAGCAGTAAT 


CTGCTTCAAG 


CTAGCCAGCG 


24900 
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TTTCTTCGCG AAGCGCTTTT AATTGTTCTT CAATAGTTGA CATATTTCCT CCATCAGTCT 24 960 

CTCGTAGATA AAAAGAAAAC CACATGCCAA AAACTCCACT CGGAGCGTTG ACACGCGGTA 25020 

CCATCCGTTT TCATCTGACA AGTCAGACCT TCATTTCTAA ATCCATGCGC AAGTGAATTC 25080 

ACCCAGCTTT CATATAGAGA GCTTGCAGTC ACGGCTCTCC TCCCTGATAT ACTTCCCTTG 25140 

GGCTACTAGT CTTTCAGATT CCTATTCAAT TACTACTTAG TTTATCAGAT TTTTACCATT 25200 

CTTGCAAGAC CTATCTTACT TCTGCTTGTT AGCTTATTCT TATCTAAATT TATATAAACc 25260 

TTATCTAAAT TAACTATTTA TAATTTTTGT AACAAAATTA AATTAATTGA CACTCCCCTA 25320 

TAAAATAAAG AAGTTTAGAA TTTAATGTCT TCCAAACTTC TTTATTCCAT ATTTAATGAA 25380 

ATGCCACCTT AACCGTGATA ATAGCTAGTC ATCAATAAAA AACTATTTGA ATAAGGATTC 25440 

TCCATTTGAT TCAATCACTT CTTTATACCA AGTAAAAGAC ATTTTCTTAT ATCGATTTAA 25500 

TGTACCACTT CCATCATCGT TTCGATCAAC ATAAATGAGA CCGTACCTTT TAGAAAGTTG 25560 

TGCAGTGGAC ATAGAAACAC AGTCAATACA TCCCCAAGAC GTATAGCCCA TAATTTCAAC 25620 

ACCATCCTGT AGAGCTTCAG CAACTTGCAA TAAATGTTCT TTCATATACT GAATTCTATA 25680 

ATCATCTTGG ACGGTTAAGT TATTAAGTTC ATCTTTTATT AGTTGATCTT TAGCACCTAA 25740 

TCCATTTTCT ACTATAAATA ATGGGATTTG ATAACGGTCA TAATATCTAT TTAAAATTAT 25800 

ACGTAGTCCA ATTGGATCAA TTTGCCATCC CCACTCTGAA GACTCTAAAT AAGGATTTAC 25860 

TAAACCACCA ATAATATTCC CTTCTCCTGA ATTATACTGT GTTGGAAGAG CAGATTGAGT 25920 

CACACTCATG TAATAGCTAA AGGATAAAAA ATCTACGGTA TAATTTTTTA ATAACTCTGC 25980 

ATCTTCAGCT GCAAACTCTA TGTTAATGTC ATTTTCCTTA AAATATCTTT TTGCATAATT 26040 

CGGATAATAA CCTCTAACAT GCACATCTGA AAATAGATAA TTTAGATTCT CATACTCATG 26100 

AGTCGCCCAT ACATCTTTTG GATTTGGAGT CATTGGATAA GCTGGCATAG CTAATACCAT 26160 

ACATCCCACC TTAAACTCTG AATTAATCTC ACGAGCAATT TTTGTAACCA AACTTGAGGC 26220 

GACTAATTCA TGATGTATAG CTTGATATAA TTCTTGTTTC GAAAGATTCT CCTTAGGTAT 26280 

ATCTATTCCT CCACTAGTAA ATGGTAATTC CAAAACAGAG TTTACTTCGT TAAATGTAAG 26340 

CCAATATTTA ACTTTATCTT TATACCTTTC TAAAACTGTT CGAGCAAATT TTTCATAAAA 26400 

ATGAATCATT CTCCTATCAA CCCATCCATG ATATTTTCTT GCTAAATATA ATGGAGTCTC 26460 

ATAGTGTGAA AGAGTTACAA GTGGTTCTAT CCCGTGAGCA TGTAGTTCAT CAAACAATTC 26520 

ATCATAATAT TTCAACCCAG CTTCGTTAGG TTCTTCCTCA TCTCCTTTTG GAAAAATTCT 26580 

ACTCCATGCA ATAGAAGTAC GAAAAACATT AAAGCCCATT TCAGAAAACA AGGATATATC 26640 

TTCCTTATAT TTATGATAAA AATCAATACC TATCAATTTT AAGTTATCTT CTGTAGGATT 26700 
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TTCTGTTGCT 


TCTCCTAATC 


CACCTTTGGG 


TAACACATCC 


TGAACTGATA 


AGCCCTTACC 


26760 


ATCTTCATTA 


TATGCTCCCT 


CTACTTGATT 


AGCTGCAACA 


GCTCCACCCC 


AAAGAAAATC 


26820 


ATCTGGAAAA ATGGTCATAA CTTTCCTCCA TTATAATATT 


ACCAGTAATT 


CCTTAGAATG 


26880 




TGATTATTAG 


GTAATACTAA 


TACATCTAGA 


AAATCATTGG 


TATTCGTTAC 


26940 


An I IACTGGT 


GTAACTGTTT 


CGTAGCCTTT 


AGTCTTGATT 


AAATTCAAGT 


CCATTTCAAA 


27000 


AATCAACTGA 


TTTTTGAAAA 


CTCTGTCTCC 


TTCTTCTACA 


TGACTAATAA 


AACCTTGACC 


27060 


TTTTAGCTCA 


ACAGTATCTA 


ATCCAATATG 


AATTAGTAAC 


TCAACACCCT 


CATCACTCTT 


27120 


CAATCCAATT 


GCGTGCTTAG 


TCGGAAAAAT 


ATTTGTAATT 


TTCCCATCAA 


ATGGTGCATA 


27180 


AACCTTACCT 


TCACTTGGGA 


TAATCGCTAC 


TCCGTCTCCA 


ATTAGTTTAT 


CTGAAAATGT 


27240 


TTTATCCTGG 


ACATCGCTTA 


ACGGAATGAT 


TTCTCCTGAT 


ATAGGAGAAA 


ATATCATTTT 


27300 


TTTATTTGAA 


ACTCCAGCTT 


CAACTTCTAA 


ATTGCTAGAA 


CTCTCTTCTT 


CATCGATTCC 


27360 


AAATATATAA 


GCTAATACAA 


AGGTAATAAC 


AACCGAAATG 


ACCGCCACAA 


TTAAAGCATT 


27420 


TACAATATTT 


GATGGCACAT 


CAGAATAAAT 


AAATTGAGGC 


AACGCTATCA 


AAGATGGGAC 


27480 


AGCAAATAGA 


TATGCTTTAA 


CACTAGTAAG 


ACCTGCAAAT 


AATCCCGCTA 


ATCCACCACC 


27540 


AATCATAGCT 


GCATAAAGCG 


GTTTTTTATA 


TTTTAAAGTC 


ACACCATATA 


ATGCAGGTTC 


27600 


GGTAATCCCT 


GCAAGTAAGG 


CTGAGAAACC 


TGCTGCAAAA 


GCAATTTGTT 


TTGTATTATT 


27660 


ATTTTTACTC 


TTTAATGCAA 


CAGCCATCGA 


AGCAGCCCCT 


TGAGCTAAGT 


TTGACCCTAA 


27720 


CATTGCTGGA 


AGAATTAATA 


CGTCTGGAGT 


AGCAATAGAT 


GCCGCCAAAA 


AAATAGGTGC 


27780 


AAAAGCCCAA 


TGCATTCCAG 


TCATAACAAT 


AAATGGCATA 


ATAGCACCAA 


GAATAGCTAA 


27840 


TGTAAGCCAT 


CCAGCTACAC 


CATACATTTG 


CCCAACTAGA 


TTTGATAATC 


CTTCACCAAC 


27900 


AATTACTCCA 


ATAGGTCCGA 


CTACAACTAA 


GGCAATACAG 


CTTGATACTA 


ATAATACTAG 


27960 


CGTAGGTTGC 


AAAAAACTCT 


TAGTAATAGC 


TAGTGTTAAT 


TTAGCAATTA 


TTTTTTCAAT 


28020 


ATATTTCATC 


AACCAAACCA 


TAATAAGAAT 


TGGAACGACT 


GATGAACCAT 


AACTAGCTGG 


28080 


TGTCACAGGT 


GCACCAAATA 


AACTAAGAGG 


ATTCCCTGAT 


TGCACCATTT 


GAACAAAATT 


28140 


TGGATGGAGA 


AGTACACCTG 


CTACAGACAT 


AGCTAATGTA 


GATGTTACTT 


TTAATTTTTG 


28200 


TGATGCAGAA 


TAAGCTAATA 


ACAGCGGTAA 


GAAATAATAT 


GGAGCATCCC 


CAAAAAATGT 


28260 


CAAAAAAGCA ATAGTCTGAG 


AATCTGATTG 


CAATATACCA 


AGCATTGGTA 


AAATGATTAC 


28320 


CAAGACTTTC 


AACATACCTC 


CCCCTAACAT 


TGCTGGAATG 


ATTGGAGTCA 


TGGAACCAGC 


28380 


GATATACTCA 


ATGATTCTTT 


CTAAAATATT 


CCCTTTGTGC 


CCTTGAACAA 


CTGAATCGGA 


28440 
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TTCAAAATTG CCAAGTTTAA CGAATTCTTT ATAATAATTA GCTACATCAT TACCAAGTAT 28500 

AATTTGATAT TGTCCATTCT TTTTCATAAT ACCTATTACA CCTGGTATCT TCTTCACATC 28560 

ATCATCATTG ACTAAATTTT CATCTTTTAA TTCTAATCTT AAACGTGTTA CACAATGGGT 28620 

AACTCTATTG ACATTTTTTT CACCTCCAAT TACATCGAGG ATTTTTTGTA CCGTATCTTT 28680 

ATAACTCATG GTATTCTCCT ATTCTATTAA TCTAAATTTT TTGTTAAGCG ACGAATATGA 28740 

GCCATCAAAT AAACTAATTC ACTAGAAGTC AGCAAATAAT TGTACTCCGT TTGTATAAAC 28800 

ATTGCTACCT GTTCACCACA TTCATATTCT CTAGGATATT TATTTTTCAT TAATGCTAAC 28860 

AAGTCTTCAT CATCATCGTC GG 28882 



(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 12835 base pairs 

(B) TYPE: nucleic acid 
{C) STRANDEDNESS: double 
<D> TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 



GCCTATGTCT 


TTTTCAAAAA 


AATGCTTGAC 


TTGAGACGGG 


AACTAGGGAA 


GTCTAAAGGC 


60 


GGAAGGCATT 


GATTTATACT 


CTTCGAAAAT 


CTCTTCAAAC 


CACGTCAACG 


TCGCCTTGGA 


120 


TTATATATGT 


AACTGACTTC 


GTCGATGCTT 


ATCTACAACC 


TCAAAGCAGT 


GCTTTGAGCA 


180 


ACTTGCGGCT 


AGTTTCCTAG 


TTTGCTCTTT 


GATTTTCATT 


GAGTATTATA 


TTACTTTCTA 


240 


TTTGTAGGAG 


GTGGCTTATG 


AAGATTCCTC 


TCTTAACTTT 


TGCAAGGCAT 


AAATTTGTTT 


300 


ATGTCTTGCT 


TACTTTGCTT 


TTTCTTGCTT 


TGGTTTATCG 


TGATGTTTTG 


ATGACTTATT 


360 


TCTTTTTTGA 


TATTCATGCG 


CCCGATCTAG 


CTAAATTCGA 


TGGACAAGCA 


ATTAAAAATG 


420 


ACTTATTAAA 


ATCAGCATTA 


GATTTTCGTA 


TTCTCCAGTT 


CAATCTAGGT 


TTTTATCAAT 


480 


CATTTATTAT 


TCCAATCATC 


ATTGTTTTGC 


TAGGTTTTCA 


ATATATTGAG 


CTGAAAAATA 


540 


AAGTTTTACG 


ATTGAGTATT 


GGAAGAGAAG 


TGAGTTATCA 


AGGGTTAAAA 


AGAAAGTTGA 


600 


CTTTGCAAGT 


TGCAAGTATC 


CCTTGTTTGA 


TATATTTAGT 


GACTGTGCTG 


ATAATTGCAA 


660 


TTATAACCTA 


TTTCTTTGGG 


ACTTTTTCTC 


CTCTTGGATG 


GAATTCTCTA 


TTTTCTGATG 


720 


GAAGTGGTTT 


ACAAAGACTC 


CTAGATGGAG 


AGATAAAAAG 


CTATTTGTTC 


TTTACTTGTG 


780 


TCCTACTAAT 


CGGTATTTTC 


ATCAATGCAA 


TCTATTTTTT 


ACAAATAGTT 


GATTATGTGG 


840 


GGAATGTGAC 


TCGTTCGGCA 


ATCACCTATT 


TGATGTTTCT 


TTGGCTTGGT 


TCTATGCTGC 


900 


TTTATAGTGC 


CTTGCCTTAC 


TATATGGTTC 


CTATGACGAG 


TTTGATGCAA 


GCTAGCTATG 


960 
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GGGATGTAAG TTTGATGAAA CTCTTTACTC CTTATATCCT TTATATTGTC CCTTACATGG 1020 
TGCTTGAAAA ATATGAAGAT AATGTTTAAG AATTTTAACA ATATTTTGCT AAATAGAAAG 1080 

ATTGTTTTAC TACTTCGTAT AGTTCTGATG ATGATTTTGA TAAACCATCT ATTGTCAACA 1140 

GCGGTTCAAA AGCAGGATGC TGTTATCTTT TTCAAGAGAG AATTGATTTC AATTTTTTCC 1200 

TATAATGACT ATTCTGAAGC GAATTTAGAA ATCCCCAAAC TATTGTTAAA CCTTTCGCTT 12 60 

, TTC ATGGTAG GATGGCTCTC TGTCATTTTA CTTGAAAGTG ATTTGGCAGA CCATTACCAT 1320 

CACTTGATTC GCTATCAATC AAGCTCCTTT TTCGATTATA CAAGGAAACG ATTGGTTGTC 1380 

ATTTCTAAAT TTTTTACTCA AGATTTGTTT GTCTGGTTTC TTGGTTTACT TCCTCTAGGA 1440 

ATTCATTTCA AAACAGTCGC ACTTTTCTTT TTACTTGCTC AGTTAATGAT GTTGTACTTA 1500 

CTACTGTCTT ATCTGATAGC ACTGATTAGT GCGGGCGCTG GTTTTTCCTT TTTTCTCTAT 1560 

TTTTTAGCAT TTGTGGGACA AGAATGGATG ATGGATCATA TTGTAACAGT GTATTTAGTA 1620 

CTCTTAAGTT TATTAGTTAT GTTGATTGTT AGTCGCTTGG AAGAGAAATT TAAGAAAGGA 1680 

TAAACGATGA GACTTGAAAT TATAAATGGA CAGAAAATTT ATGGGAAAAG ACCTATTTTA 1740 

AATCAGTTGA ATTTGGTGTT TCAATCAGGA AAAATTTATG GACTTAAAGG TGATAATGGA 1800 

TCTGGCAAGA CGGTTCTTTT AAAGATACTT GCTGGTTATA TTAAGCTTGA CAAAGGAAAA 1860 

GTTCTTCAAG ATGGTAAAGT TTACGGGGTA AAAAATCATT ATATTCAGGA TGCAGGAATT 1920 

TTAATTGAAA AAGTCGAGTT TTTATCTCAT TTATCCCTGA GAGAAAATTT GGAACTGTTA 1980 

AGGTATTTTT CATCTAAAGT TACGGAAAAA AGAATTGCCT ATTGGATTCA ATACTATGAT 2040 

TTACAGGAAT TTGAAGACAT TGAATACCGT CATTTATCCT TAGGAACAAA GCAAAAAATG 2100 

GCCTTGATTC AAGCCTTTAT TTCCTCTCCT TCTATACTCT TTCTCGATGA ACCTATGAAT 2160 

GCTTTGGATG AGAAGAGTGT GAGGTTAACC AAACAGGTCA TTTTATCTTA CCTGAAAAAA 2220 

GAAAATGGTC TGGTTATCCT GACGTCGCAC ATATCGGAAG ATATTTCAGA CCTTTGTACA 2280 

GATGTATTAG TTGTCGAAAA TGGACATATA CAAATGTAAA GGATATACAA TCCTAGGAGA 2340 

TGGCTTATGG CACATCTAAA ATCATTTATT ACACGATATT CCAAGGTTTA TATTGGTTTA 2400 

GTTCTGCTGA TCTGGCTGTC TTTCTTCTTT ATCCCTTGGG ATAAACCACT TCTGGGGATA 2460 

AGGATTGACA TCTTCATCAT ACAGAAAATC TTGCTAGCTT TTGGAATTCT GTCCATTCTC 2520 

ATGGCCTTGC TGTCCAAGAA AGTCAGTCTC TTTGTTTTTG GACTGATTTG CTGTCTTTCT 2580 

CTTTGGATTA ACTTATTTAT CACATTTGCC ATTTTGCCGA TTTTTGGCAA TTAAACAGTC 2640 

ATAAAAGTCG GAGAGGTTAG CTTGAAAACT AACCTCTTTT TCCTTTTCAA AATGGGGATT 2700 
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CTTCCTTGAA AATAATCAGT AATTGTGCTA AAATTAAAGG AACATTCTAA AATATTCGGA 2760 

ATTTAAAGTA AGGAAAAACA TGGCTAATAT TTTAAAAACA ATTATCGAAA ATGATAAAGG 2820 

AGAAATCCGT CGTCTGGAAA AGATGGCTGA CAAGGTTTTC AAATACGAAG ACCAAATGGC 2880 

TGCTTTGACT GACGACCAAC TAAAAGCAAA AACAGTTGAA TTTAAGGAAC GTTATCAAAA 2940 

TGGAGAATCA CTGGATTCAT TGCTTTACGA AGCATTTGCG GTTGTCCGTG AAGGTGCCAA 3000 

ACGTGTCCTA GGTCTCTTCC CTTATAAGGT TCAGGTCATG GGGGGGATTG TTCTTCACCA 3060 

TGGTGACGTG CCAGAGATGC GTACAGGGGA AGGGAAAACC TTGACTGCGA CCATGCCGGT 3120 

ATACCTCAAT GCCCTTTCAG GTAAAGGGGT TCACGTAGTT ACGGTTAATG AATACCTGTC 3180 

AGAACGTGAC GCGACTGAGA TGGGTGAATT GTACTCTTGG CTTGGTTTGT CAGTAGGGAT 3240 

TAACTTGGCT ACCAAATCTC CAATGGAGAA AAAAGAAGCC TATGAGTGTG ATATTACTTA 3300 

CTCAACTAAC TCAGAAATCG GATTTGACTA CCTTCGTGAC AACATGGTCG TTCGCGCCGA 3360 

AAACATGGTA CAACGTCCGC TTAACTATGC CTTGGTCGAT GAGGTTGACT CTATCTTGAT 3420 

TGACGAGGCT CGTACACCTT TGATTGTATC AGGTGCCAAT GCGGTTGAAA CCAGTCAGTT 3480 

GTATCACATG GCAGACCACT ATGTAAAATC TTTGAACAAA GATGACTACA TCATCGATGT 3540 

GCAGTCTAAG ACTATTGGTT TGTCTGATTC AGGGATTGAC AGGGCTGAAA GCTACTTCAA 3600 

ACTTGAAAAC CTCTATGACA TCGAAAACGT GGCTTTGACT CACTTTATCG ATAACGCCCT 3660 

TCGTGCCAAC TACATCATGC TTCTCGATAT TGACTATGTG GTGAGCGAAG AGCAAGAAAT 3720 

CTTGATTGTC GACCAATTTA CAGGTCGTAC CATGGAAGGT CGTCGTTATT CTGATGGATT 3780 

GCACCAAGCT ATTGAAGCCA AAGAAGGTGT GCCAATCCAG GATGAAACCA AGACATCTGC 3840 

CTCAATCACG TACCAAAACC TCTTCCGTAT GTACAAGAAA TTGTCTGGTA TGACGGGTAC 3900 

AGGTAAGACT GAGGAAGAAG AATTCCGTGA AATCTACAAC ATTCGTGTTA TTCCAATCCC 3960 

AACAAACCGT CCTGTTCAAC GTATTGACCA CTCAGACCTT CTTTATGCAA GTATCGAATC 4020 

TAAGTTTAAA GCGGTTGTCG AAGACGTTAA GGCTCGTTAC CAAAAGGGTC AACCTGTCTT 4080 

GGTTGGTACA GTAGCGGTTG AAACTAGTGA CTACATTTCT AAGAAATTGG TTGCAGCTGG 4140 

TGTTCCTCAC GAAGTCTTGA ATGCCAAAAA CCACTATAGA GAAGCCCAAA TCATCATGAA 4200 

TGCTGGTCAA CGTGGTGCCG TTACCATCGC AACCAACATG GCGGGTCGTG GTACCGACAT 4260 

CAAGCTTGGT GAAGGTGTTC GTGAACTTGG AGGACTTTGT GTTATTGGTA CAGAACGTCA 4320 

TGAAAGTCGT CGTATCGATA ACCAGCTTCG TGGACGTTCA GGTCGTCAAG GAGATCCAGG 4380 

TGAGTCACAA TTCTACCTAT CTCTTGAAGA TGATTTGATG AAACGTTTTG GTTCTGAACG 4440 

CTTGAAGGGA ATCTTTGAAC GCTTGAACAT GTCTGAAGAG GCCATTGAGT CTCGCATGTT 4500 
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GACGCGTCAG GTTGAAGCAG 


CTCAGAAACG 


TGTCGAAGGA 


AATAACTACG 


ATACCCGTAA 


4560 


ACAAGTCCTT CAATACGATG 


ATGTCATGCG 


TGAACAACGT 


GAGATTATCT 


ATGCTCAACG 


4620 


TTACGATGTC ATCACTGCAG 


ATCGTGACTT 


GGCACCTGAA 


ATTCAGTCTA 


TGATCAAACG 


4680 


CACGATTGAA CGTGTCGTTG 


ATGGTCATGC 


GCGTGCCAAA 


CAAGATGAAA 


AACTAGAGGC 


4740 


AATTTTGAAC TTTGCTAAGT 


ACAACTTGCT 


TCCTGAAGAT 


TCTATTACGA 


TGGAAGACTT 


4800 


GTC AGGCTTG TCTGATAAGG CCATCAAGGA AGAGCTTTTC 


CAACGTTCCT 


TGAAGGTTTA 


4860 


CGATAGTCAG GTTTCAAAAC 


TACGCGATGA 


AGAAGCAGTT 


AAAGAATTCC 


AAAAAGTTTT 


4920 


GATTCTACGA GTGGTGGATA ACAAGTGGAC 


AGATCATATC 


GATGCCCTTG 


ATCAATTGCG 


4980 


1 AAl»t»CAj\jTT GGACTTCGTG 


GCTATGCTCA 


GAACAACCCT 


GTTGTTGAGT 


ATCAGGCAGA 


5040 


AGGTTTCCGT ATGTTTAATG 


ATATGATTGG 


TTCGATTGAG 


TTTGATGTGA 


CACGCTTGAT 


5100 


VjA rUAAAC»L.A CAAATTCATG 


AACAAGAAAG 


ACCACAGGCA 


GAACGTCATA 


TCAGTACAAC 


5160 


noLVanl. 1 LbC Aft i A i LuL 1 1> 


CTCACCAAGC 


AAGTATGCCA 


GAAGATTTGG 


ATTTGAGCCA 


5220 


VJ/\ 1 1 uuALul. Aft i LjAAL. i 11 


GCCCATGTGG 


TTCTGGTAAG 


AAATTTAAAA 


ACTGTCACGG 


5280 


TAAAAGACAA TAAAATGAGA 


TAGTTTAGAG 


oCuViA lAlLl 


TGTGAAAAGT 


AAATTTTTAC 


5340 


TGGGTATCCG TTTGCTTTAT 


AAGGAGATGA 




TTACAGCAAA 


AAGCTCTAAA 


5400 


ATAAATATAG AAGAAGTTCG 


TGCCTTGTCA 


AAATTAGAAG 


GTCAGGCTTT 


GGAGAGGAAA 


5460 


TCACAGCGAG ATCAAGAGCT 


AGAAGCCATT 


ATACGTGGAG 


AAGACCAGCG 


AATTCTCTTG 


5520 


GTAATCGGGC CATGCTCATC 


TGACAACGAA 


GAAGCTGTCC 


TTGAATACGC 


TAAGCGTTTG 


5580 


GCAGTCCTAC AAGAAGAAGT 


GGCAGATCGT 


ATCTTTATGG 


TTATGCGTGT 


TTATACTGCC 


5640 


AAACCCCGTA CCAACGGAGA 


TGGCTATAAG 


GGCTTGATTC 


ACCAGCCTAA 


CGCGACAGAA 


5700 


GCGCCTAGTC TTATCAATGG 


AATCAAAGCC 


GTTCGCCATC 


TTCACTATCG 


TGTCATCACA 


57 60 


GAAACAGGGA TGACAACTGC 


TGATGAAATG 


CTTTATCCTG 


AAAACCTTCC 


GCTTGTAGAT 


5820 


GATTTGATTT CTTACATGGC 


AGTTGGTGCC 


CGTTCAGTTG 


AAGACCAGCA 


ACACCGCTTT 


5880 


GTGGCAAGTG GGGCAGGATT 


TTCTACTGGT 


TTTAAAAATC 


CAACCTCTGG 


AAATCTCAAT 


5940 


GTCATGTTTA ATGGGATTTA 


TGCTGCTCAA 


AACAAACAAA 


GTTTCCTTTT 


CTTAGGAAAA 


6000 


GAAGTAGAAA CAACTGGGAA 


CCCGCTTTCA 


CACGCTATTC 


TTCGTGGTGC 


TCTTAATGAG 


6060 


TATGGAAAAA ATATTCCCAA 


CTACTATTAT 


GACAATTTAA 


TTGATACCAT 


TGCCCAGTAT 


6120 


GAGAAAATGG GCTTGGAAAA 


TCCTTTTATC 


ATCATTGATA 


CCAATCATGA 


CAATTCTGGT 


6180 


AAGCAGTATA TTGAACAGAT 


CCGAATTGTC 


CGCCAGACCT 


TGATTAACCG 


TGCTTGGAAT 


6240 
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GAAAAAATTA AGCAGTTCGT TCGTGGTTTT ATGATTGAGT CTTATCTGGA AGATGGTCGA 6300 

CAAAATGAGC CAGAAGTATT TGGTAAGTCT ATCACAGACC CTTGCCTGGG TTGGGATAAC 6360 

ACAGAAGCTC TTGTCAGAGA AATTTACAAA ACGTTAGGAG AATAAGATGG CATTTATTGA 6420 

AAAAGGTCAA GAAATCGATA TGGAAGTCAT CAAGGCTGAA ACCCAATTGT CTGCGGAAGC 6480 

CTTGAGACTC AAGGAAAGCC GTGACAGGGA ATTGGCAGAT ATTATTTCAG GGGAAGATGA 6540 

CCGTATTCTC TTGGTGATTG GTCCTTGCTC TTCTGATAAT GAAGAGGCGG TCTTGGAATA 6600 

TGCTCGCCGT TTATCTGCCT TGCAAAAGAA GGTAGCGGAT AAGATTTTCA TGGTCATGCG 6660 

CGTGTATACT GCTAAGCCTC GTACCAATGG AGACGGCTAT AAAGGATTAG TTCACCAGCC 6720 

AGATACTTCT AAGGCTCCAA GCCTGATTAA TGGCTTGCAG GCTGTGCGCC AGTTGCACTA 6780 

CCGCGTGATT ACAGAGACTG GTTTGACAAC GGCAGATGAG ATGCTTTATC CGTCAAATCT 6840 

GATCTTGGTG GATGACTTGG TCAGCTACCA TGCCGTTGGA GCTCGTTCTG TGGAAGACCA 6900 

AGAGCACCGC TTTGTGGCTT CTGGGATTGA TGCACCAGTA GGGATGAAAA ATCCAACCTC 6960 

AGGAAATTTG GGTGTTATGT TTAACGCCAT CTATGCTGCT CAAAACAAGC AAACCTTCCT 7020 

TTATCATGGG CAGGAAGTTG AGACATCAGG TAATCCTTTG GCCCATGTTA TCCTCCGTGG 7080 

AGCAGTCAAC GAGTATGGCA ATTATATGCC GAATTACTAC TATGAAAATC TACTCCAAGC 7140 

CATTGAACGC TATGAAACCA TGGGACTTGA AAATCCTTTT ATCCTCATTG ACACCAACCA 7200 

TGATAACTCA GGCAAGCAAT ATATGGAGCA GATTCGAATT GTTCGCCAGA CCTTGCAGAA 7260 

TCGTGATTGG AATGAGAAAA TTAAAAAGAC GGTTCGAGGA TTTATGATTG AATCTTACCT 7320 

AGCAGATGGT CGTCAAAACC AACCAGAGAT CTTTGGTTGC TCTATTACTG ACCCTTGCCT 7380 

AGGTTGGGAA AATACAGAGG CCTTGGTAGA AGAGATTTAT GTTACCTTGA CAAAATAAGT 7440 

GAAAAGGATG GAGTTGGGGA ATCTCAACTC CTTTTGATGA GAATGATAGT TGGACACGGA 7500 

ATTGACATCG AAGAATTGGC TTCGATAGAA AGCGCAGTTA CACGACATGA AGGATTTGCT 7560 

AAGCGTGTAC TGACCGCTCA GGAAATGGAG CGCTTCACCA GTCTCAAAGG ACGCAGGCAA 7620 

ATAGAATATT TAGCTGGTCG CTGGTCGGCT AAGGAGGCCT TTTCCAAGGC TATGGGAACG 7680 

GGCATTAGCA AGCTCGGTTT TCAGGATTTG GAAGTCTTGA ACAATGAACG TGGGGCGCCT 7740 

TATTTTAGTC AGGCACCATT TTCAGGAAAG ATTTGGCTGT CTATCAGCCA CACCGATCAG 7800 

TTTGTGACAG CCAGTGTCAT TTTGGAGGAA AATCATGAAA GCTAGTCCAC ATAGACCAAC 7860 

CAAGGCTCTG ATTCATCTGG GAGCTATTCG ACAAAATATT CAGCAAATGG GGGCTCATAT 7920 

CCCTCAAGGA ACGCTCAAGT TGGCTGTGGT TAAGGCCAAT GCTTATGGTC ATGGAGCTGT 7980 

TGCCGTTGCC AAGGCAATTC AAGATGATGT TGATGGCTTT TGCGTTTCCA ATATCGATGA 8040 
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AGCCATTGAA CTCAGACAAG CTGGACTCAG CAAGCCAATC CTCATTTTAG GAGTTTCTGA 8100 

AATCGAAGCT GTTGCTCTAG CTAAAGAATA TGACTTCACC TTGACAGTGG CTGGACTGGA 8160 

GTGGATTCAA GCACTCTTAG ATAAGGAAGT GGACCTAACT GGATTGACAG TCCACCTCAA 8220 

GATTGATTCA GGGATGGGAC GGATTGGTTT TAGAGAGGCA AGTGAGGTTG AGCAGGCTCA 8280 

AGATTTGCTC CAACAACACG GTGTTTGTGT TGAAGGAATC TTTACCCACT TTGCTACTGC 8340 

TGATGAGGAA TCAGATGACT ATTTTAATGC CCAGTTAGAA CGGTTTAAAA CTATTTTAGC 8400 

TAGTATGAAG GAAGTTCCAG AGCTGGTTCA TGCTAGCAAT TCTGCAACGA CTCTTTGGCA 8460 

TGTAGAGACT ATTTTCAATG CGGTTCGTAT GGGAGATGCC ATGTATGGCC TCAATCCAAG 8520 

TGGAGCGGTC TTGGATTTGC CTTATGATTT GATACCGGCC TTGACCTTGG AGTCTGCTCT 8580 

GGTTCATGTC AAGACAGTTC CAGCTGGAGC TTGCATGGGC TATGGAGCAA CTTATCAAGC 8640 

GGATAGCGAG CAAGTCATCG CGACCGTGCC AATCGGGTAT GCAGATGGAT GGACAAGAGA 8700 

CATGCAAAAT TTCTCTGTCT TGGTAGATGG CCAAGCTTGC CCAATTGTCG GCAGGGTTTC 8760 

GATGGACCAA ATCACTATTC GATTGCCTAA GCTTTATCCG CTAGGAACCA AGGTAACCTT 8820 

GATTGGCTCC AATGGGGATA AGGAAATCAC TGCAACTCAG GTAGCGACCT ACCGCGTAAC 8880 

CATTAACTAT GAGGTGGTTT GCCTCCTCAG CGACCGTATT CCGAGAGAAT ATTATTAGAA 8940 

AAGAAAGGAG TGGAGCATGA ATCTACATCA ACCCTTGCAT GTCTTGCCTG GTGTGGGACC 9000 

AAAGTCAGCA GAAAAATACG CCAAACTAGG AATTGAAAAC TTGCAAGATC TCTTGCTCTA 9060 

CTTTCCTTTC CGTTATGAAG ACTTCAAAAC CAAGCAGGTG CTGGAGCTGG AAGACGGTGA 9120 

GAAGGCAGTT CTTTCTGGTC AGGTAGTGAC TCCTGCTAGT GTCCAGTATT ATGGTTTCAA 9180 

GCGCAATCGC CTGCGTTTTA GTCTCAAGCA GGGAGAGGTC GTTTTTGCGG TGAATTTCTT 9240 

TAACCAGCCC TATCTGGCTG ATAAAATAGA GTTGGGAGCA ACCCTTGCTG TCTTTGGAAA 9300 

ATGGGACCGC GCTAAGGCTA GTCTGACTGG GATGAAGGTT CTGGCTCAGG TAGAAGATGA 9360 

CCTCCAGCCT GTCTATCGTC TGGCTCAGGG AATCAGTCAG GCCAGTCTGG TCAAGGTCAT 9420 

CAAGACGGCT TTTGATCAGG GACTGGACCT CTTGATAGAA GAAAATCTGC CCCAGTCTTT 9480 

ACTAGACAAA TACAAACTCA TGTCCCGTTG TCAGGCAGTC CGTGCTATGC ATTTTCCAAA 9540 

GTATTTGGCA GAATACAAGC AGGCTCTTCG CCGTATAAAG TTTGAGGAAC TCTTTTATTT 9600 

CCAAATGCAG CTGCAGATGC TCAAGTCTGA AAATAGAGTT CAGGGAAGTG GTCTGGTTCT 9660 

GAATTGGTCT CAGGAAAAAG TGACAGCAGT TAAAGTAAGT CTTCCTTTTG CCCTGACCCA 9720 

AGCTCAGGAA AAGAGTTTGC AGGAAATTTT AACTGATATG AAGTCCGACC ACCACATGAA 9780 
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TCGTCTCCTA CAAGGGGATG 


TGGGGAGTGG 


AAAAACGGTA 


GTCGCTGGCT 


TGGCCATGTT 


9840 


TGCGGCAGTG ACAGCAGGTT 


ATCAGGCTGC 


CCTAATGGTA 


CCAACAGAAA 


TCCTCGCAGA 


9900 


GCAACACTTT GAGAGTTTAC 


AGAACCTTTT 


TCCCAATTTG 


AAACTGGCTC 


TCTTGACAGG 


9960 


TTCCTTGAAA GCTGCAGAAA 


AGAGAGAAGT 


CTTGGAGACC 


ATTGCCAAGG 


GTGAGGCTGA 


10020 


TTTGATTATA GGAACTCACG 


CTCTGATACA 


AGATGGGGTG 


GAGTATGCTC 


GTCTTGGTTT 


10080 


GATTATTATC GATGAGCAGC 


ACCGTTTTGG 


TGTAGGGCAA 


AGGCGTATTT 


TACGGGAAAA 


10140 


AGGTGACAAT CCAGATGTCC 


TCATGATGAC 


GGCGACTCCC 


ATTCCACGGA 


CGCTTGCCAT 


10200 


CACAGCCTTT GGAGATATGG 


ATGTTTCCAT 


TATCGACCAG 


ATGCCAGCAG 


GTCGGAAGCC 


10260 


TATTGTGACG CGCTGGATCA 


AACATGAGCA 


ACTACCTCAG 


GTCTTGACTT 


GGTTAGAGGG 


10320 


GGAAATTCAA AAAGGTTCCC 


AAGTCTATGT 


CATCTCTCCT 


TTGATTGAAG 


AATCAGAAGC 


10380 


TCTAGATTTG AAAAATGCCA 


TTGCCTTATC 


AGAGGAGTTG 


ACGACTCATT 


TTGCAGGCAA 


10440 


GGCAGAGGTG GCTCTTCTAC 


ATGGTAGGAT 


GAAGAGTGAC 


GAAAAAGACC 


AGATCATGCA 


10500 


GGATTTCAAG GAGAGAAAGA 


CGGATATTCT 


GGTTTCGACG 


ACGGTTATTG 


AGGTTGGGGT 


10560 


CAACGTTCCC AATGCGACTG 


TCATGATTAT 


CATGGATGCC 


GATCGCTTCG 


GTCTCAGTCA 


10620 


ACTTCACCAG CTTAGAGGTC 


GTGTCGGTCG 


GGGGGACAAG 


CAGTCCTACG 


CTGTTCTCGT 


10680 


TGCTAATCCC AAGACGGATT 


CTGGGAAAGA 


CCGCATGCGT 


ATCATGACAG 


AAACGACCAA 


10740 


TGGATTTGTC CTTGCGGAGG 


AAGATTTGAA 


AATGCGTGGT 


TCTGGTGAGA 


TTTTTGGAAC 


10800 


CAGACAGTCA GGACTTCCAG AGTTCCAAGT GGCTGATATT ATCGAAGATT 


TTCCGATTTT 


10860 


AGAAGAAGCA AGAAAGGTTG 


CTAGCTACAT 


TAGTTCTATA 


GAAGCTTGGC 


AAGAAGATCC 


10920 


AGAGTGGCGC ATGATTGCCC 


TTCATCTGGA 


AAAGAAAGAA 


CATCTGGATT 


AAGCTTTCTC 


10980 


TAAGGAAAAC TTATACTCAA 


TGAAAATCAA 


AGAGCAAACT 


AGGAAGCTAA 


CCGCAGGTTG 


11040 


CTCAAAACAC TGTTTTGAGG TTGTGGATGA AACTGACGAA GTCAGCTCAA AACACCGTTT 


11100 


TGAGGTGGCA GATAGAACTG ACGAAGTCAG TAACATATAT ATACGGTAAG GCGACGCTGA 


11160 


CGTGGTTTGA AGAGATTTTC 


GAAGAGTATT 


AAGCTAGTTT 


TTAGGTTTGG 


CTCTTATACT 


11220 


AGAGTCATCA AAAAGAAACG 


AGGACTCTCA 


TATGACAGTA 


ACGATTAAAG 


TAAATTACCA 


11280 


AACCACTTTC CAAAAGAAGG 


AAGCAAAAAA 


CTAGTATAAA 


CAGAAGAGAG 


AGCGAAATGC 


11340 


TCTTTTTTCG TTTCTAAAAC 


TACTTTCAGC 


CCATCATCCT 


AAAAGTAAAG 


AATCTAAATT 


11400 


CACTTTCTAT TTACCCTTCT 


TTCTTGCATT 


GATTACATAG 


ATATGCTACA 


GTTGTGGTAA 


11460 


CGATTACAAA ATAAAAGGAG 


CATGCTATGA 


AAAATCCAGC 


TTTGCTAGAA 


GAAATTAAGA 


11520 


CCTATAGAGG AAGGGATGAG 


GTTCCGGAAG 


ACTTTGATGA 


TTTCTGGGAT 


GGGGAAGTGA 


11580 
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AAAATGTTTC CACGCTTCCA TCCTACCACT TGGAGGAAAG AGATTTCCAC ATTCCTCAAG 11640 

TCAAGTGCTA TGAGTTAACA TTTGAAGGAA GCAAGGAAGG AAAGGTCTAT GCACGCATTG 11700 

TTCTTCCAAA GAGTGAGGAG AAGGTCCCAT TAATCTTCCA TTTTCATGGT TATATGGGAC 11760 

GTGGCTGGGA CTGGGCCGAC ATGCTGGGCT TCACCGTAGC TGGTTACGGT GTTGTTTCCA 11820 

TGGATGTGCG GGGCCAGTCA GGTTACTCAC AAGACGGCTT GCGTTCTCCT TTAGGAAATA 11880 

CCGTGAAGGG GCATATTATC CGTGGTGCTG TGGAAGGTCG GGACCACCTC TTTTATAAGG 11940 

ATGTTTATCT GGATATTTAC CAGTTGGTCG AAATTGTTGC TAGTCTGTCT CAGGTTGATG 12000 

AGAAGCGTCT TTCTAGCTAT GGTGCCTCAC AAGGAGGGGC TCTAGCTCTA GTTGCAGCAG 12060 

CGCTCAATCC TCGAATTCAG AAAACAGTTG CCATTTATCC CTTCTTGTCA GACTTCAGAC 12120 

GGGTGATTGA GATTGGTAAT ACTAGCGAGG CTTACGACGA ACTTTTCCGT TATTTCAAGT 12180 

TTCACGACCC CTTCCATGAA ACAGAGGAGG AAATCATGGC GACCCTTGCC TATATCGATG 12240 

TCAAAAATCT TGCCCATCGT ATCCAAGGTG AGGTTAAGAT GATTACGGGC TTGGACGACG 12300 

ATGTTTGCTA TCCCATTACC CAGTTTGCGA TTTATAATCG TCTGACCTGC GATAAAACCT 12360 

ATCGCATCAT GCCTGAGTAT GCTCACGAAG CCATGAATGT ATTTGTCAAT GACCAAGTCT 12420 

ACAACTGGCT CTGTGGAAGT GAGATTCCTT TTAAATATCT AAAATAAGGA GTCGACTCTA 12480 

AGCACAAAAT CTTAAAAATT ACAAACACGC ATAGTATCAG GGGATTAAGA AAACTTTATA 12540 

CTATGCGTTT TATCATGGAA ATATAGTAAA ATGAAATAAG AACAGGACAA ATCGATCAGG 12600 

ACAGTCAAAT CGATTTCTAA CAATGTTTTA GAAACAAATG TGTACTATTC TAGTGTCAAT 12660 

CTATTATATT TATAGAATTT TTTGTTGCTA GATTTGTCAA ATTGCTTAAA ATAATTTTTT 12720 

TCAGAAAGCA AAAGCCGATA CCTATCGAGT AGGGTAGTTC TTGCTATCGT CAGGCTTGTC 12780 

TGTAGGTGTT AATACTTTTC AAAAATCTCT TCAAACCACG TCAGCTTCGC CTTGC 12835 

(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 5020 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 

GGGGATATGA AGAACAAAAG AATATTTAAA GACTTCCAAG CTTCAAAAAT GAGTTTAAAC 60 

ATTTACACAA GCCCCTTGTT AGCCTTTGTT TTTGTCTTCA TAGGAGAGTT TGTGGCTTTT 120 
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ACTTTGTATG 
GGTCAAAATC 
GACTTTCGTT 
GACAAGAAAA 
CAGCAATCTT 
TTTAGTGGTC 
CTTTGTCGTC 
CCGTGCTTGG 
ATCTAGCCTG 
AGTAAATCTC 
TTGGGGTGTT 
TTTAGTTAGT 
GATTGGCTAT 
CTACTGCTGA 
CGGTCCGTCC 
AGAGAGGATT 
TCAACCATCA 
AGAAAAAGCT 
GGTCATTTCT 
TCAGCTCTAT 
CTACTATGAT 
GGATAGTTCT 
GGAGCGTAAT 
CAAGGAATAC 
ACTCTTGAAT 
AAGATTTCGC 
CTTTCGAGTA 
AGGTCAGGTG 
CAATGACGAC 
AGCTGTCTTT 



GTATTGGCTT 
TTGCAAGCTA 
TAATTTTAGG 
GTTGAGAAAA 
CTGAAAGGAT 
TTAGGTCAAT 
TTTACTATCC 
CTACTTCCTC 
TTCTTTACCC 
TTTTTATTCG 
GCAGGTATTC 
GGTCAACCGT 
CAGGTGGTTC 
TTGTCTATCT 
TTTTCTTCGT 
CTTATGATCA 
GGAGATCAAC 
CAGATTCTGA 
AAAGTCAATA 
GGGGAGTTTA 
TATTACCAGC 
GTCAATGACG 
GATGTTATTG 
GCTGATAGTG 
GACTTGGTCG 
GTTCGTGGGG 
GAATTTTTTG 
TTGGGAGAAG 
CACATGGAAG 
GAAAAGGAAG 



GTTAGCTCTC 
CTTGCAGACC 
ATTACTGGCC 
GACCTATTCG 
TTAGTCTAGG 
ATCGTTTGGA 
CATTTTGGAT 
AATTGGCCTC 
TGCTTCATAT 
GAGTTGCCAT 
ATGGTGCTTG 
CAGaACGTCT 
TTTTGGCATA 
TGCTAATAAA 
GAAAATACTA 
ATCACATTAC 
CCCAAGCTAT 
TGGGGGCGAC 
AACCAACTCT 
AGGAATTTTT 
CAGAGGCCTA 
AGATTGACAA 
TCGTGGCCTC 
TCGTTAGTCT 
ATATTCAGTT 
ATGTGGTAGA 
GAGACGAAAT 
TGGATCATTT 
TTGCCATTGC 
GTAAACTGCT 



964 
ATCGGACTTG 



TTGCATCAGA 
TTTGGTTATT 
AACCTTGGGA 
CCTGGCACTT 
ATCCATTCAC 
TTTACAGGGG 
AAGAACCAAT 
GGGCAATTCT 
GGCTCTTTAC 
GAATTTTGCT 
CTGATGACCT 
GAAGGTTCCA 
TTAAAGAAAG 
TAAGTATGCT 
AGATAATCAA 
CGAGCAGTTG 
TGGAACAGGG 
GGTTATTGCC 
CCCTGAAAAT 
TGTCCCTTCT 
GCTTCGCCAC 
AGTCTCTTGT 
CCGTCCTGGT 
TGAACGTAAT 
GATTTTCCCA 
TGACCGTATT 
AGCGATTTTC 
AAAGATTCAG 
TGAAGCCCAG 



CTAGAAATTT 
GCTTGACGGA 
CTTAACACTG 
TTTTATAGAG 
TTTCTTCTGA 
TTGAATCCTT 
ACAGCAGAAG 
CTAAAACTAG 
GGTCTCACCC 
CTTCTCAAAA 
CAGGGTAATC 
TTTTACCACA 
TTATGACAAG 
AAAATGAAAG 
AAAATAGGAA 
TTTAAACTAG 
GTGGATAACA 
AAGACCTATA 
CACAATAAAA 
GCAGTTGAGT 
AGCGATACCT 
TCAGCTACCT 
ATCTATGGTT 
CTAGAGATTT 
GATATTGATT 
GCTTCCCGAG 
CGTGAAGTTG 
CCAGCGACAC 
GCCGAGTTGG 
CGTTTGAAAC 



TGGAGAGGCT 
TAAAACAAGT 
TGTTCAGATG 
AGAATTTCCT 
CCTTGTTAGG 
ATTCTCTTGC 
AAGTGGTGGC 
CTATTCTTAT 
CTCTATCTCT 
CTGATACAGT 
TCTTTGGGAT 
AGGCAATCAA 
TCTGGTATTA 
GATGTGACTT 
TAGCACATGG 
TATCAAAATA 
TTGAGGGGGG 
CTATGAGTCA 
CTCTGGCTGG 
ATTTCGTATC 
ATATTGAGAA 
CAGCCCTTTT 
TGGGTTCGCC 
CTCGTGATAA 
TCCAACGCGG 
ATGAACATGC 
AGGCTCTGAC 
ACTTTGTGAC 
AAGAACAATT 
AGCGGACAGA 



180 
240 
300 
3 60 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
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GTATGATATC GAAATGTTGC GTGAGATGGG CTATACCAAT GGGGTTGAAA ATTATTCTCG 1980 

CCACATGGAT GGACGGAGCG AAGGAGAGCC TCCTTATACG CTTCTCGACT TCTTCCCAGA 2040 

TGATTTCTTG ATTATGATTG ACGAGAGTCA TATGACCATA GGGCAAATCA AGGGCATGTA 2100 

CAATGGAGAC CGTTCGCGTA AAGAAATGCT GGTTAATTAT GGTTTCCGTT TGCCGTCTGC 2160 

TTTGGACAAT CGTCCTCTCC GTCGGGAGGA GTTTGAGAGT CACGTTCATC AGATTGTTTA 2220 

CGTTTCAGCG ACACCTGGTG ACTATGAAAA TGAACAGACC GAGACAGTGA TTGAGCAAAT 2280 

CATTCGTCCA ACGGGACTCT TGGATCCAGA GGTGGAAGTC CGTCCGACTA TGGGACAGAT 2340 

TGATGACCTC TTGGGTGAAA TCAATGCCCG CGTTGAAAAA AATGAGCGTA CCTTTATCAC 2400 

AACTTTGACC AAGAAAATGG CAGAGGATTT GACCGACTAC TTCAAGGAAA TGGGTATCAA 2460 

GGTCAAGTAC ATGCACTCGG ATATCAAGAC CTTGGAACGG ACGGAGATTA TCCGTGACCT 2520 

GCGCTTGGGT GTCTTTGATG TCTTGGTCGG AATTAACCTG CTCCGTGAAG GAATTGACGT 2580 

TCCTGAAGTG AGCCTCGTAG CTATTCTCGA TGCTGACAAG GAAGGTTTCC TTCGCAACGA 2640 

ACGTGGACTC ATCCAGACCA TTGGACGTGC TGCACGTAAT AGCGAAGGTC ATGTTATCAT 2700 

GTATGCGGAC ACGGTTACCC AGTCTATGCA ACGTGCTATC GATGAAACTG CCCGCCGTCG 2760 

CAAAATCCAG ATGGCCTATA ATGAAGAACA TGGTATCGTT CCACAAACCA TCAAGAAAGA 2820 

AATCCGTGAC TTGATTGCTG TGACCAAGGC AGTTGCTAAG GAAGAAGACA AGGAAGTCGA 2880 

TATCAATAGC CTCAACAAAC AAGAGCGCAA AGAACTAGTC AAAAAGCTTG AGAAACAAAT 2940 

GCAAGAAGCA GTTGAAGTGC TTGACTTTGA ACTAGCAGCT CAGATTCGTG ATATGATGCT 3000 

GGAAGTCAAG GCCTTGGATT AGGGGAATAG TATGATTTAT TTAAGAAAGT TAAAGAAAGA 3060 

AGATTTGATG TCTTTATGGG AAATGGCTTA TTCACAGCTT AATCCAGTTT GGAAACAGTA 3120 

TGATGCTCCC TATTATGATG ATTATCAGTA TTTTTCAAAT TTTAAAGAAT TCGAACTACA 3180 

AAAATCAGAA TCCATTTTAA GCAACTCAAA TCGCCTTGGT ATTTTTGTTG ATGATAAACT 3240 

AGTTGGGACT GTTTCGCGTT ATTGGGTATG TAAAGAAACA AGATGGATGG AATTGGGAAT 3300 

TGGTATTTAT GATAAAAAAT TCTGGAACAC TGGTATTGGG AAAGTTGCTA TGTTGCAGTG 3360 

GATAGATAGG ACGTTTCAGG ATTACTTGGA GTTGGAGCAT CTGGGTTTGA CAACTTGGTC 3420 

AGGAAATATT GGTATGATGA AACTTGCTGA AAAATTAAGA ATGAAAAAAG AAGCTCATAT 3480 

TCCAAAAGTT CGTTATTATC AAGGTAAATA TTTTGACAGT ATTAAATATG GTATTTTGAG 3540 

AGAAGACTGG GAGAAAATAA ATGACGGTTA TTATCAAATC AATGGAAACT CCTGAAGAGA 3600 

TAGAAGGTAA ATCCTTCGTT CACTGGCAAA CGTGGAGAGA GGCTTATGAT GACCTTTTGC 3660 
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CTGCGGAATT TCAGGAGACA ATGACATTAG AAAGATGTCG ACTCTTTAGT CAAAAGTATC 3720 

CAGAAAATAC ATTGATTGCG ATGGATGGTG TGAAGATAGT TGGTTTTATA AGTTATGGCA 3780 

ACTGTCGTGA TGAGACTATT CAAGCTGGTG AAATTATTGC TTTATATGTT TTAAAAGACT 3840 

ATTATGGAAA AGGAATCGCA CAAAAGTTAG TGAAAGCAGC TTTGACTGAT CTTAATCATT 3900 

TTTCTGAAAT TTTCTTATGG GTATTGAAAG ATAACAAGCG CGCCATTGCT TTCTATCAAA 3960 

AAATGGGTTT TACTTTTGAT GGACAAGAAA AAATACTTGA ACTTGGAAAG CCTATAAAGG 4020 

AAAAACGGAT GGTATTCTAT TCTAAATAAT TCTCAAAAGT AAAAGCTAAT ATGGTACCAA 4080 

GTCTGAAAAT TTAATAAATT AGAAAGCGAG TAAATTTATG TCCCGTTCCC AATTAACAAT 4140 

TTTAACAAAT ATCTGTCTGA TTGAAGACCT CGAAACTCAG CGCGTGGTGA TGCAGTATCG 4200 

CGCCCCTGAA AACAATCGCT GGTCTGGTTA TGCCTTTCCT GGAGGTCATG TAGAAAATGA 4260 

TGAGGCTTTT GCGGAGTCTG TCATTCGTGA AATCTACGAA GAAACAGGGT TGACTATCCA 4320 

AAATCCTCAA CTTGTCGGCA TTAAAAATTG GCCACTAGAT ACAGGTGGGC GCTATATTGT 4380 

CATTTGTTAT AAGGCGACTG AGTTCTCTGG TACCCTTCAA TCTTCAGAAG AGGGAGAAGT 4440 

TTCTTGGGTG CAAAAAGACC AGATTCCAAA CTTAAATCTG GCCTATGATA TGCTACCATT 4500 

GATGGAAATG ATGGAAGCTC CCGACAAGTC AGAGTTTTTC TACCCTCGCC GTACAGAAGA 4560 

CGATTGGGAA AAGAAAATCT TCTAGTCTTT TACTAAATAA CCTAGCTGAT CCAAGGCCTC 4620 

CTCGATATAG TGGAGGTCTT GTTGTGTCTC GGCTTCAACT AGGTGATAAT GAATACCATC 4680 

TGTTAACTCA GAAATTGGCT TAAAGTCAGA ACGTTCAACT TGTTCTAGAA AATGTTGCAC 474 0 

GTCGCGGCGA CAGGTCAGTT TTAGTAAGGT TTCAATCTCT CCATAAACAG GATGATCAAT 4800 

CAAGATATTT TGAACGCGAC CACCATTATC TACGATAGCA AGTAATTCTC GTCCAATTTC 4860 

TTCAACTTCA TGCTTGACCT TAAATAATTT GTGATGATAA GTATTTGCAT TAGCATCTTT 4920 

ATAGATATAA CCACGATTGG TAGATAGAAT TGGAGATCCA TCAGCTCTTA AAATTGCAAT 4980 

ATCTTGAACA ATAACTTGTC GAGTGACATG AAAGTGCTCA 5020 
(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4965 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 

AAAAAGTGGC AATCCATTGA TTGGCCACTT CATTTAGAGA ATTATCGTCT CGCCCTTGAA 60 
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GAAGAAGGTC 


GTGTAGTACT 


TGAGTTACTG 


CTATCGCTAG 


AACTACTACT 


TGAACTGCTG 


120 


GAGCTGGATG 


GAGTTGGTAG 


ACTCCCCACA 


ATACTAGACC 


AAGCATTCTG 


ATAATCCGCA 


180 


TCACTTCCGC 


CAATAGCAAA 


GCGATAACTT 


GTCGCTGGCG 


CTCCTGACTT 


ATTAGCCCAA 


240 


TAGCTGGTAA 


CAGTCGAACC 


TGTGACCTCT 


ACTTCTTTTC 


CTTCAACAGA 


AACCTTCTCT 


300 


GGTTTTTGAC 


CTGTTGATTT 


CAAGACTTCC 


GATTTCACTA 


CACTAGGATC 


TAAAGCAAAG 


360 


CGCTCGTTCC 


CCCAAATGCT 


TGGGGAAGCT 


TGCTGAATCG 


CATTTACCAG 


ATGAGCCATG 


420 


TAATTAGAGT 


TATTAGAATA 


ACCTGCTCTA 


CGTGACAATG 


AATGATTATC 


ATCATGCCCA 


480 


ATCCAGCCAC 


CTAGGGTTAA 


TCTAGGTGTC 


GAAAGCATGA 


GCCACATATT 


TTCGTCTTGG 


540 


TTGGTTGTAC 


CAGTCTTCCC 


AATCCAATCT 


GCATTAGCCA 


GAGTAGGATT 


TAAAGAAGTC 


600 


AGGTTAGACT 


TGAAGGTTGT 


TGTCACACGA 


GAGGATAGAA 


CTTCTCGTAG 


CAATCCCTGC 


660 


ATAATCGTCG 


CAGTAGCTTT 


TGAATAGACT 


TGAACCGGTT 


TATCCTGATA 


CTCATACACC 


720 


ACTCTACCAT 


CTGCTGCTTC 


AATCTTTGAA 


ATCACATGCT 


TCTGATGATA 


AACTCCATTA 


780 


TTAGCTAAGG 


TCTGATAGCC 


ATTGGTATGC 


TGGGCAACTG 


TGACTTCAAT 


ACCACCACCC 


840 


ATTGGCAAGC 


TCTCAATACC 


GTACTCAGGA 


ATCTCGTAAC 


CCATCTTTTC 


CATATAACCC 


900 


TTGACATCAA 


CACCCTTTTC 


ACGGAGCATA 


CGATAGGTCC 


AGTAAGCAGG 


GATATTCCAT 


960 


GAATAGTTCA 


GAGCTTCTCC 


CAAGGTCATC 


ATTCCTGTTC 


CCTTGCTATT 


AGCATACATA 


1020 


ATCGGATTGC 


CATTAGCAAA 


GTTTGTTGGA 


TAGTTAGATA 


GAATCGTTTC 


ACTTCCCATC 


1080 


AAGCCCTGGT 
i 


CAATAGCAAT 


ACCGTAGGCC 


AGCAAGGGCT 


TGGTAGTAGA 


AGCTGGCGAA 


1140 


CGTTTGGTAT 


CAAAGGCATG 


ATTATTTTGA 


TTTTCTTGAT 


AATTACGACC 


ACCTACAAAG 


1200 


CCTAGAATAG 


CACCTGTTTG 


GTTATCCATC 


AAGACATTCC 


CTACTTCTAC 


ACGACCTGTT 


1260 


CCATCGTCTA 


AAAGATAGCC 


ATAATCAGCA 


ACCGCACTTT 


GCATGGCAGA 


ATGAATTTTC 


1320 


TGATCTATGG 


TAGTAGTAAT 


CTTATAACCA 


CCATTTTCAA 


TTTCCTTGGC 


TGCCAAATCT 


1380 


CGATAAAACT 


TCTGAGTTGC 


CTCATTTTTC 


AACTCCTTAG 


CGGAGACATT 


GTCTCTCTGA 


1440 


GCTAGATAGT 


CATACATACG 


TTCTTGAGCT 


TCTGCCAAAG 


TTGTAAAGTA 


TAAATAGTCT 


1500 


CGTGAAATTC 


CTGTAACCGT 


GCCCGATGGT 


AAAAAGTCCT 


GTTTAAGGTC 


ATAATCCTTG 


1560 


TACTGAGAAT 


ACTCGTCTTT 


GCTTAATGCA 


CCTGTACGAT 


ACATACTGTA 


AAGAACTGCC 


1620 


TTAGCCCGTC 


TTAAGCCAAT 


TTCTAGGTCT 


TCATCACTCT 


TCAACTCCCC 


AGTATTTTCA 


1680 


TAAGGAGAGT 


AAGTAATGGG 


ACTCTGTGGA 


AGTCCTGCTA 


AAAATGCTGC 


TTGAGGAACA 


1740 


GTCAACTGAC 


TGGCATCTAC 


ACCGAAAATT 


CCCTCAGCTG 


CTTGCCGAGC 


CCCTGCAATA 


1800 
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TTCTGTCCCT TATTATTTCG GCCAAAGGGA GCCACATTGA GATAGGTCGT TAAAATCTCA 1860 

TCTTTATTCA TGGCGCGTTC CAAGGCAAGA GCATCCACAA TCTCTGCCGC CTTACGAGCC 1920 

AAGGTCGGCG CATCCCCAAC CACCTGCTGT TTAATTAGTT GCTGGGTCAA GGTTGAACCC 1980 

CCACTAGAGG AACCCAAACC TACAAATTTC CCCAAGGTCG CACGAATCAC CGCCTTGGGT 2040 

ACTACACCCT TATGTTCTTT AAAGTGTTCA TCTTCTGTCG CAATGATAGC CTTCTTCAGA 2100 

TTTTCCGAAA TTTGCTCAGA TGAGATAGAA GTGCGCAACA AATCACTCTC TATGGAAGCA 2160 

ATCACCGTCC CGTCCGAATA GGTAATCTCT GAAATAGAAG AGATGTCCTT GACCTGATTC 2220 

ACCAATTCTT CTGTCTGAGG CACCCGAACC TTGTCAAATA AGGCCACTCC GTATCCCAAA 2280 

GCAATCCCAG CTCCCAACAT TCCTCCTAGA AAACCGAGTA CAAAGAGTAA GTTAAATAAG 2340 

GCTTTTATAC TCAGTAAAAT AGCTGGGAAA ATGACTGACT TATCTAAGGT TTTAGATTTT 2400 

TTGGTACTTG AACCTTTCTT GCCAGGTCTA GCTGATTTTT TATTTTTTTG TTTTTGCTGG 2460 

AAAAATTCCA GCATTTTTCG TTTTAATTCA TTTAATTGAT TTTGCATGGA TTTCCTCACT 2520 

TTATCTATTA TACCACAAAA GGGAAATTTT CAATAAAATA GCCACTTTCT TCCCTATTCT 2580 

GCTAGGCTAT TGCCCAAGTT TGTGATACAA TAGGTAGAAA CAATAATTTT AAAAAGGAGA 2640 

AAAAACACAT GCACATTTTT GATGAGCTAA AAGAGCGTGG TTTGATATTT CAAACGACTG 2700 

ATGAAGAAGC TTTGCGTAAA GCCCTAGAAG AAGGTCAAGT TTCTTATTAT ACTGGCTACG 2760 

ATCCAACTGC TGACAGCCTT CACCTAGGCC ACCTTGTCGC AATCTTGACA AGTCGTCGCT 2820 

TGCAACTAGC AGGTCACAAA CCTTATGCGC TCGTTGGCGG TGCTACAGGT CTCATCGGAG 2880 

ATCCGTCCTT CAAAGATGCT GAACGTAGTC TCCAAACAAA AGACACAGTA GATGGCTGGG 2940 

TCAAGTCTAT CCAAGGACAA CTTTCTCGTT TTCTTGACTT TGAAAATGGC GAAAACAAGG 3000 

CTGTCATGGT CAACAACTAC GACTGGTTTG GCAGCATCAG CTTCATTGAC TTCCTCCGTG 3060 

ATATTGGAAA ATACTTCACG GTCAACTACA TGATGAGTAA GGAATCTGTT AAAAAACGGA 3120 

TCGAAACAGG AATTTCTTAC ACTGAGTTCG CTTACCAAAT CATGCAAGGG TATGACTTCT 3180 

TCGTCCTTAA CCAAGACCAT AATGTCACTC TTCAAATCGG TGGTTCTGAC CAGTGGGGAA 3240 

ATATGACAGC TGGTACCGAA TTGCTTCGTC GTAAGGCGGA CAAGACTGGT CACGTTATCA 3300 

CTGTTCCACT AATCACAGAT GCAACTGGTA AGAAATTTGG TAAATCAGAA GGAAATGCCG 3360 

TCTGGCTCAA TCCCGAAAAG ACTTCTCCAT ACGAAATGTA CCAATTCTGG ATGAACGTGA 3420 

TGGACGCTGA CGCTGTTCGC TTCTTGAAAA TCTTTACTTT CTTGTCACTT GATGAGATTG 3480 

AAGATATTCG TAAACAATTT GAAGCAGCGC CACACGAACG CTTGGCTCAA AAAGTCTTGG 3540 

CTCGTGAAGT TGTTACACTT GTTCACGGAG AAGAAGCCTA CAAAGAAGCA CTTAACATCA 3600 
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CTGAGCAACT CTTTGCAGGA 


AACATCAAAA 


ACCTTTCTGT 


CAAAGAGCTC 


AAACAAGGAC 


3660 


TTCGTGGTGT GCCCAACTAC 


CAAGTACAGG 


CAGACGAAAA 


CAACAATATC 


GTGGAACTGC 


3720 


TCGTCTCATC TGGTATAGTT 


AACTCAAAAC 


GCCAAGCCCG 


TGAAGACGTC 


CAAAACGGAG 


3780 


CCATCTACGT AAACGGCGAC CGCATCCAAG AGCTTGACTA 


TGTCTTGAGT 


GACGCTGATA 


3840 


AGTTAGAGAA TGAACTGACT 


GTTATCCGTC 


GTGGGAAGAA 


AAAATACTTT 


GTATTGACTT 


3900 


ACTAAACTAT TCAACATTTA 


TCTATAAACA 


AAGGAGTTAA 


CCTCGAGAAA 


GGTAACTCCT 


3960 


TTTTGCTGTT AATAACTCTC 


ATCTATCTAT 


TTTTAATAGA 


CAGGCTACGC 


AGGACAATGC 


4020 


vtCAAQjLj 1 I\a r TAGATTATGT 


AAGATAGAGA 


GATTTGAAGG 


ACTGAACCAA 


TTAAATAAGC 


4080 


CAAAGCCAAT CAAACTACTA 


TTTACGACAA 


CGGTATCCTG 


AATATTTTTC 


TTGATGAGTG 


4140 


TTTGCAAAGA TGATGATAAC 


GAATCCAACT 


CTTGGAAGAA 


ATCCAAACGA 


TTATCTAACA 


4200 


ATAAGATATC ACTCATCTGC 


TTAGAAATAT 


CTGCACTCTC 


ATTCATCACC 


ACACCGATAT 


4260 


CTGATAGAGT TAAAGCCGCT 


GAGTCATTCA 


ATCCATCTCC 


AACCATCAAA 


ATAGTGTGAC 


4320 


CTGCTTTCTG CAGTTTCTCT 


ACTAACTCAA 


ATTTCCCATC 


AGGTTTCAAG 


TCTGTATAGA 


4380 


CCTGATCAAA GGGCAAATCT 


TTGACTAATT 


CCTCTGTCCT 


AATCAAGGTG 


TCTCCTGTTG 


4440 


CCAGAATCAA TTTTTTCCCC 


TGTGCCTTAA 


GTTTATCCAA 


GGCTGTTTTT 


GCTTCTTTTC 


4500 


TCAAAGGAGT ATGAATGCAG 


AACATTCCAA 


TCAATTCATT 


TTGATAAGCC 


AAGAATAAGA 


4560 


GATTGTAGTG ACTCTTGTAC 


TCTTCAATTA AAGCATTTTG 


TTCTGAACTG 


ATATGAATCT 


4620 


GCTCATCCTG CATCAAGACA 


TAATTCCCAA 


TAAGAACTGG 


TTGGCCATCT 


ATATGAGATT 


4680 


TGATCCCCTT GCTTGCGATA 


TATTGGAGTT 


TCCCATGCAT 


TTCCTCATGT 


TCAATTCCCT 


4740 


CTATCTCAGC TTGCTTGACG 


ATGGCATTAG 


CAATAGGATG 


ATAAATGTGT 


TCCTCAAGAC 


4800 


AGGCACTGAT TCTGAGAATA 


TCTTCCTCAC 


TATAGTCTCC 


AAAAGGTAAC 


ACCTTTTCAA 


4860 


CTATAGGATA ACTAGTTGTG 


ATTGTTCCTG 


TCTTATCAAA 


CAAGAAAGTA 


TCAACTTCCA 


4920 


GATATTTCTC CCTGTTGTGG 


CCTCTGGCTG 


TCATCTCTGT 


GCTGG 




4965 


(2) INFORMATION FOR SEQ ID NO: 144: 









(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3232 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 144: 
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CAGGGGCGTA 


TTACGTGACA 


ATTCAATGTA 


GGCTGTCGCT 


ACTTGCGCCA 


AAACAAGGAT 


60 


TCGATAATGT 


CGGATGATAC 


TAACGATTAA 


ACCGAGCAGA 


AAGGATCCCA 


AAATTCCCCA 


120 


AACTGCAATA 


TGCAAGGTCA 


GAAAGAATGC 


CTTTTGATAT 


AGTGGTAGAT 


ATTGTTCAAC 


180 


AATGGATCAA 


TCCAAAAATA 


GAACCTCCCA 


TCTAGAAATA 


ATACAGTTAT 


TGTAGCACTT 


240 


AAAATCTTCT 


TTGGATAATA 


TCTATTTTTT 


ATTGCCGTTA 


TAAGGATTTT 


TATCATAGAC 


300 


ATAAAATTTC 


TGAAATTTCC 


AAACAAAATA 


TTTTAAAAGT 


TTTGAAAAAG 


AGTTAAGATA 


360 


TTTTTGTAAT ACACAAAGTA AACGCTTACT TATTAAGGAG 


GACATTTTAT 


GTCATACAAA 


420 


ACAAGCAATG 


CAGAAGGTCA 


TGTAGATTTC 


ATCAATACCT 


ATGATTTGGA 


GCCAATGGCG 


480 


CAACAAGTTA 


TTCCTAAAGC 


AGCATTTGGC 


TATATCGCTA 


GTGGGGCGGG 


AGATACTTTC 


540 


ACTTCTTTCC 


AGTGATTTTA 


GCGTCAGGTT 


CTTTTTAGTT 


TTTAAAGATT 


ATCCGTGAAT 


600 


TTCTTGCTTA 


TTTATGATAA 


AATGGGAGTG 


TCGCAAAAAA 


TGACTCATCG 


TATTCAATTT 


660 


TGAGTAAAAC 


TAGGAGGATC 


CCATGTCTAC 


AGAACATATG 


GAAGAACTAA 


ATGACCAGCA 


720 


GATCGTTCGC 


CGTGAAAAAA 


TGGCTGCGCT 


CCGCGAACAA 


GGAATCGATC 


CTTTCGGAAA 


780 


ACGTTTTGAA 


CGTACTGCAA 


ATTCACAAGA 


ATTAAAAGAT 


AAATATGCCA 


ACCTCGATAA 


840 


AGAACAATTA 


CACGATAAAA 


ACGAAACAGC 


TACTATCGCA 


GGACGCTTGA 


TAACCAAACG 


900 


TGGTAAAGGA 


AAAGTTGGTT 


TTGCCCACCT 


TCAAGACCGC 


GAAGGCCAGA 


TTCAGATCTA 


960 


CGTTCGTAAG 


GATGCTGTCG 


GTGAAGAAAA 


CTACGAAATC 


TTCAAAAAAG 


CAGACCTTGG 


1020 


TGACTTCCTT 


GGTGTCGAAG 


GTGAAGTGAT 


GCGTACGGAT 


ATGGGAGAAC 


TCTCTATCAA 


1080 


GGCAACCCAC 


ATCACACACT 


TGTCTAAGGC 


TCTTCGTCCT 


CTTCCTGAGA 


AATTCCATGG 


1140 


TTTGACAGAC 


GTTGAAACAA 


TTTACCGTAA 


ACGTTACCTT 


GACTTGATTT 


CTAATCGTGA 


1200 


AAGCTTTGAA 


CGCTTTGTCA 


CTCGTTCAAA 


AATCATCTCT 


GAAATCCGTC 


GTTACCTTGA 


1260 


CCAAAAAGGA 


TTCCTTGAAG 


TGGAAACACC 


TGTTCTTCAT 


AATGAAGCCG 


GTGGTGCTGC 


1320 


TGCCCGTCCA 


TTTATCACCC 


ACCACAATGC 


CCAAAACATT 


GACATGGTGC 


TTCGTATCGC 


1380 


GACTGAGCTT 


CACTTAAAAC 


GCCTTATCGT 


GGGTGGTATG 


GAACGTGTCT 


ATGAAATTGG 


1440 


CCGTATCTTC 


CGTAACGAAG 


GAATGGACGC 


TACTCATAAC 


CCTGAGTTCA 


CTTCTATCGA 


1500 


AGTTTACCAA 


GCTTATGCAG 


ACTTCCAAGA 


CATCATGGAC 


TTGACTGAAG 


GCATTATCCA 


1560 


ACACGCTGCT 


AAATCAGTCA 


AAGGTGATGG 


CCCAGTCAAC 


TACCAAGGTA 


CTGAAATCAA 


1620 


GATTAACGAA 


CCATTTAAGC 


GTGTTCATAT 


GGTGGATGCT 


ATCAGAGAAA 


TTACTGGTGT 


1680 


CGATTTCTGG 


CAAGACATGA 


CTTTGGAAGA 


AGCTAAAGCT 


ATCGCTGCTG 


AGAAGAAAGT 


1740 


TCCAGTTGAG 


AAACACTACA 


CTGAGGTTGG 


TCACATCATC 


AATGCCTTCT 


TTGAAGAGTT 


1800 
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ACTTTAATCC 


AACCAACCTT 


TGTCTATGGA 


CATCCAGTAG 


CTGTATCTCC 


1860 




AAAAATCCTG 


AAGACCAACG 


CTTTACTGAC 


CGTTTCGAGC 


TCTTTATCAT 


1920 


GACTAAGGAG 


TACGGTAATG 


CCTTTACTGA 


GTTGAACGAC 


CCAATCGACC 


AACTTAGCCG 


1980 


TTTTGAAGCC 


CAAGCTAAAG 


CCAAAGAACT 


TGGTGATGAT 


GAAGCGACAG 


GAATCGACTA 


2040 


TGACTACATT 


GAAGCTCTTG 


AATACGGTAT 


GCCACCAACA 


GGTGGTTTGG 


GAATCGGTAT 


2100 


CGACCGTCTC 


TGCATGCTCC 


TCACTGATAC 


AACAACTATC 


CGTGATGTAT 


TGCTCTTCCC 


2160 


AACAATGAAA 


TAAATTCTTA 


TCCTCTGGGT 


CTTATCAGAG 


GATTTTTTGA 


TTCAAAAAGA 


2220 


GACTGAATTT 


AAGGAGAAAA 


TGAAGTGTAG 


TATATTGAAA 


TTGAAATAGT 


ACACTTTGAT 


2280 


TTCTAAGACA 


TTGTTAGAAA TTGGTTTAAA 


TTCCCTAAGC 


AATTTGTGCA 


TGTTTTATTT 


2340 


CATTTTACGA TAGTACGCTG 


AAACTTTTCA 


AAAAGTACTA 


GAAATTGACT 


TGGATTCCCC 


2400 


AATTGATTTG 


TTCAGATTCA 


CTATAAATAA 


AAAATTAATA 


AGTGGGATAG 


GAAGTTAGCG 


2460 


TCAACTAGGA TAGTATCTTG CTTAAACAGT ATATATGGGA TTGATATAAG 


TCCATAGGTC 


2520 


CTATTAGAGG 


ATGTTCTGGT 


GTCTTATTCA 


CTTGTTTTTT 


ATAGTATTAG 


TAGATAGAAT 


2580 


CAGCAAATAA 


AAACCCAAAT 


CATTCATACC 


TCTCTCAACT 


AGATGTAACT 


TACAAAACCC 


2640 


CTGACCTCAT 


GAGCCACTTT 


CTTCCTCCTC 


ATGAGGTCAG 


TTTTACTTTC 


TGCTGTTCCA 


2700 


GTATCGTTTT 


TCCTCGCTAG 


ATTTCCTCAA 


AAGGGCAGAC 


TCCTCCCTTG 


GTGCGTCACA 


2760 


CGATTTTTTC 


ATCTCGACTG 


TTCTTTAATG 


CATCATTAAC 


GACGCTTTTC 


TTCTAGGTGG 


2820 


TTCATAAGGA 


ACAGGAAGAT 


TCAGGTTGAC 


TTTTCTAATC 


CTAGAATAAA 


GTGCTGAAAA 


2880 


CAATTCGGAA 


TAGGCATAGA 


GACTAGACAA 


TTTGAGGAGC 


TGCTTGCGTC 


CTGTTCGAAC 


2940 


ACATTTTCCC 


ACCACGTGAA 


GAAAAAGATG 


GCGGAAGCGT 


TTGATTGTTA 


AAGTTTGGAA 


3000 


GTCACCTCCA 


GCTAGATGTT 


TGAGAAAAAG 


ATAGAGATTG 


TAGGCGATAC 


AGCTCATCAT 


3060 


CATACGAACT 


TCGTTTTTGA 


TTAAGGTTGA 


ACTATCCGTT 


TTATCGCCAA 


AAAATCCCTC 


3120 


CTTCATCTCC 


TTGATGAAAT 


TCTCGGCTTG 


ACCACGTCCA 


CGATAAAGCT 


GAAACTGGTC 


3180 


TTGGCTTGTT 


CCACTCGTCA 


TATTTGTAAC 


GAGAGAAATA 


ACATCGTAGA 


AC 


3232 


(2) INFORMATION FOR SEQ ID NO: 145: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10711 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 
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(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


145: 






CCGGAGAAAA TGATGAAAAG 


1 TTCAAAACTA 


, TTTGCCCTTG 


1 CGGGCGTGAC 


ATTATTGGCG 


60 


GCGACTACTT 


1 TAGCTGCATG 


CTCTGGATCA 


. GGTTCAAGCA 


. CTAAAGGTGA 


GAAGACATTC 


120 


TCATACATTT 


ATGAGACAGA 


CCCTGATAAC 


CTCAACTATT 


TGACAACTGC 


TAAGGCTGCG 


180 


ACACAAATAT 


TACCAGTAAC 


GTGGTTGATG 


GTTTGCTAGA 


AAATGATCGC 


TACGGGAACT 


240 


TTGTGCCGTC 


TATGGCTGAG 


GATTGGTCTG 


TATCCAAGGA 


TGGATTGACT 


TACACTTATA 


300 


CTATCCGTAA 


GGATGCAAAA 


TGGTATACTT 


CTGAAGGTGA 


AGAATACGCG 


GCAGTCAAAG 


360 


CTCAAGACTT 


TGTAACAGGA 


TTAAAATATG 


CTGCTGATAA 


AAAATCAGAT 


GCTCTTTACC 


420 


TTGTTCAAGA 


ATCAATCAAA 


GGGTTGGATG 


CCTATGTAAA 


AGGGGAAATC 


AAAGATTTCT 


480 


CACAAGTAGG 


AATTAAGGCT 


CTGGATGAAC 


AGACAGTTCA 


GTACACTTTG 


AACAAACCAG 


540 


AAAGCTTCTG 


GAATTCTAAG 


ACAACCATGG 


GTGTGCTTGC 


GCCAGTTAAT 


GAAGAGTTTT 


600 


TGAATTCAAA 


AGGAGATGAT 


TTTGCCAAAG 


CTACGGATCC 


AAGTAGTCTC 


TTGTATAACG 


660 


GTCCTTATTT 


GTTGAAATCC 


ATTGTGACCA 


AATCCTCTGT 


TGAATTTGCG 


AAAAATCCGA 


720 


ACTACTGGGA 


TAAGGACAAT 


GTGCATGTTG 


ACAAAGTTAA 


ATTGTCATTC 


TGGGATGGTC 


780 


AAGATACCAG 


CAAACCTGCA 


GAAAACTTTA 


AAGATGGTAG 


CCTTACAGCA 


GCTCGTCTCT 


840 


ATCCAACAAG 


TGCAAGTTTC 


GCAGAACTTG 


AGAAGAGTAT 


GAAGGACAAT 


ATTGTCTATA 


900 


CTCAACAAGA 


CTCTATTACG 


TATCTAGTTG 


GTACAAATAT 


TGACCGTCAG 


TCCTATAAAT 


960 


ACACATCTAA 


GACCAGCGAC 


GAACAAAAGG 


CATCGACTAA 


AAAGGCTCTC 


TTAAACAAGG 


1020 


ATTTCCGTCA 


GGCTATTGCC 


TTTGGATTTG 


ACCGTACAGC 


CTATGCCTCT 


CAGTTGAATG 


1080 


GACAAACTGG 


AGCAAGTAAA 


ATCTTGCGTA 


ATCTCTTTGT 


GCCACCAACA 


TTTGTTCAAG 


1140 


CAGATGGTAA 


AAACTTTGGC 


GATATGGTCA 


AAGAGAAATT 


GGTCACTTAT 


GGGGATGAAT 


1200 


GGAAGGATGT 


TAATCTTGCA 


GATTCTCAGG 


ATGGTCTTTA 


CAATCCAGAA 


AAAGCCAAGG 


1260 


CTGAATTTGC 


TAAAGCTAAA 


TCAGCCTTAC 


AAGCAGAAGG 


AGTCCAATTC 


CCAATTCATT 

^» win x x wi i 1 




TGGATATGCC 


AGTTGACCAA 


ACAGCAACTA 


CAAAAGTTCA 


GCGCGTCCAA 


TCTATGAAAC 


1380 


AATCCTTGGA 


AGCAACTTTA 


GGAGCTGATA 


ATGTCATTAT 


TGATATTCAA 


CAACTACAAA 


1440 


AAGACGAAGT 


AAACAATATT 


ACATATTTTG 


CTGAAAATCC 


TGCTGGCGAA 


GACTGGGATT 


1500 


TATCAGATAA 


TGTCGGTTGG 


GGTCCAGACT 


TTGCCGATCC 


ATCAACCTAC 


CTTGATATTA 


1560 


TCAAACCTTC 


TGTAGGAGAA 


AGTACTAAAA 


CATATTTAGG 


GTTTGACTCA 


GGGGAAGATA 


1620 


ATGTAGCTGC 


TAAAAAAGTA 


GGTCTATATG 


ACTACGAAAA 


ATTGGTTACT 


GAGGCTGGTG 


1680 


ATGAGACTAC 


AGATGTTGCT 


AAACGCTATG 


ATAAATACGC 


TGCAGCCCAA 


GCTTGGTTGA 


1740 
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CAGATAGTGC 


TTTGATTATT CCAACTACAT 


CTCGTACAGG 


GCGTCCAATC 


TTGTCTAAGA 


1800 


TGGTACCATT 


TACAATACCA TTTGCATTGT 


CAGGAAATAA 


AGGTACAAGT 


GAACCAGTCT 


1860 


TGTATAAATA 


CTTGGAACTT CAAGACAAGG 


CAGTCACTGT 


AGATGAATAC 


CAAAAAGCTC 


1920 


AGGAAAAATG 


GATGAAAGAA AAAGAAGAGT 


CTAATAAAAA 


GGCTCAAGAA 


GATCTCGCAA 


1980 


AACATGTGAA 


ATAACTGTTG CAAAATATAA 


GAAAGGATTT 


AGTATTTCCC 


TTGAATGCTG 


2040 


_ AATCCTTTTT 


TACATTTGTA AAGAAAGATT 


CTAAAATGTA 


CGGACCCCCA 


AAAGTTGGAG 


2100 


CCTCTTTTTG 


TCAGAATAGA GAAAATTTTT 


GTTAATTTTA 


CTTGTTTCCT 


ATTGCTTTCT 


2160 


CAGCTATTAT 


TTGTTATATT AAAAGTATAA 


TTATTTTTTA 


TTTATCAGAG 


TTAAGCATTG 


2220 


CACTTTCAGA 


GGAAGGAGTA TTTTTTAAAA 


AGAAAATGTA 


AACGTTTGCT 


CAAAAATGAA 


2280 


AGGATTTAGA 


AGTTTATGAA TAAAGGATTA 


TTTGAAAAAC 


GTTGTAAATA 


TAGTATTCGG 


2340 


AAATTTTCAT 


TAGGTGTTGC TTCTGTTATG 


ATTGGAGCTG 


CATTCTTTGG 


GACAAGTCCG 


2400 


GTTCTTGCAG 


ATAGCGTGCA GTCTGGTTCC 


ACGGCGAACT 


TACCAGCTGA 


TTTAGCTACT 


2460 


GCTCTTGCAA 


CAGCAAAAGA GAATGATGGG 


CGTGATTTTG 


AAGCGCCTAA 


GGTGGGAGAA 


2520 


GACCAAGGTT 


CTCCAGAAGT TACAGATGGA 


CCTAAGACAG 


AAGAAGAACT 


ATTAGCACTT 


2580 


GAAAAAGAAA 


AACCGGCTGA AGAAAAACCA AAAGAGGATA AACCTGCAGC TGCTAAACCT 


2640 


GAAACACCTA 


AGACGGTAAC CCCTGAATGG 


CAAACGGTAG 


CGAATAAAGA 


GCAACAGGGA 


2700 


ACAGTCACTA 


TCCGAGAAGA AAAAGGTGTC 


CGCTACAACC 


AACTATCCTC 


AACTGCTCAA 


2760 


AATGATAACG 


CAGGCAAACC AGCCCTGTTT 


GAAAAGAAGG 


GCTTGACCGT 


TGATGCCAAT 


2820 


GGAAATGCAA 


CTGTTGATTT AACCTTCAAA 


GATGATTCTG 


AAAAGGGCAA 


ATCACGCTTT 


2880 


GGTGTCTTTT 


TGAAATTTAA AGATACCAAG AATAATGTTT 


TTGTCGGTTA 


TGACAAGGAT 


2940 


GGCTGGTTCT 


GGGAGTATAA ATCTCCAACA 


ACTAGCACTT 


GGTATAGAGG 


TAGTCGTGTT 


3000 


GCTGCTCCTG 


AAACAGGATC AACAAACCGT 


CTCTCTATCA 


CTCTCAAGTC 


AGACGGTCAG 


3060 


CTAAATGCCA 


GCAATAATGA TGTCAATCTC TTTGACACAG 


TGACTCTACC 


AGCTGCGGTC 


3120 


AATGACCATC 


TTAAAAATGA GAAGAAGATT 


CTTCTCAAGG 


CGGGCTCTTA 


TGACGATGAG 


3180 


CGAACAGTTG 


TTAGCGTTAA AACGGATAAC 


CAAGAGGGGG 


TAAAAACAGA 


GGATACCCCT 


3240 


GCTGAAAAAG 


AAACAGGTCC TGAAGTTGAT 


GATAGCAAGG 


TGACTTATGA 


CACGATTCAG 


3300 


TCTAAGGTCC 


TCAAAGCAGT GATTGACCAA 


GCCTTCCCTC 


GTGTCAAGGA 


ATACAGCTTG 


3360 


AACGGGCATA 


CTTTGCCAGG ACAGGTGCAA 


CAGTTCAACC 


AAGTCTTTAT 


CAATAACCAC 


3420 


CGAATCACCC 


CTGAAGTCAC TTATAAGAAA 


ATCAATGAGA 


CAACAGCAGA 


GTACTTGATG 


3480 
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AAGCTTCGCG 


ATGATGCTCA 


CTTAATCAAT 


GCGGAAATGA 


CAGTACGCTT 


GCAAGTTGTA 


3540 


GACAATCAAT 


TGCACTTTGA 


TGTGACTAAG 


ATTGTCAACC 


ACAATCAAGT 


CACTCCAGGT 


3600 


CAAAAGATTG 


ATGACGAAAG 


CAAACTACTT 


TCTTCTATTA 


GTTTCCTCGG 


CAATGCTTTA 


3660 


GTCTCTGTTT 


CTAGTAATCA 


AACTGGTGCT 


AAGTTTGATG 


GGGCAACCAT 


GTCAAACAAT 


3720 


ACGCATGTCA 


GCGGAGATGA 


TCATATCGAT 


GTAACCAATC 


CAATGAAGGA 


TTTGGCTAAG 


3780 


GGTTACATGT 


ATGGATTTGT 


TTCTACAGAT 


AAGCTTGCTG 


CTGGTGTTTG 


GAGTAACTCT 


3840 


CAAAACAGCT ATGGTGGTGG 


TTCGAATGAC 


TGGACTCGTT 


TGACAGCTTA 


TAAAGAAACA 


3900 


GTCGGAAATG 


CCAACTATGT 


AGGAATCCAC 


AGCTCTGAAT 


GGCAATGGGA 


AAAAGCTTAT 


3960 


AAGGGCATTG TTTTCCCAGA ATACACGAAG GAACTTCCAA 


GTGCTAAGGT 


TGTTATCACT 


4020 


GAAGATGCCA ATGCAGACAA GAACGTTGAT TGGCAAGATG 


GTGCCATTGC 


TTATCGTAGC 


4080 


ATTATGAACA ATCCTCAAGG 


TTGGGAAAAA 


GTTAAGGATA 


TCACAGCTTA 


CCGTATCGCG 


4140 


ATGAACTTTG 


GTTCTCAAGC 


ACAAAACCCA 


TTCCTTATGA 


CCTTGGATGG 


TATCAAGAAA 


4200 


ATCAATCTCC 


ATACAGATGG 


TCTTGGGCAA 


GGTGTTCTCC 


TTAAAGGATA 


TGGTAGCGAA 


4260 


GGCCATGACT 


CTGGTCACTT 


GAACTATGCT 


GATATTGGTA 


AGCGTATCGG 


TGGTGTCGAA 


4320 


GACTTCAAGA 


CCCTAATTGA 


GAAGGCTAAG 


AAATATGGAG 


CTCATCTAGG 


TATCCACGTT 


4380 


AACGCTTCAG 


AAACTTATCC 


TGAGTCTAAA 


TACTTCAATG 


AAAAAATTCT 


CCGTAAGAAT 


4440 


CCAGATGGAA 


GCTATAGCTA 


TGGTTGGAAC 


TGGCTAGATC 


AAGGTATCAA 


CATTGATGCT 


4500 


GCCTATGACC 


TAGCTCATGG 


TCGTTTGGCA 


CGTTGGGAAG 


ATTTGAAGAA 


AAAACTTGGT 


4560 


GACGGTCTCG 


ACTTTATCTA 


TGTGGACGTT 


TGGGGTAATG 


GTCAATCAGG 


TGATAACGGT 


4620 


GCCTGGGCTA 


CCCACGTTCT 


TGCTAAAGAA 


ATTAACAAAC 


AAGGCTGGCG 


CTTTGCGATC 


4680 


GAGTGGGGCC 


ATGGTGGTGA 


GTACGACTCT 


ACCTTCCATC 


ACTGGGCAGC 


TGACTTGACC 


4740 


TACGGTGGCT 


ACACCAATAA 


AGGTATCAAC 


AGTGCCATCA 


CCCGCTTTAT 


CCGTAACCAC 


4800 


CAAAAAGATG 


CTTGGGTAGG 


GGACTACAGA 


AGTTATGGTG 


GTGCAGCCAA 


CTATCCACTG 


4860 


CTAGGTGGCT 


ACAGCATGAA 


AGACTTTGAA 


GGCTGGCAGG 


GAAGAAGTGA 


CTACAATGGC 


4920 


TATGTAACCA ACTTATTTGC 


CCATGACGTC 


ATGACTAAGT 


ACTTCCAACA 


CTTCACTGTA 


4980 


AGTAAATGGG 


AAAATGGTAC 


ACCGGTGACT 


ATGACCGATA 


ACGGTAGCAC 


CTATAAATGG 


5040 


ACTCCAGAAA 


TGCGAGTGGA 


ATTGGTAGAT 


GCTGACAATA 


ATAAAGTAGT 


TGTAACTCGT 


5100 


AAGTCAAATG 


ATGTCAATAG 


TCCACAATAT 


CGCGAACGTA 


CAGTAACGCT 


CAACGGACGT 


5160 


GTCATCCAAG 


ATGGTTCAGC 


TTACTTGACT 


CCTTGGAACT 


GGGATGCAAA 


TGGTAAGAAA 


5220 


CTTTCTACTG 


ATAAGGAAAA 


GATGTACTAC 


TTCAATACGC 


AGGCCGGTGC 


AACAACTTGG 


5280 
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ACCCTTCCAA 


GCGATTGGGC 


AAAGAGCAAG 


GTTTACCTTT 


ACAAGCTAAC 


TGACCAAGGT 


5340 


AAGACAGAAG 


AGCAAGAACT 


AACTGTAAAA 


GATGGTAAAA 


TTACCCTAGA 


TCTTCTAGCA 


5400 


AATCAACCAT 


ACGTTCTCTA 


TCGTTCGAAA 


CAAACTAATC 


CTGAAATGTC 


ATGGAGTGAA 


5460 


GGCATGCACA 


TCTATGACCA 


AGGATTTAAT 


AGCGGTACCT 


TGAAACATTG 


GACCATTTCA 


5520 


GGCGATGCTT 


CTAAGGCAGA 


AATTGTCAAG 


TCTCAAGGGG 


CAAACGATAT 


GCTTCGTATT 


5580 


CAAGGAAACA 


AAGAAAAAGT 


TAGTCTCACT 


CAGAAATTAA 


CTGGCTTGAA 


ACCAAATACC 


5640 


AAGTATGCCG 


TTTATGTTGG 


TGTAGATAAC 


CGTAGTAATG 


CCAAGGCAAG 


TATCACTGTG 


5700 


AATACTGGTG 


AAAAAGAAGT 


GACTACTTAT 


ACCAATAAGT 


CTCTCGCGCT 


CAACTATGTT 


5760 


AAGGCCTACG 


CCCACAATAC 


ACGTCGTGAC 


AATGCTACAG 


TTGACGATAC 


AAGTTACTTC 


5820 


CAAAACATGT 


ACGCCTTCTT 


TACAACTGGA 


GCGGACGTCT 


CAAATGTTAC 


TCTGACATTG 


5880 


AGTCGTGAAG 


CTGGTGATCA 


AGCAACTTAC 


TTTGATGAAA 


TTCGTACCTT 


TGAAAACAAT 


5940 


TCAAGCATGT 


ACGGAGACAA 


GCATGATACA 


GGTAAAGGCA 


CCTTCAAGCA 


AGACTTTGAA 


6000 


AATGTTGCTC 


AGGGTATCTT 


CCCATTTGTA 


GTGGGTGGTG 


TCGAAGGTGT 


TGAAGATAAC 


6060 


CGCACTCACT 


TGTCTGAAAA 


ACACAATCCA 


TATACACAAC 


GTGGTTGGAA 


TGGTAAGAAA 


6120 


GTCGATGATG 


TTATCGAAGG 


AAATTGGTCA 


CTCAAGACAA ATGGACTAGT 


GAGCCGTCGT 


6180 


AACTTGGTTT 


ACCAAACCAT 


CCCACAAAAC 


TTCCGTTTTG 


AAGCAGGTAA 


GACCTACCGT 


6240 


GTAACCTTTG 


AATACGAAGC 


AGGATCAGAC 


AATACCTATG 


CTTTTGTAGT 


CGGTAAGGGA 


6300 


GAATTCCAGT 


CAGGTCGTCG 


TGGTACTCAA 


GCAAGCAACT 


TGGAAATGCA 


TGAATTGCCA 


6360 


AATACTTGGA 


CAGATTCTAA 


GAAAGCCAAG 


AAGGCAACCT 


TCCTTGTGAC 


AGGTGCAGAA 


6420 


ACAGGCGATA 


CTTGGGTAGG 


TATCTACTCA ACTGGAAATG 


CAAGTAATAC 


TCGTGGTGAT 


6480 


TCTGGTGGAA 


ATGCCAACTT 


CCGTGGTTAT 


AACGACTTCA 


TGATGGATAA 


TCTTCAAATC 


6540 


GAAGAAATTA 


CCCTAACAGG 


TAAGATGTTG 


ACAGAAAATG 


CTCTGAAGAA 


CTACTTGCCA 


6600 


ACGGTTGCCA 


TGACTAACTA 


CACCAAAGAG 


TCTATGGATG 


CTTTGAAAGA 


GGCGGTCTTT 


6660 


AACCTCAGTC 


AGGCCGATGA 


TGATATCAGT 


GTGGAAGAAG 


CGCGTGCAGA 


GATTGCCAAG 


6720 


ATTGAAGCTT 


TGAAGAATGC 


TTTGGTTCAG 


AAGAAGACGG 


CTTTGGTAGC 


AGATGACTTT 


6780 


GCAAGTCTTA 


CAGCTCCTGC 


TCAGGCTCAA 


GAAGGTCTTG 


CAAATGCCTT 


TGATGGCAAT 


6840 


GTGTCTAGTC 


TATGGCATAC 


ATCTTGGAAT 


GGTGGAGATG 


TAGGCAAGCC 


TGCAACTATG 


6900 


GTCTTGAAAG 


AACCAACTGA 


AATCACAGGA 


CTTCGCTATG 


TTCCGCGTGG 


ATCAGGTTCA 


6960 


AATGGTAACT 


TGCGAGATGT 


GAAACTTGTT 


GTGACAGATG 


AGTCTGGCAA 


GGAGCATACC 


7020 
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TTTACTGCAA CTGATTGGCC AAATAACAAC AAACCAAAAG ATATTGACTT TGGTAAGACA 7080 

ATCAAGGCTA AGAAAATTGT CCTTACTGGT ACCAAGACAT ACGGAGATGG TGGAGATAAA 7140 

TACCAATCTG CAGCGGAACT TATCTTTACT CGTCCACAGG TAGCAGAAAC ACCTCTTGAC 7200 

TTGTCAGGCT ATGAAGCAGC TTTGGTTAAG GCTCAGAAAT TAACAGACAA AGACAATCAA 7260 

GAGGAAGTAG CTAGCGTTCA GGCAAGCATG AAATATGCGA CGGATAACCA TCTCTTGACG 7320 

GAAAGAATGG TGGAATACTT TGCAGATTAT CTCAACCAAT TAAAAGATTC TGCTACGAAA 7380 

CCAGATGCTC CAACTGTAGA GAAACCTGAG TTTAAACTTA GATCTTTAGC TTCCGAGCAA 7440 

GGTAAGACGC CAGATTATAA GCAAGAAATA GCTAGACCAG AAACACCTGA ACAAATCTTG 7500 

CCAGCAACAG GTGAGAGTCA ATCTGACACA GCCCTCATCC TAGCAAGTGT TAGTCTAGCC 7560 

i CTATCTGCTC TCTTTGTAGT AAAAACGAAG AAAGACTAGT ATTTAGTAAA ACCTCTTAAC 7620 

AAGATTACGG AAGCAGTCTC TATCTTTTCC AATGAGGTTT ATAGTACAGA AAAAGCCTGA 7 680 

GAAGATGTCT TCTCAGGCTT TTGTTAAGCA CATAAATACA ATAGTGCTAT GACAAAATCA 7740 

CCCAGAAAAA TCTGGGTGAT AAATGTTATG GTTGTGCTGG TTGAGGATTC TGATTTTGTT 7800 

GATCAGGGGT TGTATTTGAT TGTTGCGTAT TATTGTTAGG ATTGGTAGTC GTACTATTAT 7860 

TTGTGCTTGG AGTGGTTGAG CTAGACTGTG AAGTTGAACT ATCTGATGAT GAGCTTGAAC 7 920 

TTTCAGTTGA TGGGGGTTGT TGTGGAGCAG GTGAGTTCCA CGTAGAACGA GCACCATTTT 7980 

TAAATACGAA TTCTCCATTT CTGTAGAGCC CCTCTGGTAT ATTCCAATCT TCTGGATTGC 8040 

TTCCTTCAGA CAGGTAGGTC ATCATAGAGC GGTAAACTTT GGCAGCGACC GTAAGGCCAT 8100 

TGCCTACAAG TGGTGTCAGA CGGTTAGAAT AGCCTGTCCA TACAGCCATT GAATATTTAC 8160 

GCGTATAGCC AGCAAATAGT TCATCAGGTG CTACAAATTG AGAGGTCTTG ATGTGGTTTT 8220 

CAATTTCCTC GTCTGTATAG TTAGAGGTTC CTGTTTTACC AGCCTGAGGG AGCCAAGCAA 8280 

GATAGGCATT TCGTCCAGTT CCATAAGTCA AGACTGTTTT CATCATGTCG GTCATCATAT 8340 

AGGCTGTCGT TTCCTTCATG GCACGAGTTC CGACATTAGA GAACTCTTTT TCACTCCCAT 8400 

CACTAAAGAC GACTTTATGG ATATACATTG GTTTATAGTA AGTTCCACCA TTTGCAAAGG 8460 

CAGCGTAAGC AGCAGCCATC TTTTCACTAC TTGCTCCATA TTTTTTGTCT GATTCGGTTG 8520 

TGTTACTTGA AATGGCATTT GAGTAGTGAA TACTTGGGTA GTCGATTCCT AGACCATTTA 8580 

GGAAAGTCTT GGCGCGGTTG AGTCCGACCT TGTTTAGAGT TTCCACGGCT GGGACGTTTC 8640 

GCGATTGTTG CAGGGCGTAT TGCAAGGTGA TGTTGCCAAA GTAGCCCCTA TCCCAGTTAT 8700 

AAACAGGAGT ATTTGTCCCA GGGTAGTTAT AGGGCTCATC GTGAACGATA GTAGCAGTTG 8760 

AATCGTAGAC ACCGTACTCC AAGGCAGGAG CATAGTCTGT GATCGGTTTC ATAGTTGATC 8820 
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CCCAGTCGCG GTTTGTTTCT ACTGCTTGGT TAATTCCGAA GGAAACATTA CTTGACTGAT 8880 

GGCGTGCTCC TAGCTGGGCA ATGACTTTAC CGTTAGAAAC ATCAACAATG GTAGAAGCGA 8940 

CTTGCAATTC ATCGTCTGGA TAGGCAACGT ATTCGTCTGT ATTGTAAATA TCCCACAGAT 9000 

GTTTTTGAGC TTCTTGGTCT ACATTTGTGT AGACATCCAT CCCAGTTGTG AGTAGGTTAT 9060 

AGCCTGTTTC TTCTTCAACT TGATTGATGA CTTCCTTGAG GTAATTATCC ATGTAAGCAG 9120 

GGTAATTACT TGCTGATTTG AGACTTTGTA GTCCATCAGT AATTGGTGTA TTGACTGCTT 9180 

TCTCATACTG TTCAGCAGAG ATGTAGCCTT GATTTTTCAT TTCAGATAAG ACCAAGTTTC 9240 

GGCGGTCTTG GGCTGCTTCT GGATGTGAAT AGGGGTCATA TTGGTTTGGT GCCTGAGGCA 9300 

TTCCAGCCAG CAAGGCTAAC TGAGGTAAAC TTAAATTATT GAGGTCTTTA CCATAGTAGT 9360 

TTTGAGCTGC TGTCTGCATT CCATAGTTCC CATTAGACAT GTAGACCTTA TTTATATAGT 9420 

AGGTCAAGAT TTCTTGCTTG GTTGCTTTTT GTTCTAACTG AATCGCTAAC CAAGCTTCCT 9480 

GAGCCTTACG AGAAATAGTC TGGTCGGAAG TCGAAGTTGA AAAGTAAGTC AACTTAATCA 9540 

ACTGTTGGGT GAGAGTTGAT CCACCTTGGA GGGAATTGCT TTGCAGATTG CGCAAGAAAG 9600 

CTCCCAGGAT ACGGATGGTA TCAATCCCCC TGTGGTCGAA GAAGCGATGG TCTTCGATAG 9660 

AAACGATTGC CTTAACCAAA TCTGTGGGAA TATCATTAGC TTGGGCATTG ACGCGGCGTT 9720 

CAGAACCCAA GTCAGCAATG AGTTGATTTT TATTGTCGTA GATTTTACTA GAAGTTGTTG 9780 

CAACTAGTTT ACTCTCGGAT AGGCTAGGAG CCTTGCTAAC GTAGTAGAAA AAAACTCCTC 9840 

CGCCTAAGAC AATGGCTGCG ATAACCAAGC TTAAGAAGCT AATGCTCAGA TACTTGATTA 9900 

GGCG CAGAAT CGTTGGTTTG TTCATCTTGT TTTACCACCT AATAAATGTT CTTTGATAAC 9960 

ATTGAGATAA GGAATTTGAG GGAAGGCACC AGCCTTGATT TCATATCCAT ATTCTCGAAT 10020 

ATATTCAAGT GGCATTGATT TTTGTCCCTT ATCTTGATGA TAGAAGCGAA TCAAATCGAA 10080 

TGCCGGCAAT AAGTAGGTTT CTTGCTGAGA AGAAAAGTGA AGAAGGACAA AGCAGATTCC 10140 

TTGTTGGGCA AGGACTTGTT CCATATGCTG AATCTGATGT GGATGAAAAT TTTTCATCGG 10200 

AATCGCACGT TTTTGTTTTG TTTCCTTGAC TTCAAAGTCG ATGTAATATC CATTATAAAC 10260 

GC CAGAAT AG TCCGTCGTTG AAGCTTGTCG AAAATAGGCT TCAACAATCT TGGCACGACT 10320 

TCGTTGTGGA TAGTCCACTT GTACGATTTG AATAGGAGTT GGTTTCTTAT GTATAACAGC 10380 

CAAGCCCTGA GACAAATAGT AGTCGTTGGT AGCATTGATC ATCTTTTCAA AGGGTACCGA 10440 

GCTCGAATTC GTAATCATGT CATAGCTGTT TCCTGTGTGA AATTGTTATC CGCTCACAAT 10500 

TCCACACAAC ATACGAGCCG GAAGCATAAA GTGTAAAGCC TGGGGTGCCT AATGAGTGAG 10560 



iff 
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CTAACTCACA TTAATTGCGT TGCGCTCACT GCCCGCTTTC CAGTCGGGAA ACCTGTCGTG 10620 
CCAGCTGCAT TAATGAATCG GCCAACGCGC GGGGAGAGGC GGTTTGCGTA TTGGGCGCTC 10680 
TTCCGCTTCC TCGCTCACTG ACTCGCTGCG C 10711 
(2) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11887 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 146: 

TACATTCATT CCATCGGCTA CTCCATAATA CTTAGATAAA ACCATAGCTG AAGTCGAATA 60 

CGGATACTGT AAAGTATTAT CAATTTTAAT CAAATCATCA TTACCGATAA TACTTCTGAT 120 

TGCTTTTGGT AGTATGAACC ATACGTTGGT GAAATCTCAG ATAATGAAGA ATCATTAGAC 180 

TCTGGACCTT TTTCTAGTGT CTCACTTACC TCATATTCTT CACCCTTACT AGAAATAACA 240 

CTCAAAGCAG ATACTGTCGA TAACTGGCTA GCCAATAAAG TACTCGCAAT AATTGAAATA 300 

CCCAATTTTT TATAAACAGT TTTCTTCATT ATTGTATCCT CCTAATGTAA TTATAGCGTA 360 

CTATTCTAAA TTTCTTAATC TACTATAGAA TCAAGAAATC TACCACCTTC TTTAAATACC 420 

CTCCATTATC ACATAAACAG GTAAACTTTT CAATTAATGA CTGCGCTTTT CAATCACGCT 480 

AGAGGTACTT GCTTGCTTCT TTGATACTAA GTTCAGCCAT TCTTTCCTTG TTTTTCTCAA 540 

TAAAGCATGT TACCCAAGTG GGATTCGTTT TGGAGTAGTC TCGCAGAGTC CAGCCAATGG 600 

CTTTATTGAT AAAAAATTCT GTTTGGTTCA AGTTATGAAG GAGAATCTTT TCCATTAATT 660 

GAGTATTGGT CTTCTCTTTT CTTAACAACT GGTGGTCAAT AGCGACACGT CTCAGCCAGA 720 

TATTATCTGA TAGGCTCCAT TTTATACTCA ATGAAAATCA AAGAGCAAAC TAGGAAGCTA 780 

GCCGCAGTTG CTCAAAACAC TGTTTTGAGG TTGCAGATAG AGCTGACGTG GTTTGAAGAG 840 

ATTTTCGAAG AGTATTAAGA TTATTTCTTC TAGTTCAGGG TGTTCATACA CCAAACTCCC 900 

TACTACTCGA TCTAGGATAT CTACCGTGTC CCACAAGGAT TTTGTCACGA CTAACTGCTC 960 

TAGCTTAGGC AAATCGGTTT CCTTTAGATA AGACTGCATT GCTTTCAAAT AGTTAGCAGC 1020 

CACATATTGG TATTTTCTAG GATCCTTTTC CCAGCAAGTG TCTGCAAAAT CCCAATCGAT 1080 

AATCTTTGTT TTTTTCGCTT CTGGAAAATA TTTTATAGAG TTTATTTCTT TCAGGCACCG 1140 

CAATACCTAG AAAAGAAAAT TGATGGCGCA TATAGGCTTC CATGGACCTT GCTTTTTTAG 1200 

AGTCTTTTGC TGCTTCTAGC TCCTCAAGTA AATCTGCTAA ACTCATCTAA AACTCCTCTT 1260 
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GCCCCACCAA 


ATGGTGCTGA 


AAGGCATAGA 


CAGCCGCCTG 


GGTACGATCG 


CTGACTTCAA 


1320 


GTTTGGCAAG 


AATATTGGAC 


ACGTGGGTCT 


TGACCGTCTT 


GAGAGAGATA 


AAGAGGTCAT 


1380 


CTGCGATGCG 


CTGATTTTCG 


TAGCCCTTGG 


CGATGAGTTG 


GAGAACATCT 


CGCTCACGCG 


1440 


CAGTCAATTC 


TTCATGAAGT 


TCCATATGAT 


TGCGGTGGTA 


TTCAACCTTC 


TTGCTAACCT 


1500 


CTTGCTCAAT 


GGCCAGCTCG 


CCAGCAGCTA 


CCTTACTGAC 


GGCATGAAGC 


AATTCATCTG 


1560 


CACTAGAAGT 


CTTGAGCATA 


TAGCCTTTGG 


CACCAGCATC 


TAAGACTGGC 


ATGATTTTTT 


1620 


CATTGTCCAA 


ATAAGAGGTC 


ACAATCAAAA 


TCTTGGCTTC 


AGGCCATTCT 


TTAAGGATTG 


1680 


CTAAGGTCGC 


GTCAATCCCA 


TTCATCTCAG 


GCATGACAAT 


ATCCATGACA 


ATGACATCTG 


1740 


GACGCAGTTC 


CAAGGCCAAG 


TCAATCCCTT 


GAGACCCGTT 


GGACGCCTCA 


CCCACAACTT 


1800 


CTACATCGTC 


TTGGAGGTCA 


AAGTAGCTTT 


TCAAGCCCAA 


TCGGACCATT 


TCATGGTCAT 


1860 


CTACTAGTAA 


AATTTTCATC 


TTTACTCCTT 


TATCATTCCT 


TATCTAACAG 


GGGAATACGG 


1920 


ATATCAACCG 


CCAGCCCTTG 


CTTGGGAGCT 


GTCAAGAGTT 


GAACTGTTCC 


AGCCATATCT 


1980 


TCAACCCGCT 


CCTTGATATT 


TCGCAGTCCA 


TAACTCAAGT 


CGTCTAAGCT 


CCCTAACTGG 


2040 


AAACCAATCC 


CATTGTCCAC 


CACCTTCAGT 


TGCAATTCAA 


CATCTGTCTG 


ATAGAGGTAG 


2100 


ACATCTAGGC 


AAGATGCCTG 


GGCATGGCGG 


AGGGTATTGC 


TAATCAACTC 


TTGCAGGATA 


2160 


CGGAAGATAT 


GCTCCTCGAT 


TTTCTTAGGC 


AATTTCGTCA 


TATTCTGCTT 


GAGACTAACC 


2220 


CTAAGATCAC 


TCTTGTCCTC 


AAGCTCTTTT 


AAAAGAATTT 


GAATCCCTTC 


TATCAAGCTC 


2280 


TTCTGCTCCA 


GTTCAACTGG 


TCGCAAATGC 


AAGAGCAAAA 


CCCGCAAATC 


CTTCTGGGCT 


2340 


GTTTCTAAAA 


TAGCTGTGAC 


ACTCTGCAAC 


TGGGTCTGCA 


TCTTTTCTCT 


ATCCAATTTC 


2400 


AAAGCCTGCT 


GACTGATACC 


CGATAAAATC 


ATGTGGGCCG 


CAAACAACTC 


CTGACTGACT 


2460 


GTATCGTGCA 


AATCCCGAGC 


AATTCGCTTC 


CGTTCCTTCT 


CGATGATTTC 


CTCTTCCTGA 


2520 


GCAAGGCTCT 


GATTTTCAGC 


TTTTTGAAGA 


GCCTCTGTCA 


AAAGGTTAAG 


TTTACCTGAT 


2580 


AAGGACTTGA 


AACTGGCATC 


CAAATCTGGA 


TCTGCAACCT 


GAACCACTTC 


TTGCCCTGCT 


2640 


AATAAACGCT 


TGAGATTAGC 


CTGCATTTTT 


CTTAGAGAAA 


GCTCTTCGAT 


CCCTCGCCAA 


2700 


AACAGGGCTA 


AGAGACAGGT 


CATGGACATG 


CTGAAAACCA 


ACAATAAAAA 


GACAAATTTT 


2760 


TCTGTTTTTT 


CGACATCGTG 


CAAAAAGATA 


GACCAGTCAA 


AATCAAGTAT 


TTCCAGCAAG 


2820 


CTGTGGGAGA 


AAAAAAAGAC 


AAATAGGAAG 


GAGGTGAGAG 


CAATAATGAC 


ATAGGCTTGT 


2880 


TTTTTCATCC 


TCTAACCACC 


TCCACATCAC 


CAATCATAGT 


GGTCAAGAAA ATCTTGACAC 


2940 


TCTTGTTACT 


CTTGAGATAG 


TCTTTTGTTT 


CTTGATGATA 


GTGTTCATTG 


CGGAGGGCTC 


3000 
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GCTTGGGCTG 


GTTGAAAAAA 


ATCAAATCCC 


CATAGAGACA 


GTTAACGCTG 


AGACTGACTT 


3060 


CCACATCTAC 


AGGTACGATG 


ATTTTGGTCG 


TTCCTACCAT 


CTTTCTGAGG 


ATAATGACAT 


3120 


TGTCATGATT 


GGTTAAGATG 


ACCCTCTCCA 


GATGAATAGT 


GTCCTTGCCC 


ATGAAGCGAA 


3180 


AGAGATTGAT 


ATCATCGAAT 


TGGCAAGTCT 


GGTAGCTTGA 


AAAATGATGA 


AGATTTCCAA 


3240 


ACCAACGATT 


TTTCTCCTTC 


TTAACCGTCA 


CGACCTCTTC 


AAAAACCAAA 


TTGGTCTGCT 


3300 


CTTTTTCCTG 


GTTCATCATC 


GGGTAAAGAA 


GAAAGAGGCT 


ATAGATAACC 


GCAACAAAAA 


3360 


TAGCTAGAAT 


CACAAAAGGA 


TTGAGCATAA 


CGATGAAAAA 


GAAGAGAATG 


GTTGCCGCTA 


3420 


CTAAAAGAAG 


ATTATTTCCC 


TCTTTACCAG 


TGTAGTAGCG 


AATCAAAAGC 


AAAAAGAGGA 


3480 


ATAGTATCAG 


CAGAAAACGC 


GAAAAATGCT 


CTGATACCAT 


CAAAATCAGA 


GCTCCTGTCA 


3540 


GAAGACAGGC 


TTCGATAAAT 


AAAAAGATTT 


TAAATTTTCT 


CATAGGTTCA 


TCCTCTCCCT 


3600 


TCTATTTTAT 


CACAATTCAA 


AAAAGTCACC 


TCAGTCTGAG 


GATGGAAAAA 


AGGCGCTGGT 


3660 


TACGCCTTTT 


TCATCTGATC 


CTTTGCTTCT 


TTTAATTTTC 


CATAAAGAAG 


ATAGTCTACT 


3720 


TTTTGTAGAT 


CTGCTATGGT 


GGCACAGTTA 


AGGGAACACA 


TAATCAAGCG 


TAGATCTGCT 


3780 


TTCCAGCCTT 


GGACAATGCC 


AATCACTTCT 


TCAACTGTGT 


AGGTTTCAAC 


CAATTCCAGA 


3840 


ACGGTTCGTG 


ACAATCCCAC 


AGCCTTAGCA 


CCAAAAACCA 


AGCACTTAAT 


CATATCCAGC 


3900 


GGATTCCGAA 


CCCCTCCACT 


AACCAAGAGT 


TCGACCTTAT 


CTTTCCATTC 


TTGGGCATTG 


3960 


AGAAGGGCCT 


GCATGGTAGA 


CTGACCCCAT 


TGATTGAGGT 


AATCACGCTG 


GCCACTACGA 


4020 


CGGTTTTCGA 


TATAGGCAAA 


GCTGGTGCCA 


CCACGACCCG 


ATAGGTCCAC 


TGTACGAACA 


4080 


CCGAATTCAT 


AGGCTCTTTC 


GATTGTCTTG 


GCATCCATTC 


CAAAGCCCAC 


TTCCTTGAGG 


4140 


ACAATAGGAA 


CGGGAATTTG 


CTTGCTATAA 


TCTGCTAGAT 


GCGATTGCCA 


GCTTCTAAAC 


4200 


TTCCTTTCTC 


CCTCGGGCAT 


GAGTAATTCC 


TGCATGACAT 


TGACATGCAC 


TTGCAATAGA 


4260 


ACAGGATTCA 


TCTCTTCTAC 


AGTCTGAAGT 


CCTAACTCGA 


CAGGCTTGTC 


CAATCCAATA 


4320 


TTGGTTCCAA 


GGAGGAGATT 


GGGATGACTA 


GACTTGACAG 


AAAAAGAATC 


ATCCGTTGGA 


4380 


TTTTTGAGGG 


CTGCGCTATA 


AGAACCCGTT 


ACAAATAAAA 


TACCACAGGA 


TTCCGCCACC 


4440 


TGAGCCAGCT 


TTTGATTGAT 


TTCTCTTCCC 


TTATTACTTC 


CACCAGTCAT 


GGCATTGATA 


4500 


TAAAAAGGAA 


AGTCCCACTT 


TCGACCAGCA AACTCTGTCG 


AAAGATCGAT 


TTCATCCAGA 


4560 


TTGTAAAGAG 


GCAAGGAAGA 


ATGAATCAGC 


TCCACCTCAT 


CAAAGCTATT 


ATAGGAACTT 


4620 


TTCTGCTCAA 


GGGCATAGAG 


GATATGCTCG 


TCCTTACGAT 


TTGTCGTCAT 


GTCCTATCCT 


4680 


TTCTTGATAT 


AAGAGCTCAA 


TCCCCAGATC 


GGCCCAACGA 


TTTTTTAAGG 


TTTTGGTTGA 


4740 


TTGCGCATCA 


AAACTCAGGG 


CGATGCCACA 


GTCACCACCA 


CCAGCACCAC 


TACTCTTGGC 


4800 
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AACGGTCTGC 


AAATCTTGAC 


TGGCTTCTTT 


CAACTGTCTA 


AGCAAAGGCG 


TGTAAATATC 


4860 


TGTACTCAAG 


CCTTCTAAAA 


GCTTGCTGGC 


TACTTCTACT 


TGATCGATAA 


TCTTTTCTGA 


4920 


TTTCCCCTGT 


TCCAAGGCTT 


CTACCAGAGA 


AGTCACCGTT 


TCTTTTGAGG 


AAGTTAAAAA 


4980 


ATTTTGATTG 


ATATTTTGCT 


TGATTTGCTG 


GACCATGTGA 


CTCGATACAG 


CCACTTCCTT 


5040 


GGTCCATCCC 


ACTAAGAAAT 


CACATTCTAA 


AGTTGGTTTC 


ACTTGTGAAA 


TTGAAAAGCC 


5100 


CCAATCACGC 


TCCAGAACTG 


TCGCCAAGTT 


TTCTTCTTCT 


AACCAAGCAG 


CCACCTTCTG 


5160 


GCGATCAAAT 


GACTGGTAGA 


GAACCAAATC 


CTCTGCCACA 


ATACAGGCAA 


GGTCGCCCAT 


5220 


GGAACCATTG 


TCTCCTCGCT 


TAAGCAAGAC 


AGCGCTAGTC 


AGCTTGAACA 


AGAGCTCCTG 


5280 


ATCAACAGAA 


ACATCATACA 


GAGCCAGTAA 


AGCCTTGACA 


ACCAAGACAA 


CGACGCTGCC 


5340 


ACTAGAACCT 


AGACCAAACT 


TTTTCCCTTC 


TCGTTCCATT 


TTGCCACAGA 


TTTCTAGAGA 


5400 


AAAAGGTCTT 


AAATTCTGAC 


CACGAACAGC 


GAGGAAGTCT 


CCCATCAAAG 


CAATCGTTTC 


5460 


TTGAATCAAG 


CTATAGTCAG 


GATTAGGCCT 


TAAGTCCACT 


GCGAAATCAA 


ACATATCTGA 


5520 


ATAGATACGG 


TAGCTGTCAG 


AAAAAGCAAT 


CTCAGCCCTC 


ATATAGATGG 


GAATATCCTT 


5580 


TATCAAAGCT 


AACTGCCCTG 


GCTCTAAAAT 


AGCATATTCA 


CCTGCCCAAT 


AGAGTTTTCC 


S640 


GCAAGTTTTA 


ACAGCAATCA 


TCTTGACTCA 


AATCCTTTGT 


TTTTGACACA 


ATCAAGCGAT 


5700 


AACGATGACC 


GAAAATTTCT 


GATAAATGCT 


CCAAGTCTTT 


CTCCTGACAG 


AAGACCTTAA 


5760 


CATTGGGACC 


AGCATCCATG 


GTAAAGTAGC 


AGGCCTCTCC 


TTTCTCACGA 


AGCTGGCGAA 


5820 


CAAAGGCCAT 


AGCCTCATAA 


GAGGCATCCG 


TCAGATAAGA 


AAAGGCTGGA 


CTAGCAGTCT 


5880 


TTGTCGTAGC 


ATGCATAGCC 


AGGGCATTTT 


TCTCCGTTAA 


TTCTCCAATC 


TTGGCAAAAT 


5940 


CATTTTCCTT 


GAGATAAATC 


AGCATATCCT 


GATAGTCCTT 


CTCAGACTGA 


CGAACCCAGT 


6000 


CGTCGAAAGT 


CGTCGAGGTT 


TCCACACAAA 


GTTTCATCCC 


GTCACGGCTA 


GAGATTGGTT 


6060 


TTTTCTTGTC 


CTCTAGCACC 


AACATAATCA 


TAGCTAGTTT 


CAAGTCTGTC 


TCTACAGGGT 


6120 


AAATTTCTCC 


ACTATCCTTA 


TCCCAGGCTC 


CTAGTGGTCC 


ATAAAAACTC 


CGAGAAGAAG 


6180 


AACCTGAGGC 


AAATTTGGCT 


TCCTGTGCCA 


ACTGACTTCT 


ATCCAATCCA 


AGCTTGAAAT 


6240 


AAGCATTACA 


AGCCTTGACC 


AGGGCGGACA 


AACCACTAGA 


ACTTGAGGAC 


AGACCCGCTG 


6300 


CCGTAGGCAT 


ATTGTTTTGA 


GTATCGATAC 


GGACAAAGCC 


CTCACCAGCT 


GGACGATAAC 


63 60 


GGTCAATAAT 


CTTACTCATC 


TTGGCATGCT 


CGACCTCATT 


TTGTAGCTGA 


CCATTGATGT 


6420 


AAAATTCGTC 


AGCTGTTACA 


TTGGCTGGTA 


AAGGCGACAA 


GGTCGTCTCT 


GTATACATAT 


6480 


TTTCCAAAGT 


TAGAGAAATA 


CTGCTAGTAG 


CAGGCACCAT 


CTCTTTTTCT 


TTTTTCTTTC 


6540 
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CCCAATATTT GATAATAGCA ATATTTGCGT AGGAACGTAC TGTTACAGGC TCTCTATCCA 6600 

TGTCTGAACA GCTCCTTTCT CTTCTAATCT TTCTGCTAGT TCTTGTGCGT GTGTCAAATT 6660 

GGTTACCAAG GCTATGATAC AACCTCCTAG CCCACCACCG CTCATCTTGG CACCCAGAGC 6720 

ACCATGGCTA AGAGTCGTTT CAACCAAAAA GTCTGCCTCA GGGCTACTGA CTCCAATTTC 6780 

TTTTAAATGT AAATGCGCTT GACTGAGGAT TTGTCCCAGT CCTTCAGCAT CTTTTTGTGA 6840 

AATCGCAACT TCTGCTTGCT GGGTTAATTC TCCCAAGGCA TGCAAAAACG GTAGGGCATC 6900 

CTTGCCCTTA TTTTGAACCA CTTGGATGGC TTCACGAGTA TGACCATAAA CACCCGTATC 6960 

GGCAATCACC AAATAGGCGG ATAAATCCAT CTCAAGTTCT GTAAATCCTA CGTTCTTGAT 7020 

AAAGCGAATA GGTTGGTCAC TAAGACAGGT CTTAGCATCC AAACCACTAG GATTCATATG 7080 

GGCAATCATT TCAGCTCGAT TGACCAAGAT TTCTAGTACA TCATGAGGCA GATCAGCCTG 7140 

ATAGTAGTCA AATACTGCAC GAATGGCCGC TATGCTGATA GCCGCTGACG AACCCATCCC 7200 

CCGTTTCTCA GGGATAGCCG AGTCAATCTC ACAACGAATG CAGGCTTCTG TGATATTCAA 7260 

ATACTCCAGT GAGGCATAAA CCGCCATGGA CAAGGTATCC TCCTCATAAA GGCGCCAAGG 7320 

ACTCTCTGCA GGAACTACCT TACAGGTCAC CTCCACCTCC AAAAGAGGCA GGGAAATGGC 7380 

AGGATAACCG TAAACGACCG CATGTTCCCC TATTAAAATT ATCTTACTAT GTGCCTGACC 7440 

GACACCAACT TTTTTTGTCA TTTTTTCCTT TTACTAGACG AAAAAACGTC TTATTTTTCA 7500 

TACAAGTATT AATTCTTTCC TATCTATTTT ATTATATTTT CACAAAAAAA GCGATTGTTT 7560 

CCATTCACAA TCGCTTCTTT CATTATTGAA CCCATTCGCC ATTATAGTTG ACAGAATAGC 7620 

CATCTACGGT CGTATTCACT GCCAAGGCAC CTGAGCGCTA TAAGCGTAGT ACCATCTGCC 7680 

ATTGACCTGG AACCAACCTG TCGTCATAGA ACGACGAAAG AAACTCCATA CCATTAAGTA 7740 

AAGAGGAAAG TCGTGAGGGA GCATGCGCCA TTGACAACCT GTTTTAGTGA CGTACAAAGT 7800 

CTCATTAACA AGTACTCGTT TCGGCCATTT ATAGGTGCGG TGTTTGGAGA AATAGGGTTC 7860 

AATCTTCGCC CATTCTTGAT CGTTTAAATC AGTATCATAT GCTTTGCGTA TCATAACTCT 7920 

AGCTTAACAT TTTTTTGTGA ATACAGGTTC TAAATAATCG ACCACGAAAA TTTCTTAAGT 7980 

GGAAAACGCC TTATGAAGTA TGCTACGGGA AAGTTATGCA CTTAATTTGA CAATTCAAGA 8040 

TGTAAAAATA TATACTATAG TAGATTGAAA CTAGAATAGT ACACCTCTAC TTCTAAAATA 8100 

TTGTTAGAAA TCGATTTGAC TGTCCTGATC GATTTATCCT GTTATTATCT CATTTTACTA 8160 

TAATATTTGA TAAGTTATCC TAAAAGTATT ATTATGTTGT TGTGTTATAG ATTGATTGAA 8220 

TCTAACTAAA GGATCCTATT CAATTACTAG AACTATCACA TACTCAAGGT CAGCTCACAG 8280 

ATGAGCAACT ATTTTGGTTA CAATGTCTAC TAAATTTAAG TCAAACAAAT AATTTAGTCA 8340 
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AAATTAAAAA AATAGAGGAA CATAAATATG ATTACAAAAC 


AGAATGTAAT 


AGTGTTCTAC 


8400 


AATTTTTACT 


AGATAAAACT 


GTAAATTCTG 


AAGGAAGGAT 


CACTTCTTCA 


ACAGAATTTG 


8460 


GAAATTTCGT 


AAGTAATTTA 


TCATTCCAAC 


ACGGAATAGC 


TGGACTACTG 


TTTCCTCTAA 


8520 


ATAAATTGTA 


CCCCCCAGAA 


CTGGATTCTA 


AAATACTCTC 


TATCATCAAG 


AAGGCAGTGA 


8580 


CAATTAGAAC 


GACACACACA 


TATGAATATC 


AATACTCACT 


GCTATTTGGT 


GATGCAGGCT 


8640 


ATCTATGGTT 


ACTCCTACAT 


TTATTTTCTA 


TCAGTAAAAA 


TCAATACTAT 


CTACAATTAG 


8700 


CAAACGTCAC 


CGCTAAAAAA TTAATAGAGA ATTATGATAC 


TCTAGAGGAA 


ATAGACTTTG 


8760 


CATTGGGAAA 


ATCTGGTGTC 


L.IAI lAILAl 


TAATAAAATA 


CTATCAATTT 


ACCAATGACA 


8820 


ATACTCTTAA 


AATTTTCATC 


CACAATAGTA 


TAGGGGAAAT 


TTATCATTAT 


TTCCTACAAA 


8880 


GAGATACAGC 


CAAAGAAAGC 


ATTTTAGACT 


ATAGCTTTGC 


TCATGGATAT 


TGTGGAATTG 


8940 


CATATGCTTT 


ATTTGCCTAT 




TAGAACCTTC 


TATGTTTTAT 


AATGATCTCC 


9000 


ATACATTCCA 


TACTGAATTA 


/UwinAl 1A1 


1 AuAAAAAb 1 


TACTTCTAAT 


ACTGAAAATT 


9060 


TAGGAAATTT ACAACTTTCT 




unnl 1 1 


AATAATCTTA 


TATCTTTGTA 


9120 


TGTACGATTG 


TGACGGAAAC 




1 1 rVu 1 Ann 1 M 


TCAAGAATTT 


GTTTTTAACC 


9180 


ATCATCTAAA 


AATGATGACA 


GGATATTGCC 


ACGGAATAAC 


TAGCTTACTA 


CAAACCACTG 


9240 


TCTACAATCA 


AAACAAATTA 


CTGATGAAAA 


AAATCCAACA 


GGTAATTTTA 


GCATGTTCTG 


9300 


AACGAGATGA 


TCACGGTTTA 


CTGATGTTTC 


AAGGAGATAG 


TGGTAAAGCA 


GATTTGTTTG 


9360 


ACTTCGGAAT 


AGGAAGCATG 


GGGTATATTG 


GTGTCTATTA 


AATAATAAAT 


TCCCATTCGA 


9420 


TGTGCAGACA 


TAAGGAGAAA 


AGTATGAAAT 


TATTTTGGAC 


AAACAACATA 


TATAGACAGT 


9480 


TGCTGCTAAA 


CAGCTGTTTT 


TCATCATTCG 


GCGACAGTAT 


TTTCTACCTC 


GCCATTATCA 


9540 


ATTATGTGGC 


TCAGTACAAT 


TTCGCTCCGC 


TAGCGATTTT 


ACPGATTTCC 


ATTTCAGAGA 


9600 


TGGTTCCCCT 


ACTATCGCAA 


CTCTTTCTCG 


GGATTCTAGG 


AGATTTTCAA 


GAAAATAGAG 


9660 


TCAAACACGC 


ACTCTGGATT 


GCCAAAATCA 


AAATCCTGCT 


CTACGCTATT 


TTGACAGTAT 


9720 


TTCTCGTCTT 


GTCGCCCTTT 


TCATTAGTTT 


CAGTCATTAT 


GATTGTCATC 


ATCAACCTCA 


9780 


TCTCTGACAC 


CTTGAGCTAC 


CTGTCTGCCT 


ACATGATGAA 


CGCCCTCTAC 


ATCAGTGTAA 


9840 


TTAAGGACGA 


CCTGCATGAT 


GCCATGGGGT 


TCAGGCAGTC 


TCTGATGAGG 


GTTGTCCGTA 


9900 


TTGTCGCCAA 


TCTGGCTGGC 


GCATTCCTTA 


TCAATGTTAT 


AAGTATTCAA 


ACTATTTCCC 


9960 


TTATCAACAC 


TCTGACTTTT 


GTCATTGCCT 


TTTTGGGCCT 


GTATGTTATT 


CGACATACCT 


10020 


TGTATGAGGT 


TGAAAAAAGA 


ATTGAAATGT 


CACATACAGC 


ACTGAGTTTT 


AAGAAATATT 


10080 
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TTCAACATCT 


TAAACAGTCG 


CTGGCTGTGC 


TCCTGAGGTT 


AAAAGATACC 


GTCATACTAC 


10140 


TGTTTCTGAC 


GACCAGTATG 


ATTGCCATCT 


TGGATGTGTC 


CCCTCGGCTG 


ATTGCCCTCC 


10200 


GCTTCATCCA 


ACAGACACTA 


GCACAACTGA 


GCATTGGGCA 


ACTCCTCGCC 


CTGCTCTCCA 


10260 


TCATCATGTC 


TTGTGGAGCT 


ATCCTTGGCA 


ATATGACCAG 


CAGTAATCTA 


TTTAAAAATA 


10320 


TCCGTTTCAC 


GCACCTCTTG 


GTTTTCTGTG 


AGATTTCCCT 


ATTGACTCTA 


ATAACTAGTA 


10380 


TCCTTTGTCA 


AGCCTATATC 


GTAATTTTCA 


TGACCAGTTT 


CATCAGTTCT 


ACGATTATCG 


10440 


GCATTCTCAG 


CCCTCGCCTA 


CAAGCAGCTG 


TCTTTGCCCA 


TATCCCCAGT 


GACAAGATGG 


10500 


GGACGGTGGG 


CTCTGCTCTG 


AGCACAGTGG 


ACATTCTCGC 


CCCGTCCCTG 


CTCTCCCTAT 


10560 


TAGCCCTATC 


CATAGCATCG 


GGCGTTTCGG 


TGCAGTTAGC 


ATTGATATTT 


TTGTATCTTA 


10620 


TTTTAATTGC 


TCTTATCTTT 


TGTCAATGGT 


TAGTCAAGTT 


CAACACTCAT 


AACTAACGAA 


10680 


AAAGCATGTG 


TAGATTTCAC 


ATGCTTTTAA 


TCTCCCCAAT 


CGTCAGGTCA 


AGTACAACAA 


10740 


AGTCACTTCT 


TTGATTAAGC 


GAGTGTTCTA 


ATATAATTAT 


AAGCGCCCTG 


TCATTACCGA 


10800 


ACCCATTCGC 


CATTATAGTT 


GACAGAATAG 


CCATCTACGG 


TCGTATTCAC 


TGCCAAAGCA 


1086.0 


CCTGAGCTAT 


AAGCATAGTA 


CCAGTTGCCA 


TTGACCTGGA 


ACCAACCTGT 


CTTCATGTCT 


10920 


CCATTACCTG 


CATTTAGGTA 


GTACCAAGTT 


GAACCATCTT 


GATACCAACC 


AGTTGCCATA 


10980 


GCTCCTGATG 


AACGGAGATA 


GTACCATTTG 


TTCCCAAGGT 


TTTGCCAACC 


TGTTTTCATA 


11040 


TCGCCATTTG 


GGTGGTCTAA 


ATAATACCAA 


GTGGTACCTT 


CCTGATACCA 


GCCAGTGGCC 


11100 


ATTGCTCCTG 


AGGAACGGAG 


GTAGTACCAC 


TTATTACCTA 


GATATTGCCA 


ACCTGTTTGC 


11160 


ATAATACCAG 


TTGTTGGATC 


TAGGTAGTAC 


CAAGTCGAAT 


CATCGTTTAT 


CCACCCCGCA 


11220 


CGTCTTTCAC 


CACCAAGGTA 


GTTTTCTCCA 


TTAATTTCCG 


TCTTAGCTAG 


ATAATACCAG 


11280 


TTAGACTGAT 


CATAAAGCCA 


ACCTGTCTCT 


AAAGAATGAT 


TTTGATTAAA 


GTAATAGTTC 


11340 


GTATAATAAC 


GCTTCTCTTC 


TTTATCTTCT 


GAATCTTCAC 


GTTTTTCCCC 


GTACTTTCTT 


11400 


CCAACACTGT 


CTTTAGTTTT 


AATCTCTAAT 


GTTTTCCAAC 


CAACAAACTC 


TTGTAGCACT 


11460 


CCATTTTTAT 


CGAAGTAGTA 


CCACTCTGAC 


TTTGGAAAAC 


CTTCTAATCT 


GATACCATTT 


11520 


GGGTAAGGAC 


CAATTGTACT 


ACCTTTAGAT 


GGAAACGGGA 


TATATTGCCA 


GCCGACAACC 


11580 


ATCTCTCCAG 


ATAGAGAATC 


AAAATAATAG 


TACTTACCAT 


CAATCACTCG 


CCAGTAGGTT 


11640 


TCTTTGAGGT 


CCCCCTTTTT 


GTAGTAGGTT 


CTTCCGTTTT 


CTTGGACAAA 


CTGCCATCCT 


11700 


TCAGAATCAT 


CTGCAAATAC 


TGTACTGGTC 


CCTAGCAAAC 


CAAAGAAAAA 


TACTGTCAGT 


11760 


CCAACTTGCA 


TAGTTTTTTT 


CAAAATTTTC 


ATCTATATAC 


CCTCCAATAT 


TAAATCCACT 


11820 


CACCAGATGA 


GGCGAAATTA 


TAAACTTTAC 


CATCGATAGT 


TTGGCTACCT 


GTAACCATTG 


11880 
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CTCCAGG 



(2) INFORMATION FOR SEQ ID NO: 147: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11340 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



11887 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 

CCGGTATGTT CTGGAATACT ACCAATCTAA GCTGGCTGTG CCCTACAGTT TTACAACCCT 60 

GTACGAATAC CTTAAGGAAT ATGACCGATT TTTCAGCTGG GTTTTGGAGT CTGGTATTTC 120 

AAACGCTGAT AAAATATCCG ATATTCCTTT ATCAGTTTTG GAAAATATGT CTAAGAAAGA 180 

CATGGAATCC TTTATCCTTT ATCTACGTGA ACGTCCCTTG CTGAATGCTA ATACAACAAA 240 

ACAAGGTGTT TCACAGACAA CTATCAATCG AACCTTATCA GCACTTTCTA GTCTTTACAA 300 

GTATCTAACC GAGGAGGTTG AAAACGATCA GGGGGAACCT TATTTCTATC GTAATGTAAT 360 

GAAAAAAGTT TCCACCAAGA AAAAGAAAGA AACCCTTGCT GCCAGAGCTG AAAATATCAA 420 

GCAAAAACTC TTTCTAGGTG ATGAAACAGA AGGTTTTCTA ACTTATATCG ATCAAGAGCA 480 

CCCACAACAG CTTTCAAATC GAGCTCTCTC ATCATTCAAC AAAAATAAAG AACGAGATTT 540 

AGCCATTATT GCCCTTCTCT TGGCATCTGG TGTTCGCTTA TCTGAAGCTG TTAATCTAGA 600 

TCTAAGAGAT CTCAATCTAA AAATGATGGT TATTGATGTT ACTCGAAAAG GTTGCAAACG 660 

TGACTCAGTC AATGTCGCTG CTTTTGCTAA ACCTTATTTA GAGAATTATC TGGCCATTCG 720 

GAATCAACGC TATAAAACGG AAAAAACAGA TACAGCCCTT TTTTTAACTC TCTACAGAGG 780 

TGTTCCTAAT CGTATCGATG CTTCTAGCGT TGAGAAAATG GTTGCTAAAT ACTCAGAGGA 840 

TTTTAAAGTG CGTGTAACAC CCCATAAACT GCGCCATACA CTAGCAACTA GGCTCTATGA 900 

TGCGACTAAA TCACAAGTTT TAGTCAGTCA CCAACTAGGA CATGCTAGCA CACAAGTCAC 960 

TGACCTCTAT ACCCATATTG TTAGTGATGA ACAAAAGAAT GCTCTGGATA GTTTATGATT 1020 

TTACGTATTT TAAATTATGT AAATAAATAT CAAAAAAAGA AGTTGGCCAA CTTCTTTTTG 1080 

ATTTATCCAA CTACCGCTTC AGCGATTTCT TCACGGCTAA TACCAGCGAA GTAGCGTGTG 1140 

ATATCAATGG TTTTTAGCGC CTTAAGAACA TCTTCGCGTT CGTATTTCAC CCCACGAAGG 1200 

ACATCTTCTA CTGCAGCAAC GTCTTCAATA CCAAAGAAGT CACCATAAAT CTTGATGTCT 1260 

TGGATTTTTG ATTCAGTAAC GTTAGCAAAG ACTTCAACCT TACCACTAGT GAATTTGATT 1320 
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CCACGACGGA CGTTAAATTC AGGTGATTTA CCATAGTTCC AGTCCCAAGT TCCAAACTTA 1380 

GTATCCTTGA TGCGATTGAT TTCGGCCAAT TCTTCTTCTG AAAAGACGTA TTCAGTCATC 1440 

TCTGGGTACT CTTTTTTCAT GTATTCCAAG AGTAAATCAC GGAATTTTTC GACTGTGATT 1500 

TTTTTTGGTA ATTCATTGAT AATATTGGTT ACACGGGCAC GGACGGATTT CACACCTTTT 1560 

GATTCAAATT TATCTTTTGA AACCTTAAGG GCATTTGCGA GGACTGACAA ATCAACGTCA 1620 

AAGAGCAAGC AACCGTGGTG CATGATACGG CCGTTGATAT AGGCTTGGGC ATTGCCACAG 1680 

AACTTCTTAC CATCAATCTC AAGGTCATTA CGACCTGTGA ACTCAGCTTT AACCCCAAGT 1740 

TGAGCCAGGG TATTGATAAC CGGAGTTGAG AAGCTCTTGA AGTCAAATGC CTTATTTTCA 1800 

TCTTCTTTGG AGATGATCGT GTAGTTGAGG TTATTTAAAT CGTGGTAAAC AGCTCCACCA 1860 

CCACTAATAC GGCGAACTAC CTCAATACCA TTTTCGCGAA CATAATCACG GTTGATTTCT 1920 

TCGATAGTGT TCTGGTGACG ACCAACAATG ATAGATGGCT TGTTAATCCA AAGTAGGAAG 1980 

ATTTGATCCT CATCCAAAAG GTGTTTAAAG GCGTATTCTT CCAAGGCAAT ATTAAAAGCA 2040 

GTGTCATTTG AATGATTGAT AATGTATTTC ATGATATCCC TTTACTTTAT ATGATAGAAA 2100 

CTGGAAATAA CCTTCCAGTC TAATCTATCT TCGTTTTATT TTTTCTTAGG TGAATGGATG 2160 

GCCATTCCTA GAACATCTGC AAACGCTTCG TACATCACTT CAGAGTAAGT TGGGTGCCCG 2220 

TGGATGGTCT TCAGCATTTC CTCAACAGTG ATTTCCATTT CGATGATGCT TGATGCTTCG 2280 

TTTATTAATT CTGCGGCTGC AGGACCAATA ATGTGTACAC CAAGGATTTC TCCGTATTTC 2340 

TTATCAGCGA TAACTTTTAC GAAACCTTGA GCTGCGTCAG ATGCAATAGC ACGACCGTTA 2400 

GCAGCAAAGT TAAACTTACC GATGGCAACA TCGTATTTCT CACGGGCTTG TTCTTCTGTC 2460 

AAACCTACTG CTGCTACTTC AGGGAGAGTG TAGATGGCTG CAGGAGTCAA ATTCAATTTG 2520 

GCAACTGCAT GATTTCCTTT AAGGGCATTT TCAGCGGAAA CTTCACCCAT GCGGAAAGCT 2580 

GCGTGAGCCA ACATCTTAGT ACCGTTGATG TCACCTGGTG CATAAATGCC TGGAACTGAA 264 0 

GTTTCCATGT ATTCGTTGAC CTTGATACAA CCACGATCCA ATTCAAACTC AACCTCTCvJA 2700 

ATACCTTCAA GGTCTGGCAT ACGACCAATT GAAAGAAGAG CTTTGCTTGC GATGATATCG 2760 

TCTTTTCCTT CAACCTTGAT ACGAAGTTGA CCATTTTCCT CAATGATTTC TTGCAGTTTA 2820 

GTACCAGTCA AGATGGTCAT TCCTTTACGC TCAAGAATCA AGCGAAGGTT CTTAGAAACT 2880 

TCCACATCCA TAGCTGGAAC TATACGGTCC ATCATTTCGA TAACAGTCAC TTTTGAACCA 2940 

AATGTCATGA AGGCCTGACC GAGTTCGATA CCGACAACTC CACCACCGAT GATAACAAGG 3000 

CTTTCTGGCA CTTCGTTCAT TTCAAGAATG TCATCACTAG TCATGACAAG TGGAGATTCC 3060 

ATACCAGGGA CGTTGATCTT GTTGACTTTT GAACCACCAG CAAGAATGAT TTTCTTGGTT 3120 
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TCAAGCAATT CAGAACCATT TACCAAGACG TTCTTGTCTT TAGTGATTGT ACCAATTCCT 3180 

TTATGAACAG TAACTCCGTA GCTACGAAGA AGTCCTGCAA CACCACCAAC AAGAGTATTA 3240 

ACAACTTTAG ATTTAGTTTC TAAAAGTTTT TCCATATCAA CAGTGAAGTT AGGATTTTCA 3300 

ATCACGATAC CACGATTTGC AGCATGACCG ATATTTTCAA TAATTTCAGC GTTATGAAGG 3360 

TAGGTCTTGG TTGGAATACA TCCACGGTTT AAGCAGGTTC CACCAAGTTC AGATTTCTCA 3420 

ACAAGGGCAA CCTTACCGCC GAATTGGGCA GCTTTAATGG CTGCAACATA ACCAGCAGGA 3480 

CCTCCACCAA TCACAACGAT ATCAAAAGCA TCATCGCTCT TACCATCATC GTTTGAGGTA 3540 

CTTGCTACAG GTACAGGGCT AGCTTCTGGC GATGCTGCTC CAGCTGTTGG GATGTTTTCC 3600 

CTTTCTTCAC CAAGGTAACC GATAACTTCC GTTACAGGGA CAGTTTCACC ATCTCCTTTG 3660 

AGAATGGCAA TCAAGTACCC ATCTTCTTCG GCTTCCAATT CCATGCTGAC TTTATCAGTC 3720 

ATGATTTCCA AAAGGATTTC TCCTTCTTTT ACAAATTCTC CGACTTTTTT ATTCCATTGG 3780 

ACGATTTGTC CTTCTGTCAT ATCCACGCCG GCTTTTGGCA TAATTACTTC TAAGGCCATG 3840 

TCTTCCTTCC TTTATCTATA TCTTAAAAAT GAATACTCTT GCTCTTAAAT TAACATTGAG 3900 

ATTGGCGTTT CAATCAACTC TTTCAAGTCC TTCATAAACT TAGCACCAGC CATACCATCT 3960 

ACGACACGGT GGTCAATGGT TAATCCTAAA CTCATGATTG GGCGAATCAC AATTTCACCA 4020 

TTGACGACAA CTGGCTTCTC GATTGTCGAA CTGACACCAA GGATAGCTGA GTTGGGTTGG 4080 

TTAATAATCG GACCAAAGGA CTGAACACCA AACATTCCCA AATTACTGAT TGTGAATGTT 4140 

GAATTTTGTA ACTCACTTGG AGCCAATTTA CCATCCAAGG TACGGCCAAT AACATCCTTA 4200 

AAGGCTACAA CCAGTTCTGA AAGACTCATC TTCTCAGCAT TGTAAACAAC AGGTGTCATC 4260 

AATCCATTAT CCATCCCAAC TGCCATGGCA AGATTGACAT AGTTGTGAGT GATAATAGTC 4320 

TTGCCATCTT CTGTCAATGA AGCGTTGATG TATGGGTGTT TCATAAGAGT CTTAACAACT 4380 

GCAAGCGAAA GAAGGTCTGT TACAGTAGTC TTCTTCCCAG TTGCTTCCAT GATTGGCTCA 4440 

AGAACCTTCT TACGAAGAGC CAACATTTCA GTCATATCAA CTTCATAGTT GAGGGTGAAG 4500 

GTTGGCGCAG TCAAGTAAGA TTCAACCATG CGTTGGGCAA TAACCTTACG CATTGGTGTC 4560 

ATTGGAATAC GCTCGATTTT ACCATATGGT GTTACGTTAT CAGGGACTTC TTCCACTTTT 4620 

TCAATCTGAG CAGGAGATTT GATGCTATCG TTTTCGATAT TTTCAGGAAG CAGGGCCAAA 4680 

ACATCCTTCT TCATGATTTT ACCACGATGA CCGGTTCCTT GGATTTCCTG CCAAGCAATG 4740 

TTATGTTCGA GGGCAATTCG TTTTGCAAGT GGCGAAATGC GAACCACGTT TGTGTCTTTA 4800 

TAAGTTTCCA CGTCTTCTTT GTGGACACGA CCGTTTGCAC CTGAGCCAGA AACGTCGTAG 4860 
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TCAGCCATGA CCTCTCCAAT TCTATTTATG ATACAAAGGG CGTCAAAAGC GACTGAAAAA 4980 

TAGGAAATCG ACGATGGCTT CGATGAAGCC AAGGAGATTT ATCTTTTTTC CGATCTTTTA 5040 

GCCCGTGCTC TAATCTAAGA TATTAATGAC GAAGAGCTCT GCACCTAAAA GATACAAAGT 5100 

TTCTCGTCAG CTTTATTTTA TTTACATAAC TTATCTTATG TAACCCTATT CTTTGTTATA 5160 

AGTTTTTCGG ATTGCATCTT TGATACTTTC AACTGTTGGA ATCATTGCAT TTTCTAGGTT 5220 

TTGTGCATAA GGCATCGGCA CATCTTCTCC TGCACAACGG CGAATTGGTG CATCTAGATA 5280 

GTCAAATGCT TCTGATTCTG AAATAATAGC TGAAATTTCA CCGATATAGC CACTTGTTTT 5340 

GTGGGCATCG TTGACCAGAA CAACCTTACC AGTCTTCTTC ACTGAGTTTA TGATGATATC 5400 

CTTATCAAGC GGAACAAGGG TACGTGGGTC AACAATTTCA ACTGAAATTC CTTCTTCTGC 5460 

TAATTCTTCA GCAGCTTGAA CCACACGGCG AAGCATTTTT CCATAAGTAA CAACTGTTAC 5520 

ATCCGTTCCT TGGCGTTTGA TTTCACCAAC CCCAAGTGGA ATTGTGTAGT CTGGATCAAC 5580 

TGGCACTTCC CCTTTTTGGT TAAATTCTGA CTTGTACTCA AGTATAATAA CTGGGTTGTT 5640 

ATCACGGATA GAAGACTTAA GCAGGCCTTT CATGTCCGCA GGTGTTCCAG GTGCCACAAC 5700 

CTTAAGTCCT GGAATGTGAG TAAACCAAGA CTCTAGAGAT TGTGAGTGCT GGGCGGCAGA 5760 

GCCAACTCCG TTACCAGCTG CACAACGAAC AGTCATTGGA ACCTGACCTT TACCACCAAA 5820 

CATGTAACGT GTTTTAGCAG CTTGGTTGAC GATATTGTCC ATGGCAATAA CAGAGAAGTC 5880 

CATGAAGGTC ATATCGACGA TTGGACGAAG TCCTGTCATG GCTGCTCCTG CTGCTGCTCC 5940 

AGAGATGGCA GCTTCAGAAA TCGGACAGTC ACGGACACGT TCTGGACCAA ATTCTTCAAG 6000 

CATTCCAACA GAAGTACCGA AGTCTCCTCC GAAGACACCG ACGTCTTCTC CCATCAAGAA 6060 

CACATTTTCA TCGCGACGCA TTTCCTCAGA CATAGCAAGG ATAATGGTGT CACGGAAGGA 6120 

CATTGTTTTT GTTTCCATTT TATCTCTTTC TCCTTAGTCT GCGTAAATAT CTTCAAAGGC 6180 

TGATTCAAGC GGTGGGAATG GGCTTTCCTC TGCAAATTTA ACAGAAGCTT CTACTGCTTC 6240 

CTTTACTTGC GCTTGGATTT CTTCCAATTC TTCGGCACTT GCAATGTTAT TTTCAATAAG 6300 

GTAATTGCGG AGGTTTTCGA TTGGATCTTT TTGTTTCCAC AATTCCACTT CTTCACGCGT 6360 

ACGATATTTA CCAGGGTCAG ATGATGAGTG ACCGAGCCAG CGATAAGTTA CACTTTCAAT 6420 

CAAGACTGGA CCATTGCCAC TGCGAACATG GTCCACAGCT TTCTGAAATC CTTCATAGAC 6480 

ATCGATGACA TTGTTACCGT CTTCGATGAA CATTCCAGGA ATTCCATAAG CGGCGCTACG 6540 

TTGATGGATA TGTTCTATAT TGGTCATTTT CTTGATATCC GCAGAGATAC CGTAACCGTT 6600 

GTTAATGCAA TAGAAAATGA CTGGCAGGTT CCAGATAGAA GCCATGTTCA CTGCTTCGTG 6660 
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GAAAACACCT TCATTGGTCG CACCATCTCC AAAGAAGCAG ACAACGATTT TACCGGTATT 6720 

TTGCATTTGC TGACTGAGGG CTGCACCGAC AGCGATCCCC ATACCACCAC CTACGATACC 6780 

ATTGGCACCA AGGTTCCCAG CATCAAGGTC AGCGATATGC ATAGATCCAC CTTTCCCTTT 6840 

ACAGGTTCCA GTGTATTTAC CAAGGATTTC AGCCATCATT CCGTTGAGGT CAATCCCTTT 6900 

AGCAATAGCT TGCCCGTGTC CACGGTGGTT TGAGGTAATC AGATCATCTG GATTGAGAGC 6960 

TAACATAGCC CCCACGTTAG CTGCCTCTTC ACCAACAGAA AAGTGCGTCA TTCCTGGCAC 7020 

TTTCCCTTTC TTTACTAATT GTGCAATTTT TAAGTCCATG CGACGGATTT CTTCCATCTT 7080 

ACGGAACATT TCTAGCAAAA GATTTTTATC TAAAGTTGAC ATCTTCTTGC CTTTCTAACT 7140 

TTCTTCTTAC CTTACTATTT TACCGCTTTT GGCAAATACT GTCAAAGTTT TTCTAAAAGA 7200 

AATTTCACAA AATAAAAAAG AAAACCCCGT GAAAACAAGG GATTTTCTTG TCAAGAATAT 7260 

TTTTTCACAA ACTTTTTAGC ATTTGGATTT TGCTAAAGAT TCAAATCTCT TCATAATCAC 7320 

AGTTAAACGC CAACGGTAGA GCGCCCCGCT CACAATCAAA CTAATAATCA AGCCGATCCA 7380 

GTAAGAATAA GCTCCAAAAT CTGTTAGGGA ATCAAATAGC GTAnCACAGG GATTGCTACG 7440 

CCCCAATAAC CAAGCAAACC AAGGTAAAAA GGAATAACTG TATCCTTATA CCCCCGCAAA 7500 

ATTCCCTGAA GCGGCGCCGC AAAGGTATCT GCTAACTGGA AGAAAAGACT ATAAGTTAAA 7560 

AAACGCACTG TCAAATCGAT AAATTTTGGG TCGTTACCAT AAAGACTGGC CACATTTCCC 7620 

CTAAAAATGT AAAGGAAGGT TAAGGTGAAG GCCGCAAAAA TGAGGGCAGT CCATCTTCCT 7680 

AGACCAATAT AGGTTTTCGC ATCATCAAAT CGCTTGGCTC CCACTTCATA GGAAACGACA 7740 

ATAGCCATAG CCGATGAGAT ACTCATAGGA AAGGCGTACA TAAGACTTGA AAAGTTCATA 7800 

GCTGACTGGT GACTAGCTAT AATCAAGGGC GAAAACTTAG CCATAATCAA GCCAACCACT 7860 

GAAAAGATAG CCACTTCCGC GAAGACAGTT CCCCCAATAG GCAGACCTAA ACGAACTCCT 7920 

TCCTTAATTT TATCCATATT AAGTGGAATT CGTTTCTCAA GGTGTAAGGC TTTGAGCTTC 7980 

TCCTGTTTAA ATAAAACCAG AACAGAAATC CCAAGCAAGA CCCAGTAGGC CAAGGATGTT 8040 

CCTAAACCAG CACCAGCCCC TCCCAGTTCT GGAACACCAA AGGCACCGTA AATCAAGAGA 8100 

TAGTTAAATC CGCTATTGAG AGGGAGTAAC AAAAGCATGA GGTACATGGA CAGTTTGGTC 8160 

AAGCCCAGCG AATCCAGCAA GGAACGAATG ACGCTAAAGA GCAACAAGGG GATAATCCCG 8220 

ATAGATAAAA ACCAAAGATA GCGAACCGCT ACTGCCGCTA CTGCTGCTTC TAACCCAATA 8280 

TGATTCAAGA TTATTGGTGC CAAGAAAAGT ACCATCCCCA GCAAGACCAC AGATAGGCCC 8340 

AAGGCCAAAT AAATAAATTG GTAAAAATCA GACGCAACTT CTTCCTTTTT GCCTCGACCA 8400 
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AGATGGTGAC CAATGATAGG CACCAAGGCT GACACAATCC CTGTTAGAAA TGTAAAGAAA 
GGATTCCAGA TACTGGTTGC CATAGATACA CCAGCCAAGT CCATAGTGTT GTATTGACCT 
GTCATTGCAG TATCAACAAA AGAGGCAGAA TAATTGGCAA ATTGGTAGAT CAGGATTGGG 
AAGAAAATTT TTAAAAATAA TACTAACTTC TCTCGTAAAC ACTTTGTCTT ATACATACTT 
CTCTTTCTAT TCTGATTTAT CTAAACCAAA GAGTTTCAGA CCATAGTTTT TCAAACTTAG 
CGGAGGTTTA TTAGATTTTG AAGTAGTATG CCAACACGCA CATGTACGAC AATAATAGCT 
' TCTAACTAAA CCTCCGTTAT CATATTGAAC CGCATGGTCA GCTTTTTCTT TAGTTTCATA 
TTGAATTTTG GAACGATTAG CTGCGGGACA GTAAATTCCA CTATTAGATT TCGCTTGTCT 
CTCCCTACGT TTTCGAAAAT AATTCATATT CTAACTCCTA TCAAGCTTGA TAGACGATTT 
GTCCCTTACA GATGGTATAT TTAACCTGCC CTTTTAAGGT TTCACCGATG AATGGTGAAT 
TAGCTGCTTT GGAAGCAAAA TGGGAGTCCA CAAAGCGGTC AGCCTTGGCA TCAAAAATAG 
TGATATCTGC TGGACCATTC TCAGCCAAGT AACCTGCTTC AAAGTTGTAA AGCTTGGCTG 
GGTTGTATGT CATTTTTTCA AGTAATTCCA TCAAGCTCAA CTCACCAGCT TCTACTAAAT 
AGGTCAAGCT GAGAGACAGG GATGTTTCTA AGCCAGTCAT ACCAGATGGC GCTTTGGTAA 
TATCCTCAAC ATTTTTTTCA TCTACATGAT GAGGCGCGTG GTCAGTCGCA ATAACTGTGA 
TGACACCTGA TTTGAGACCT TCGATAACGG CACGACGGTC TGATTCCAAA CGAAGCGGTG 
GATTCATCTT AGCATTGCTA CCTTGTGTTA AAAGAAGTGC TTCTGTCTTA GAGAAATGCT 
GTGGCGCTAC TTCTGCTGTG ACTTCTGCAC CTAACCCCTG AGCAAACTCC ACTACTTTAA 
CACTTTCTTC CTTAGACAAA TGCTGGATGT GAACATGGGC TTTAGTTGCA TAGGCAATCA 
TGACATCACG CGCCATCATA GCGTACTCAG CCACCCCAGT AGCACCGCAG ATATGGAAAT 
GTTCTCTAGC AATATTTTCA TTAAAGCCAA GAACACCGTT CAAACCTGGA TCTTCCTCAT 
GAAGGCTGAT AAAGGTATTG AGTTTTTTGG CTTCCTCCAT GGCTTCCTTG ACAATCTTAC 
TGCTCTCAAG CGGAATACCG TCATCAGAGA AACCAACCGC ACCAGCTTCT AAGAGTGCCT 
TAAAGTCAGT CAAGTTTTTA CCATTAAAGT TTTTAGTAAT GGTCGCAACT GTCTTGACAT 
TAATCTTCTC TTTGGCAGCT GACTGGAGAA CTGCTTGCAA AGTCTCCACG TCTGAAATGG 
TTGGACTGGT ATTAGCCATC ATGACGACAG TAGTAAAACC ACCTGCAGCG GCTGCTAGGG 
CACCAGTATG AATGTCTTCT TTATGTGTTT GACCAGGTTC ACGGAAATGA ACATGAATAT ' 
CGACCAAGCC AGGAGCAACC ACAAGACCAG TAGCATCAAT CGTTTCTGCT CCTTCTTCCG 
TGATCTCAGA CGCAATTTTG ATAATTTTCC CATCTTGAAC TAAGACATCA CAAACTTGAT 
CCAAACCAGA CTTGGGATCC ATTACACGAC CATTTTTGAT TAGTAGCATC TGCTTTCTCC 
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TTTATTCATA GAAATCAACT TGGGTATCCA ACAATTTATC 
CTGAAAAGAA GGGTTTATCC TCTAAAAGCC ACTCAACAAA 
TCGGCTTGCT CAAAACCTCA TCATAGGGAA CCCATTCTAG 
TCAAGTCGCC CTCAAACTCC GTCACCTTAA AAACATAGGT 
TAAATTCAGG AAAAGTGATG ACACCTTTTA GAACTGGCTT 
CAAGGATTTC ACGCGCCGCG CATTCCTGGG GCGTCTCTCC 
CACCAATCCA TTTCCCTTCA TGGACATCAT TGGGTTTCTT 
GTTCTTTCCC ATTATCAATG TAGCAAATCG TCGCTAACTG 
TAAGCCAATC GATTGGCTCT TGTCCTGTCT CTTTTAAGAA 
GCTTGGAACC CCAAAATCCT CTATAAACCG ACAAAGGACT 
TCAAGTGATG AGGATTGGTA ACTAATGCCT TCTTCTTACG 
CAAAAACGAC TGGTCTATCT AGATGATTGA CCACCTGAAT 
CCCAGATTTG ACCAGCATGA CCATTGGCCT GTCCAGCAGG 
GAAGCAAGAC TCCTTGCTCA GCCCAAGCTG TCAAATCATG 
CATCTGACAA TTCTTTCAAG ATATTTTGCA AGGATGGTGG 
CAGAAAAACT CAAGCCCTGC GCTTGACCTG GTCCGTGATA 
CCACCTTAAC TTCTTCAAGC AGTGTTGTCA AGAGAGCCTG 
GATAAATAAT CCCCTGAGAA TAGACCTGCT CCATAAACTG 
CCTCAGGTAA TTGCGCCTTA ATCAAAGCAT GCCAAGACGA 
(2) INFORMATION FOR SEQ ID NO: 148: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12127 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



CCCATCATAA 
GGTGTGGTCA 
CGTCCCCTCA 
GTACCAGTCT 
GGCTTTGAGC 
TCTCTCTAGC 
ATTACGATGG 
AGGCATATTT 
TGCATTGGCC 
TGGATGGGCT 
TGCATAAGCT 
CACAGCATCA 
AACAGTCAAA 
AGATTTCTTA 
AGCTGGGATA 
GGGGTCTTGC 
AAAAACCTTT 
ATTGATTTTC 
GTGTTCCATA 



ACAAACTTGG 
CCTTCCCAAG 
TTGCAGTCAA 
AAATCTGGTG 
CCTGTTTCTT 
TTACCACCCA 
AGCATGAGCA 
TCTCCTTATC 
TTGGAAAAGG 
GATTCGATAA 
CCCCAGAGTA 
GTAAAAGGCT 
CAAGCATTAA 
ACTCCGATAT 
GAGTCAGGTA 
CCTAGAATTA 
TCCTTGGGTG 
CCGAAATAAC 
GCCGACTCGG 



10260 

10320 

10380 

10440 

10500 

10560 

10620 

10680 

10740 

10800 

10860 

10920 

10980 

11040 

11100 

11160 

11220 

11280 

11340 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 
AAAAAATAGA CTTGTTAGAC TATAAATGTA GTAAGCCTAC ACAAGAAAAA TACATAGAGA 
TAAAGGTGAT TATTATGAAA TTCAAAAAAA TGCTTACTCT TGCAGCCATT GGCTTATCAG 
GATTTGGGCT TGTTGCCTGT GGCAATCAGT CAGCTGCTTC CAAACAGTCA GCTTCAGGAA 
CGATTGAGGT GATTTCACGA GAAAATGGCT CTGGGACACG GGGTGCCTTC ACAGAAATCA 
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CAGGGATTCT CAAAAAAGAC GGTGATAAAA AAATTGACAA CACTGCCAAA ACAGCTGTGA 
TTCAAAATAG TACAGAAGGT GTTCTCTCAG CAGTTCAAGG GAATGCTAAT GCTATCGGCT 
ACATCTCCTT GGGATCTTTA ACGAAATCTG TCAAGGCTTT AGAGATTGAT GGTGTCAAGG 
CTAGTCGAGA CACAGTTTTA GATGGTGAAT ACCCTCTTCA ACGTCCCTTC AACATTGTTT 
GGTCTTCTAA TCTTTCCAAG CTAGGTCAAG ATTTTATCAG CTTTATCCAC TCCAAACAAG 
GTCAACAAGT GGTCACAGAT AATAAATTTA TTGAAGCTAA AACCGAAACC ACGGAATATA 
- CAAGCCAACA CTTATCAGGC AAGTTGTCTG TTGTAGGTTC CACTTCAGTA TCTTCTTTAA 
TGGAAAAATT AGCAGAAGCT TATAAAAAAG AAAATCCAGA AGTTACGATT GATATTACCT 
CTAATGGGTC TTCAGCAGGT ATTACCGCTG TTAAGGAGAA AACCGCTGAT ATTGGTATGG 
TTTCTAGGGA ATTAACTCCT GAAGAAGGTA AGAGTCTCAC CCATGATGCT ATTGCTTTAG 
ACGGTATTGC TGTTGTGGTC AATAATGACA ATAAGGCAAG CCAAGTCAGT ATGGCTGAAC 
TTGCAGACGT TTTTAGTGGC AAATTAACCA CCTGGGACAA GATTAAATAA AATGTTTGCT 
CCATAAATCT CTAAAGAGAT GCAGACGTTT CATCGTACAA TAAGATAAAG AAGGCAAGTA 
GGGAGGTGTC GTATCTCCCT TACTTTCTTC ACTAGAAAGG ACAAGATGTG ACAAAACAAG 
CCTTCAAAGA AGCAGTTTTT AGGGCAATTT TTTTCATGAG TGCAACAGTA GCTGTTGTAG 
CTATTTTGCT AATCTGTTTC TTTATTTTTA GTAATGGCTT ACCTTTCATA GCTAACTACG 
GCTTTGCCCG TTTTTTATTA GGCAGTGATT GGTCGCCAAC GAACATTCCG GCAAGCTATG 
GTATTTTACC AATGATCGTT GGTTCCTTAT TAATTACCTT AGGAGCGATT GTGATTGGGG 
TGCCAACAGG CATCTTGACA TCGGTGTTTA TGGTTTATTA TTGTCCAAAG CCCGTCTATG 
GCTTCTTAAA ATCAGCTATC AACTTGATGG CAGCCATTCC ATCTATTGTT TATGGTTTTT 
TCGGCCTACA ATTATTGGTG CCTTGGATTA GAAGCTTTTT AGGAAATGGC ATGAGTGTCC 
TAACCGCTTC GTTACTATTA GGAATAATGA TTTTGCCAAC CATTATCAGT TTGTCAGAAT 
CTGCTATCCG AACAGTTCCC AAAACGTATT ATTCTGGTAG CTTGGCTCTA GGAGCTAGTC 
ATGAACGGAG TATTTTTAGT GTCATCTTGC CAGCTGCGAG ATCTGGTATT TTATCAGCAG 
TTATTTTAGG AATCGGTCGC GCAGTAGGTG AAACCATGGC AGTTATTTTG GTGGCAGGCA 
ACCAGCCGAT TATTCCAAGT GGACTCTTTT CAGGAACCAG AACCTTAACA ACCAATATTG 
TTCTGGAAAT GGCTTACGCA TCAGGTCAGC ATAGGGAAGC CCTTATTGCA ACCTCAGCAG 
TTCTCTTTTT CCTTATTCTC TTGATTAATG CCTACTTTGC CTACTTGAAA GGAAAATCAT 
CTTATGAGTA AATACCTGCT AAAACTTCTC GTTTATTGTT TTTCAGCTTT AACCTTTGGC 
TCTCTCTTTT TAATCATTGG TTTTATCCTC ATCAAAGGCT TACCTCATCT AAGTCTATCC 
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CTCTTTTCTT 


GGACTTATAC 


TTCTGAGAAC 


ATTTCCCTTA 


TGCCAGCGAT 


TATTTCCACC 


2100 


GTTATTCTGG 


TCTTTGGTGC 


TCTTCTTTTA 


GCCTTGCCCA 


TAGGGATTTT 


TGCTGGTTTT 


2160 


TATCTTGTGG 


AATATACAAA 


AAAAGATTCC 


CTTTGTGTTA 


AAATCATGCG 


ATTGGCCTCA 


2220 


GATACCTTAT 


CTGGGATTCC 


TTCCATTGTT 


TTTGGTCTGT 


TTGGCATGCT 


CTTCTTTGTA 


2280 


GTCTTCTTAG 


GTTTTCAATA 


CTCTCTGTTA 


TCAGGAATCT 


TAACCTCAGT 


TATCATGGTG 


2340 


TTGCCAGTCA 


TTATTCGCTC 


AACAGAAGAA 


GCCCTTTTAT 


CTGTTAGTGA 


TAGCATGCGT 


2400 


CAAGCAAGTT 


ATGGACTTGG 


GGCAGGTAAG 


TTACGGACTG 


TTTTTAGAAT 


TGTTCTACCA 


2460 


GTTGCCATGC 


CAGGTATTTT 


AGCTGGAGTG 


ATACTAGCTA 


TTGGCCGTAT 


CGTTGGTGAA 


2520 


ACAGCTGCCC 


TCATGTATAC 


ATTAGGTACC 


TCTACCAATA 


CGCCAAGTAG 


TCTCATGTCT 


2580 


TCAGGCCGTT 


CTCTAGCCCT 


ACATATGTAT 


ATGCTGTCAA 


GTGAGGGGCT 


ACATGTCAAT 


2640 


GAAGCCTATG 


CTACCGGCGT 


GATTTTGATT 


ATTACTGTTT 


TAATGATAAA 


TACTCTATCA 


2700 


AGCTTATTAT 


CTCGAAAACT 


TGTGAAAGGA 


GCTTCCTAGT 


ATGGGAACAT 


TTTCAGTCAG 


2760 


ACACCTAGAC 


TTATTTTACG 


GGGATTTTCA 


AGCCTTAAAA 


AATATTTCGA 


TTCAATTACC 


2820 


AGAAAGACAG 


ATTACTGCCT 


TGATAGGCCC 


ATCTGGTTGT 


GGCAAATCAA 


CTTTTCTAAA 


2880 


AACCCTTAAC 


CGGATGAACG 


ATTTGGTTCC 


TTCTTGCCAT 


ATTGAAGGCC 


AAGTCCTCTT 


2940 


AGATGAGCAA 


GATATTTATA 


GTAGCAAATT 


CAACCTTAAT 


CAGCTACGTA 


AGCGTGTAGG 


3000 


GATGGTTTTT 


CAACAGCCTA 


ATCCCTTTGC 


CATGTCTATC 


TATGATAACG 


TGGCTTATGG 


3060 


CCCAAGGACA 


CATGGTATTC 


GAGACAAAAA 


ACAATTAGAT 


GCCTTAGTGG 


AGAAATCTTT 


3120 


AAAAGGGGCA 


GCCATTTGGG 


AAGAAGTCAA 


AGATGATCTT 


AAAAAGAGTG 


CCATGTCCTT 


3180 


ATCTGGCGGT 


CAGCAGCAAC 


GCCTTTGCAT 


TGCGCGAGCT 


TTAGCAGTAG 


AACCTGATAT 


3240 


TCTGTTAATG 


GATGAGCCGA 


CTTCAGCCTT 


AGACCCTATC 


TCCACTTTAA 


AAATTGAAGA 


3300 


CCTCATTCAG 


CAACTAAAAA 


AGGATTATAC 


GATTATCATT 


GTTACCCATA 


ACATGCAACA 


3360 


AGCTTCACGT 


ATTTCAGATA 


AAACTGCTTT 


TTTCTTAACA 


GGAGAAATTT 


GCGAATTTGG 


3420 


AGATACCGTT 


GACGTGTTTA 


CCAATCCAAA 


AGATCAGCGC 


ACAGAAGACT 


ATATTTCAGG 


3480 


ACGGTTCGGA 


TAAGGAAGGA 


AAAACCTATG 


AGAAATCAAT 


TTGACTTAGA 


ATTGCATGAA 


3540 


TTAGAACAAT 


CCTTTTTAGG 


ACTAGGGCAA 


CTTGTCCTTG 


AAACAGCTTC 


AAAAGCCTTA 


3600 


CTGGCCTTAG 


CCTCCAAAGA 


CAAGGAGATG 


GCAGAGCTAA 


TTATCAATAA 


GGATCATGCT 


3660 


ATCAACCAAG 


GTCAAAGCGC 


TATCGAATTG 


ACCTGTGCCC 


GTTTGTTGGC 


CTTGCAGCAG 


3720 


CCACAAGTGT 


CTGACCTTCG 


ATTTGTGATT 


AGCATCATGT 


CTTCTTGTTC 


AGACCTTGAA 


3780 
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CGTATGGGAG ACCATATGGC AGGCATTGCC AAAGCTGTTT TGCAACTAAA AGAAAATCAA 3840 

CTAGCCCCTG ACGAAGAACA GTTACACCAA ATGGGTAAAT TATCCCTCAG CATGCTAGCC 3900 

GATTTATTGG TTGCCTTTCC TTTGCACCAA GCCTCAAAAG CTATTAGTAT TGCTCAAAAA 3960 

GATGAACAGA TTGACCAATA TTATTATGCC TTATCAAAGG AAATCATTGG ACTTATGAAA 4020 

GACCAAGAAA CCTCAATTCC CAATGGAACT CAATACCTTT ATATCATAGG GCATCTGGAA 4080 

CGCTCGCTGA TTACATTGCT AACATTTGTG AACGCCTAGT CTACCTAGAA ACAGGAGAAC 4140 

TAGTGGATTT GAATTAATTC AACTAATCCT TAAAAGAGAA GAGTACGATT AAGTACTCTT 4200 

TTTTATGGTT GTAAAAAAGT TCATTTGACC AATTTAAGCA GTGTAGATAG TGAGGAGTTG 4260 

TTTCAATTCT ATCGTGAACG AGGGAATGCT GAAAACTTTA TCAAAGAAAG GAAAGCAGGA 4320 

TTCTTTGGGG ATAAGACAGA TAGTTCGACC ATGATTAAGA ATGAAGTACG TATGATGATG 4380 

GGCTGTCTGG CTTATAATCT CTACCTCTTT TTAAAGCAGC TAGCTGGTGA TGAAGTAAAG 4440 

TCCTTGACTA TCAAGCGTTT TCGACGTCTC TTCCTTCATA TTGCCGGAAA ATATGTCTCT 4500 

ACTGCTAGAC GACATATTCT CAAATTCTCA AGTCTATACG CCTATTCAAA ACAGTTTCAA 4560 

GCCTTATTTG ATACAATCTG CCAGATAAAT CTGATACTCC CTGTTCCATA TAGAGCTAGA 4620 

GGGCAGGGGA AAACATGCCT AACAGAATAA GTCACCTTAT TTTAAAAATC GAGCATCAAA 4680 

CCAAGGGAGG AGTCTGCCCT TTTTTAGGAA AAAATCAAGA CAAATCTCCT CAATTATGTC 4740 

TCGAACATCA GAAATTAAGC AAAATCACCA GAAGGACAGT ATTTCAACTA GCTTTTCTGG 4800 

TAATTTTTGA ACTGTGTAGT TCGTTAGTGC CAGATATGAA TAATTTGGGA TGATAAATCT 4860 

TTCTTCCTCA GGTAGCCTAT CATAATACTC TTCAAAAATC TTATCAAAAA CACTCTCTTT 4920 

CTTTTGGGCG ATAGTTTCAT CTTCGTATGT AGGAGTCCTC ATCAAGAAAT ACTTCAATTC 4980 

TAGGTATTCC TTATCCAACT CTATATAACT TGGCATCAAC TTGTAATCTT CAACCCCCAA 5040 

ACGTTCAGCA ATATATTTTA ACTTTGTTAG TATTGGTCTG GATTCTCCAT TTTCAATTCT 5100 

AATTAATTGA CGGATACTTA ATTCAGACTC ATCACCACAA AATTCTGAAC GACTGATTT V 5160 

TTTAGCCAAA CGTAATCTTT TAATTTTTTC GCCAAACTCT CGCAACCTAC AAGAACTTCC 3220 

TGAGTTGTTT ACCTCTATTA TAAGCATATA CTGAATCAAA CTATCTATCA GATTTCTTCT 5280 

CACTTTAACT AAAGACTAAG AGTTTATCCC TTCGTCTCGG TTTTTGTGTA TTTTTCCACC 5340 

ATACCCCAGT AATGCAAGTG CAAAATCCCC TAGAATATGA TAGAATAAGA GAAAGAACTC 5400 

TATCAAGGAG GAAATCATGG AAAAACAAAC CGTCGCCGTC TTGGGGCCTG GTTCTTGGGG 5460 

AACCGCCCTT TCACAAGTCT TAAATGACAA TGGACACGAG GTACGTATTT GGGGAAATCT 5520 

TCCCGAGCAA ATCAATGAAA TTAATACACA CCATACTAAT AAGCACTACT TTAAAGATGT 5580 
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CGTTCTAGAC GAAAATATCA TTGCCTACAC CGACTTAGCA GAAACATTGA AAGATGTGGA 5640 

TGCGATTTTG TTTGTTGTCC CAACAAAAGT GACACGACTT GTTGCCCAGC AAGTTGCACA 5700 

AACCTTGGAC CATAAGGTTA TCATCATGCA CGCATCAAAG GGATTAGAAC CTGATAGCCA 5760 

TAAACGATTA TCAACCATTC TTGAAGAAGA AATTCCTGAA CATCTCCGTA GTGATATCGT 5820 

CGTTGTTTCA GGGCCTAGTC ATGCAGAAGA GACCATTGTG CGTGACCTAA CTTTAATAAC 5880 

. TGCTGCTTCT AAAGATTTAC AAACAGCTCA ATACGTTCAG AAGCTATTTA GTAATCACTA 5940 

CTTCCGACTT TATACCAATA CGGATGTTAT CGGGGTTGAA ACTGCTGGTG CTCTTAAAAA 6000 

TATTATTGCT GTCGGTGCTG GAGCTTTACA TGGTCTTGGA TTTGGTGATA ATGCTAAGGC 6060 

AGCCATCATC GCTCGAGGTT TAGCAGAAAT CACCCGCCTA GGGGTAGCAC TCGGGGCCAG 6120 

TCCATTGACC TATAGCGGCT TATCTGGTGT GGGAGATTTG ATCGTAACGG GAACTTCCAT 6180 

CCACTCTCGT AACTGGAGAG CTGGAGATGC TCTCGGACGA GGAGAATCCC TAGCTGATAT 6240 

AGAAGCTAAT ATGGGCATGG TAATCGAAGG AATTTCAACG ACTCGAGCAG CCTATGAACT 6300 

AGCCCAAGAA CTTGGAGTCT ATATGCCCAT TACACAGGCT ATTTACCAAG TTATTTATCA 6360 

CGGAACCAAT ATCAAAGATG CCATTTATGA CATCATGAAC AATGAATTTA AAGCAGAAAA 6420 

TGAGTGGTCT TAACCCTCTA TAGAAAGGAT TTTTATGACA TCAAAAGTTA GAAAGGCAGT 6480 

CATCCCTGCT GCTGGACTAG GAACTCGATT TTTACCAGCA ACCAAGGCCC TTGCCAAAGA 6540 

AATGTTGCCA ATCGTAGACA AACCAACTAT CCAGTTTATC GTGGAAGAAG CTCTCAAATC 6600 

AGGTATTGAA GATATTCTAG TTGTCACTGG TAAATCAAAA CGTTCTATTG AGGACCACTT 6660 

TGATTCAAAC TTCGAATTGG AATATAACCT CAAAGAAAAA GGGAAAACAG ATCTTTTGAA 6720 

GCTAGTTGAT AAAACAACTG ACATGCGTCT GCATTTTATC CGCCAAACTC ATCCACGCGG 6780 

TCTCGGAGAT GCTGTTTTGC AAGCCAAGGC TTTCGTCGGA AATGAACCTT TTGTCGTTAT 6840 

GCTTGGTGAT GACTTGATGG ATATCACAGA CGAAAAGGCT GTTCCACTTA CCAAACAACT 6900 

CATGGATGAC TACGAGCGTA CCCACGCGTC TACTATCGCT GTCATGCCAG TCCCTCATGA 6960 

CGAAGTATCT GCTTACGGGG TTATTGCTCC GCAAGGCGAA GGAAAAGATG GTCTTTACAG 7020 

TGTTGAAACC TTTGTTGAAA AACCAGCTCC AGAGGACGCT CCTAGCGACC TTGCTATTAT 7080 

CGGACGCTAC CTCCTCACGC CTGAAATTTT TGAGATTCTC GAAAAGCAAG CTCCAGGTGC 7140 

AGGAAATGAA ATTCAGCTGA CAGATGCAAT CGACACCCTC AATAAAACAC AACGTGTATT 7200 

TGCTCGTGAG TTCAAAGGGG CTCGTTACGA TGTCGGAGAC AAGTTTGGCT TCATGAAAAC 7260 

ATCCATCGAC TACGCCCTCA AACACCCACA AGTCAAAGAT GATTTGAAGA ATTACCTCAT 7320 
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CCAACTTGGA AAAGAATTGA CTGAGAAGGA ATAACAAAAT CATTTATATA AAGATTAGCC 7380 

ACACATAAAT TAAGTAAATT CTCTACTTGA ATCTACCTAT TTAATAAAAA CTAATGAAAA 7440 

CGCTATACTT GTATTTGTTT TTTCATTAAA ATAAGAGTAG AATAAATTAG TATAGTAAAA 7500 

CAAAAAAGCA CCGAATCGGT GCGCACTTTT TCAAGTTGTG TACGGACAAA GCCTTATTTT 7560 

AACTTTGCTA TGTTGTTTCT AATGGTTCCA AAATAATAAA TAATTTTAAA TTTGACTTAA 7620 

CTGTTGGAGT AGTCATGGTT AAATTAAATC AACCGAGCCG AACATAAGTT GTTTAATTTT 7680 

GTGGAAGCTA TTAATAAAAA TATAATAAGG GAGAAAGATA GGTGTAATTT TAATTTTAAA 7740 

GTAATTGCGG ACACTATCAA AGAAAAAGAT TATGGAGAAC AAATTTGTAG AATTTATCGA 7800 

AAACAATAAA AAAGTAATCA TTTCATCAGT TGCAGTTGGT GTTGTATTGG TATTAGGGTT 7860 

TGGATGGTAT TCATATAACC AACAACAAGC AGAACAACAA GCAAAAATTG TACAATTAGA 7920 

AAAAGATAGC AAATCAGACA AAGAACAAGT TGATAAACTA TTTGAATCAT TTGATGCATC 7980 

TTCAGATGAA TCTATTTCTA AATTAAAAGA ACTATCTGAA ACTTCACTTA AAACCGATGC 8040 

AGGTAAAGAC TATCTTAATA ACAAAGTCAA AGAATCATCT AAAGCAATTG TAGATTTTCA 8100 

TTTGCAAAAA GGTTTGGCTT ATGATGTTAA AGATTCAGAT GACAAATTTA AAGATAAAGC 8160 

AACTCTTGAA ACAAATGTAA AAGAAATTAC AAAACAAATT GATTTTATCA AAAAAGTTGA 8220 

TGAAACTTTT AAACAAGAGA ATTTGGAAGA AACTCTTAAA TCTCTAAATG ATCTTGTTGA 8280 

TAAATATCAA AAACAAATCG AACTTTTGAA GAAAGAAGAA GAAAAAGCTG CTGAAAAAGC 8340 

TGCTGAAAAA GCAAAGGAAT CTTCTAGTCA AAGTAATTCT TCTGGTAGTG CTTCTAATGA 8400 

GTCTTATAAT GGATCTTCCA ATTCAAATGT AGATTATAGT TCATCTGAAC AAACTAATGG 84 60 

ATATTCAAAT AATTATGGCG GTCAAGATTA TTCTGGTTCA GGAGATAGTT CAACAAATGG 8520 

TGGATCATCA GAACAATATT CATCTAGCAA TTCAAACAGC GGAGCAAATA ATGTCTACAG 8580 

ATATAAAGGC ACTGGTGCTG ACGGCTATCA AAGATACTAC TACAAAGATC ATAATAATGG 8640 

AGATGTGTAT GATGACGATG GAAATTACCT TGGGAACTTT GGTGGCGGCA TTGCAGAACC 8700 

TAGTCAACGC TAATAACTAT TTTAGAGCTG TGTTGTTTCG AATGGTTCCA AAACACATTA 8760 

AAAGCTACTC ATTTTTTAAG TAGCTTTTTT CTTATTCAAG TTTACATATT ATACTCAATG 8820 

AAAATCAAAT TCAAACCACG TCAGCATCGC CTTACCGTAG GTATGGTTAC TGACTTCGTC 8880 

AGTTTCATCT ACAACCTCAA AACCATGTTT TGAGCTGACT TCGTCAGTTC TATCTACAAC 8940 

CTCAAAGCAG TGCTTTGAGC AACCTGCGGC TAGCTTCCTA GTTTGCTCTT TGATTTTCAT 9000 

TGAGTATTAG TCGTCACAAT CCCATTCCCT TGTAGAAAAG CAAAATGGCG AGTCCTACGA 9060 

ACAAGACTAC CGCTCCTAAT CTCTGGCTGG TGTTATACAT CCGTTTTTCT CCTCTAACTG 9120 



WO 98/18931 



PCT/US97/19588 



997 



GAAAGATAAC 


TGCTAGAAAT 


GCGCCACCAA 


CTGCACCACC 


GATATGGCCT 


GCTAGGCTGA 


9180 


TTCCTGGAAT 


CAGAACACTT 


CCAATAATGT 


TAACCACAAA 


AAGTGTCAGA 


TAGGATTGCC 


9240 


CTAGCTGTTG 


GATATAAGGA 


TTGCGAGTTG 


CATAGCGAAG 


AACAATAATC 


GCGGCAAATA 


9300 


GCCCATAAAG 


AGAGGTAGAG 


GCGCCTGCTG 


CTAAGGATTT 


AGGACTAAAT 


ACAAAAACAA 


9360 


AGAGATTGCC 


CATCATTCCT 


GATAAAAGAT 


AGAGAAAGAA 


AAACTGCTTA 


GAACCGAAAA 


9420 


-TCTCCTCTAC 


CTGCCTTCCA 


AGATAATAAA 


GTGAAAGCAT 


ATTAACAATG 


AAATGTTCCC 


9480 


ACCCAATATG 


AACAAAAATG 


GCAGACAAGA 


GACGCCAAAC 


CTGCTCGGGA 


AAGAGGCGAA 


9540 


TAGCTGGCCC 


ATACATGGCT 


CCAAATCGAA 


ATAATGTATC 


TGCCCTGTCA 


AAGTTTCCGC 


9600 


CTGCAGTGAC 


CAACATTAGT 


AAAAATACCA 


AGGCCGTCAC 


TAAGAGGAAG 


AAACTCGTCA 


9660 


CAGGGTAACG 


TCTATCAAAG 


ATTTCCTTCA 


TCAATTAATA 


CCTCCTGAAC 


AGGAATATCA 


9720 


TGGTTTTCAG 


GTATAAAGTC 


CTGAATTTGA 


CAAGGATATA 


TCGTACTCAA 


AGTACGACCA 


9780 


GAAAAATGTT 


CCAGATAGCG 


GTCATAATAG 


CCTCCACCGT 


ATCCTATCCG 


ATATCCTTTC 


9840 


GTCGTAAAAG 


CCAGACCAGG 


AACATGAATC 


AAATCAATCT 


GAGATGCATC 


CACCACTTCC 


9900 


AAATCTCCCT 


GTAGCTCCAG 


TAAGGCAAAG 


AAAGTTTTTA 


CCAACTGTTG 


CGGATCATAG 


9960 


ACCACAAAGT 


CCATGCGCCC 


CTTGGGATAA 


GTTTTGGGTA 


TTAAAACCTT 


CTTGCCGTCC 


10020 


TTCAGCGCCT 


GCTCAATCAG 


TTCCTGCGTT 


TGAAACTCAT 


GAGAAAAAGA 


GAGGTAGGTT 


10080 


GCGATGACCT 


TGGCTTCTTG 


ATAAAAGGGG 


TGTTGTAAAA 


GCCGCTCGGT 


TAAAGCTTGG 


10140 


TCTATAGCCT 


GTTTTTGCTC 


TTGAGATATA 


GCCTTCATTT 


CATGCAAGAC 


TTGCTTGCGT 


10200 


AATTCCGATT 


TCATAGACAA 


GCCCTCTATT 


CTGCTGCCTT 


CTTTTTCAGG 


AAACTAGACA 


10260 


CCGCAGCCAC 


CCCAATAGCT 


AAGACTTCTT 


CCTTAGGACT 


CATTTGAGGG 


TGATGAAGAG 


10320 


CGTAGGGACT 


ATCGATACCT 


AGCCAAAACA 


TCACGCCATC 


AACCTTTGAA 


AGGAGATAAC 


10380 


CAAAGTCCTC 


GCCTGTCATA 


GCAGGTTCGA 


TATCAATCAA 


CTCGATTCCG 


TCTTTTTCGT 


10440 


CAAAGAAGTC 


CATCAGTTCA 


CGCGCCAAGG 


CTGGATTGTT 


CTCAACAGGT 


AGGTATCCAC 


10500 


CTTGTTTGAG 


TTCCACTTCG 


ACTTCCATAT 


CAAAGGCAGC 


TGCAACCCCT 


TCTGCAACTG 


10560 


TTTTTACCCT 


CTTTTGCACC 


AAGAGACTCA 


TGTCCTGTGT 


CAAGGCACGA 


ATAGTTCCAT 


10620 


GTAAAAAAGC 


TGTGTCTGTG 


ATGACATTGT 


TGGTGGTTCC 


AGCTTGAAAA 


ACGCCGAAGG 


10680 


TCACCACTGC 


TCCCTCGATT 


GGGTTGACAT 


TGCGGCTAAC 


AACTGACTGC 


ACTTGGGTCA 


10740 


CAAAGTAACT 


AGCCGCCACC 


AAGGCGTCAT 


TGGCTTCATG 


AGGAAAAGCT 


GCGTGGCCAC 


10800 


CTTTGCCTTT 


GAAACGGATC 


TTCACCTCGC 


AAGTTCCTGC 


AAAGAGTGTA 


TGAGTATTAG 


10860 
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TCGCAATCTG GCCGACTTTC AAATCTGGAC GAACATGGAG ACCATAGAAT TGATCTGGCA 10920 

ACCAATCTCC AAAAGCACCG TCCTCATACA TGAGCATACC ACCAGCTTCA TTTTCTTCAG 10980 

CAGGCTGAAA TAGAAAGAGC AGATTATTCT TGGGTTGCTC CTCAAGGGCG CGCTCAAGAC 11040 

AGCCTAAGGC AATGGTCATA TGAAAATCAT GGACACAGGC ATGCATGCGA CCTTGGTGTT 11100 

GAGAAGCAAA AGGTAGACCT GTTTGTTCGA CGATAGGCAG GCCATCAATA TCTGTCCGCC 11160 

AACCAATGGT TCGCTCCGGC TGACTTCCCT GCAGGTAGAC CAAAATCCCT GTCCGCCAAG 11220 

TACGAATTTG AACAAAATCC TTGCCCGTAG TCAATTTCTC AATCACATCC AGCAAATAAG 11280 

CCTGAGTCTT GAACTCCTCC AAGCCAATCT CTGGAATCTG GTGTAAATCT CGTCTAGTCT 11340 

GAATCAAATC TAACATCTAT CTGTCCTCCG ATATAGCAGA AAGAGGCTGG AAAAAGGGTT 11400 

CCGCCTCTTT TTTACTTTTA CAATTACAAG GTACGAAGCG CATCCTCTAG CGCTGTTTTT 11460 

TGTTGAGTTT GGGCATCAAT TTCTTTGATA ATACGAGCTG GAACACCTGC TACTACCACG 11520 

TTTTCTGGGA CATCTTGGGT AACAATAGCT CCTGCTGCGA CAACTGAACC ACTACCGATT 11580 

TGGACTCCTT CGATAACCAC TGCATTAGCA CCGATAAGAA CATTGTCTCC GACACGGACT 11640 

GGTTCAGCAC TAGCTGGCTC AATCACACCT GCCAAAACTG CACCTGCACC AACGTGGCTA 11700 

TTTTTTCCAA CGATGGCACG GCCACCAAGG ATGGCACCCA TGTCAATCAT GGTTCCAGCA 117 60 

CCGATTTCAG CACCGATATT GATAACAGAT CCCATCATGA TAACAGCATT GTCACCAATT 11820 

TCCACCTGGT CACGGATAAT CGCACCTGGC TCGATACGAG CGTTGATAGC ACGCTTATCT 11880 

AGCAAAGGAA CTGCAGAATT ACGAGCATCT TGCTCGACAA CATAATCTTG ATTTTCTACC 11940 

AAACCTTCAA GAAGCGGAGC CACATCCTTC CAGTCTCCGA ATAGGACATT TCCTAGTTTG 12000 

ACAACAGAGC TAGGCACAGC AGTTGCGAGT TGCCCCTCAA AGGTTACTTT GACACTGGTT 12060 

TTCTTTTCAG CATTGGCGAT AAATTGGATA ATTTCTTGAG CGTTCATTTT TGTAGCAGTC 12120 

ATAGGTG 1212 1 
(2) INFORMATION FOR SEQ ID NO: 149: 

(X) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12566 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 

CCATCCTTCT GTTGATGTGA CAGGAATGAT GATAAATCAA CCAGTAGCTA GTCGCGAAGA 60 

GGTGACAGAG GCTTTGAGTC ACTTGGCGGT AGAGCACAAT AGTCTCATTG CTCGTCGAAT 120 
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CGTTGAGCCA AATGAAGCTG GAGAAACACG CTTTACCTAT GCCACTTATG GTGAGGGAAA 180 

GCTTCCAGAA GGTCTGACCA TTTCCTCCAA GGAGAGTGCA GAAACGAGTG ATTTATTAGG 240 

GTCTTACTTG ATTGTATCAG GAAGTTTGGA TGGAGTGAGC TTACAGACCA CCTTGAAAGA 300 

GCTTGGTTAT CAAGGCTTTG TTTCGAATGG AGAAGATCCA TTTTCGATAG TCTTACTATT 360 

GACGGCCACC CCTATGGTGC TACTGAGTTT AGCTATTTTT CTGCTGACCT TTATGAGTCT 420 

-GACCCTGATT TATCGGATCA AATCCCTTCG TCAGGCAGGG ATTCGCTTAA TAGCTGGTGA 480 

GAGCTTGTTT GGAGTTGCTC TCAGACCAGT GTTAGAAGAT GTGAGACAGC TTATCTGCTC 540 

AGTGCTGGTA TCCAGTCTTT TGGGATTGGG GATTCTCTGG TATCAAGGTG CCTTGTTTAT 600 

GGCAACGGTG CAACTGGTCA TCATTGCTCT TCTACTTTAT GGATTGACCT TGGCAGGGAT 660 

TTCTACCTTA CTAAGTGTCG TCTATCTACT TGGTTTACAG GAAAATAGTC TGGTGGATCT 720 

ATTGAAAGGG AAACTCCCTC TCAAACGTAT GATGACATTG ATGATGGTGG GGCAACTCTT 780 

AGCTGTATTG GTGGTCGGAT CGAGTGCGAC AGCTCTCCTA CCCCACTACC GTGAAATGCA 840 

GGAAATGGAG AGAGCTAGCA ATAAATGGAG CCAGTCCTCA GACCGTTACC GTCTATCCTT 900 

TGGTTGGTCT AGTGCATTTG CCGATGAAGA AGGAACGCGT AAGGATAATC GTGAGTGGCA 960 

GACATTTACT GAAGAACGGT TAGCCAATAC AGACTCTTTT TATATTATGA GCAATGTTGA 1020 

CAATTTCTCA GATGGAGCAG AAGTGGACCT AGATGGCAAT CGTCTCAGTG ACTACACACC 1080 

GTCAGGGAAT GTTATCTATG TCTCACCGCG CTATCTGATA GAAGAAAAGA TTACCGTTTC 1140 

TTCAGAGTTT ATGGACAAGA TGCAAAACTT GTCTGAGGGA GAGTTTGGGC TGATCTTGCC 1200 

TGAGAGCTTG CGAGAGCAGT CTGTCTACTA CCAAGGATTG TTTACAGATT ACCTGCAAAA 1260 

CTTTTCATCT GAAAGTGTAG AAGTGACGAG TCAGAAACAC TACCTCCCAC AGGTAAGGCT 1320 

AGCTTTTACA GAAACAGGAC AGGAACGTTT CCTCTATAAT GATGGGTACA AGACAACACG 1380 

CCAGTACCTA AAAGATCCGA TTATTGTAGT TCTAACGCCG CAAGCGACTG GAACAAGACC 1440 

TGTTGCAGGG ATGTTGTGGG GAACTACGGC TAATAGTGCC TTGAAACTAG ATCGATATGG 1500 

AGACAGCATC ACAGCTCTAA AAGAGAAAGG TCTGTATCAC AAGGTTTCTT ACTTGGTAAA 1560 

AAGCCAGCTA TTTTTTGCCA AGG TACT AAA TGACAAACGG GTGGAGTTTT ACTCTCTCCT 1620 

TATTGGGACG ATTTTGACCC TGTCTACGGC TATCTTGTTA TTTGATTCCA TGAATCTTCT 1680 

CTATTTTGAG CAGTTCAGAC GGGAACTTAT GATTAAACGT CTTGCTGGTA TGACAATCTA 1740 

TGAGCTTCAT GGCAAGTATT TACTGGCGCA AGGAGGAGTT CTCTTGCTTG GCCTAGTCCT 1800 

ATCTAGTATT TTGACAAGAG ATGGTTTGAT TAGCGCTCTA GTTGTAGCTT TGTTTACGCT 1860 
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TAACGCCCTC 


TTGATTTTAG 


TAAGGCAGGA 


CAAAAAAGAA 


GAAGCTGGTA 


GCATGGCAGT 


1920 


ATTGAAAGGA 


AAATAAGATG 


ATTGATATTC 


AAGGATTGGA 


AAAGAAATTT 


AATGACCGCG 


1980 


CGATTTTCTC 


TGGTTTGAAT 


CTCAAGCTGG 


AGAAGGGCAA 


GGTTTATGCC 


TTAATCGGAA 


2040 


AGAGTGGAAG 


CGGAAAGACG 


ACGCTGCTGA 


ATATCTTGGG 


AAAGCTAGAA 


AAGATAflATfi 


2100 


GTGGAAGGGT 


TCTCTATCAG 


GGGAAAGATT 


TAAAAACCAT 




(2 A (TP A TTTTP 


3i fin 

£. J.DU 


GAGACCAGAT 


GGGCTATCTC 


TTTCAAAATT 


TCGGCCTCTT 


AGAAAAPPAA 






AAAATTTGGA 


TTTGGGTTTT 


GTTGGTCAGA 


AAATCTCAAA 


AnTAfiAArVTP 


1 1\juAAA\jvj*_ 




AAGTGGGGGC 


TTTAGAAAAA 


GTTAATCTAG 


GGTATTTGGA 


TTTAflA APAA 
111 nuAAwAA 


nnnAlL 1 A I A 




CTTTATCTGG 


GGGAGAGGCC 


CAACGAGTTG 


CCCTTGPTAA 




a a & a n tt 1 /^ c 


2400 


CCTTGATTTT 


GGCAGATGAA 


CCAACAGCAG 


CPCTTflATrT' 


TttAAAATTVA 


oALAxAlAal 1A 




TGAATCTCTT 


GGTGGATTTG 


AAAGATGAAA 


ATCGAATTAT 




AL\-l_A 1 AA 1 \. 


2520 


CCCTAGTCTG 


GAATAAGGCT 


GATGAAATCA 


TTGATATGAG 

X X x r\ X \JAw 


f3 A A A PTTRfT 


r , aTV , 'iv"*"Tv ,, a a 

4-A I v» 1 vj 1 uAA 


OCQA 


AAAATCCGTA 


TTCGCAGGGT 


ATCTGATTAT 


CCTAGTGCCA 


GAGGTGGTTT 


a a a f2 a t a tv 




CTCATCATGG 


AAAATATGAC 


CAATCATCTC 


CTTTTGGTTC 


AAATCCGAGT 


GCATGGCTAT 


Z /UU 


TTGCTTGATT 


TTGCTAGTAT 


TGAAGGGCAA 


AGGCAAAAGC 


ATTATCGTTT 


GAAAAATTTA 


I /Oil 


CCTCAGACGG 


TTGAACTGAC 


AGTGGATGAT 


GTGGAGGAGG 


ATGTGGATTT 


GACCCTACCT 


*5 Q O A 


GAAAATCGAA 


GTTATCAAGA 


AGCTGATTTT 


TTTGAACGCA 


TGTTTCGAGA 


GAACTGCTAA 


*j q on 


GGCCACTTTT 


AAAGATTTCC 


AAGACTATCT 


TTCTTCATGA 


GGAAAGATAG 


TTTTTTGGTA 




TGATTTTCAT 


TCCCAAAATA 


CAAGGGGAAT 


GTGTTACAAT 


AGTAGTAACA 


GATAATAGAA 


JUUU 


AAGAGAATAG ATGAGAATTG CAGATTATAG CGTGACCAAG GCAGTGCTGG 


AGCGTCACGG 


j u DU 


TTTTACCTTT AAAAAGTCCT TTGGGCAAAA TTTTTTGACG GATACCAATA TCCTTCAAAA 




AATTGTGGAT 


ACGGCTGAAA TTGATGATCA 


GGTCAATGTC 


ATCGAAATCG 


GGCCAGGTAT 


3180 


TGGTGCCTTG 


ACAGAATTTT 


TGGCTGAGCG 


TGCAGCCCAA 


GTCATGGCTT 


TTGAGATTGA 


3240 


CCACCGTTTG 


GTGCCAATTT 


TGGCAGATAC 


CCTGCGTGAT 


TTTGATAATG 


TGACCGTAGT 


3300 


TAACGAAGAT 


ATTCTCAAGG 


TTGATTTGGC 


GCAACATATC 


CAGAATTTTA 


AAAATCCTGA 


3360 


CCTGCCAATC 


AAGGTAGTGG 


CTAATTTGCC 


TTACTACATC 


ACGACGCCTA 


TTCTCATGCA 


3420 


CTTGATTGAG 


AGTGGCATTC 


CTTTTTGTGA GTTTGTGGTC 


ATGATGCAGA 


AAGAAGTAGC 


3480 


GGACCGCATT 


TCAGCCCAGC 


CTAACACCAA 


GGCTTACGGT 


AGCTTGTCTA 


TCGCCGTGCA 


3540 


GTATTACATG 


ACAGCCAAGG 


TTGCCTTTAT 


CGTGCCTCGT 


ACGGTCTTTG 


TGCCAGCGCC 


3600 


AAATGTGGAT 


TCAGCCATCT 


TGAAAATGGT 


GCGTCGTCCA 


GAGCCAGCCG 


TAGCAGTAGA 


3660 
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AGATGAGAAC TTTTTCTTTA AGGTTTCCAA GGCTAGTTTT ACCCATCGCC GCAAGACCTT 3720 

GTGGAATAAC TTGACAGGTT ACTTTGGTAA GACTGAAGAG GTCAAGGACA AGCTGACCAA 3780 

GGCTTTGGAC CAGGCAGGCT TGTCACCAAG TGTGCGTGGG GAAGCTCTCA GCTTGGCAGA 3840 

ATTTGCCGGT CTAGCAGACG CACTTAAAGG GCAAGGACTC TAAGATGCAG GGACAAATCA 3900* 

TTAAAGCCTT GGCAGGTTTC TACTATGTGG AGAGTGATGG CCAGGTTTAT CAAACACGCG 3960 

. GGCGTGGGAA TTTCCGTAAA AAAGGCCATA CCCCTTATGT TGGGGACTGG GTAGATTTCT 4020 

CTGCCGAGGA AAATTCAGAA GGCTATATCC TCAAAATTCA CGAACGGAAA AACAGTCTGG 4080 

TTCGTCCGCC TATTGTCAAT ATCGATCAAG CTGTAGTAAT CATGTCCGTC AAGGAACCTG 4140 

ATTTTAACAG CAATTTGCTG GATCGTTTCT TGGTTCTTTT GGAGCACAAG GGCATCCATC 4200 

CCATTGTCTA TATTTCCAAA ATGGATTTGT TGGAAGATAG GGGAGAACTG GATTTTTACC 4260 

AGCAGACCTA TGGTGACATC GGCTATGACT TTGTGACCAG TAAAGAGGAA CTCCTGTCTT 4320 

TGTTAACAGG CAAGGTTACG GTCTTTATGG GGCAGACAGG TGTTGGGAAG TCAACTCTTC 4380 

TCAATAAAAT CGCACCAGAC CTCAATCTTG AAACGGGAGA AATTTCAGAC AGTCTAGGTC 4440 

GCGGTCGCCA TACCACTCGA GCTGTTAGTT TTTACAATCT CAACGGGGGT AAAATCGCAG 4500 

ATACACCAGG ATTTTCATCC TTGGACTATG AAGTATCAAG GGCTGAAGAC CTCAATCAGG 4560 

CTTTCCCAGA GATTGCTACT GTTAGCCGAG ATTGTAAGTT CCGTACTTGT ACCCATACCC 4620 

ATGAGCCGTC TTGTGCCGTC AAACCAGCTG TTGAAGAGGG TGTTATTGCA ACCTTCCGTT 4 680 

TTGACAATTA CCTGCAATTC CTTAGTGAAA TTGAAAATCG TAGAGAAACC TATAAAAAAG 4740 

TCAGCAAAAA AATTCCAAAA TAAGGAGAAA CCTATGTCTC AATACAAGAT TGCTCCGTCA 4800 

ATTCTGGCAG CAGATTATGC CAACTTTGAA CGTGAAATCA AACGTCTAGA AGCAACTGGG 4 860 

GCAGAATATG CCCATATCGA TATCATGGAC AGTCATTTTG TACCGCAAAT CAGTTTTGGT 4 920 

GCAGGTGTGG TCGAGAGCCT TCGTCCTCAT AGTAAGATGG TTTTCGATTG CCACTTGATG 4980 

GTGTCAAACC CTGAGCATCA TCTGGAAGAT TTTGCGCGTG CAGGTGCAGA CATCATCAGT 5040 

ATCCATGTAG AAGCAACGCC TCATATTCAT GGCGCCCTCC AAAAAATTCG TTCACTCGGA 5100 

GTTAAGCCTT CAGTCGTTAT CAATCCTGGC ACATCAGTTG AAGCCATCAA GCACGTCCTT 5160 

CATCTAGTTG ACCAAGTTTT AGTCATGACG GTTAATCCAG GTTTTGGTGG GCAAGCCTTT 5220 

CTGCCAGAAA CCATGGATAA GGTCCGTGAG TTGGTTGCTC TTCGTGAGGA AAAAGGTTTG 5280 

AACTTTGAAA TCGAAGTGGA TGGTGGGATT GATGACCAAA CTATTGCTCA AGCCAAAGAA 5340 

GCCGGTGCGA CTGTTTTTGT AGCAGGTTCC TATGTCTTTA AGGGAGAAGT CAATGAGCGA 5400 
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GTACAAACTC TCAGAAAACA 


ACTGGACTAG 


GGTTGCAGTT 


TTTGCAGGCG 


GAAACPfiCf^ 

wflnnw V* Vj V* wVJ 


5460 


TCATTATCGG ACAGATTTTG 


ATGCTTTTGT 


TGGGGTGGAT 


CGAGGCTCGC 


1 V_ 1 VJvJ\J i L 1 


5520 


GGAAGAAGAC TTACCTCTTG 


CTCTAGCAGT 


CGGAGATTTT 


GATTCTGTGA 


CCICI A A f! A A CI A 


jjOU 


GCGACAGGTG ATTCAAAAAG 


GTGCCCAGTA 


TTTTGTCCAA 


GCACGACCAG 






TACAGATCTG GAATTGGCTC 


TCTTAACCAT 


CTTTGAACAA 


AATCCTCAGG 


r\- AVju 1 v. At. 


D /UU 


TATTTTCGGT GCCTTGGGTG 


GCCGTATTGA 


CCATATGTTG 


GCCAATGTCT 


TTCTGCCTAG 


5760 


CAATCCTAAG TTGGCACCCT 


ATATGCATCA 


AATAGAAATT 


GAGGATGGGC 


AAAACTTGAT 


5820 


TACTTATTGT CCAGAAGGAA 


TCAGTCAGCT 


AGAACCTCGT 


TCAGACTACG 


AC 1 A 1 C I AbC 


5880 


CTTTATGCCA GTTCGGGATA GCCAGCTGAC TATTCTTGGA GCCAAGTATG 


AGTTGACAGA 


5940 


GGAAAATTTT TTCTTTAAAA 


AAGTGTACGC 


TTCTAACGAA 


TATATAGATA 


GGGAAGTGTC 


6000 


GGTAACTTGC CCAGATGGTT 


ATGTGGTCGT 


ACTGCATAGC 


AAGGACAGGA 


GGTAGGATGG 


6060 


AAAGTTTACT TATTCTATTA 


TTAATTGCCA ATCTAGCTGG 


TCTCTTTCTG 


A 1 rTGGCAAA 


6120 


GGCAGGATAG GCAGGAGAAA 


CACTTAAGTA 


AGAGCTTGGA 


GGATCAGGCA 


uni LAI 1 ILii 


6180 


CAGACCAGTT GGATTACCGC 


TTTGACCAAG 


CCAGACAAGC 


f~* a nt~ s r' a /t^t a. 
LnuV.V.Aul In 




6240 


ATTTGGAAGT GGTTGTCAGC 


GACCGTTTGC 


AAGAAGTGCG 


GATTGAATTG 


L.AUCAAGGTC 


6300 


TGACCCAAGT CCGTCAAGAA 


ATGACAGATA 


ATCTCCTCCA 


AACTAGAGAC 


AAGACAGACC 


6360 


AACGTCTCCA AGCCTTGCAG 


GAATCAAATG 


AGCAACGTTT 


GGAACAAATG 


LuL I. Aly A(_ Lrtj 


6420 


TCGAGGAAAA ACTAGAAAAG 


ACCTTGCAGA 


CACGCTTACA 


GGCTTCCTTT 


r! r r* r r> r riwnm 


6480 


CTAAACAACT GGAGTCTGTC 


AATCGTGGCC 


TTGGAGAAAT 


GCAGACAGTT 


VjLLUjlliAlXj 


6540 


TCGGAGCTCT TAACAAGGTT 


CTCTCTGGAA 


CCAAGACGCG 


AGGGATTCTG 


wnuAAl 1 \jL_ 


6600 


AACTGGGGCA AATTATTGAA 


GACATCATGA 


CACCTGCCCA 


GTACGAACGA 


vxAA 1 ALoLAA 


DOOU 


CGGTTGAAAA CTCTAGTGAA 


CGAGTGGAGT 


ATGCCATCAA 


GTTACCCGGA 


P A A fV2 Pfl A C C 


o/zy 


AAGAATACGT CTATCTGCCA 


ATTGACTCTA 


AGTTTCCACT 


GGCAGATTAT 


TACCGCTTCG 


6780 


AAGAAGCCTA TGAGACAGGT 


GACAAGGATG 


AGATTGAACG 


CTGTCGTAAG 


TCACTCCTAG 


6840 


CAAGCGTCAA GCGCTTTGCT 


AGGGATATTA 


GGAACAAGTA 


CATAGCACCA 


CCTCGGACGA 


6900 


CCAATTTTGG AGTTTTGTTT 


GTTCCGACAG 


AAGGTCTCTA 


CTCAGAAATC 


GTCCGCAATC 


6960 


CGGTCTTCTT TGATGATTTG 


AGACGGGAAG 


AACAGATTAT 


TGTTGCAGGA 


CCAAGTACCC 


7020 


TATCAGCCCT TCTTAACTCC 


CTATCAGTTG 


GTTTCAAGAC 


CCTTAATATC 


CAAAAGAGTG 


7080 


CCGACCATAT CAGCAAGACT 


CTTGCCAGTG 


TCAAGACCGA 


GTTTGGCAAG 


TTTGGTGGTA 


7140 


TTCTGGTCAA GGCACAAAAA 


CATCTCCAAC 


ATGCCTCTGG 


CAATATTGAT 


GAATTATTAA 


7200 
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ACCGTCGTAC CATAGCTATC GAGCGGACGC TCCGTCACAT TGAGTTGTCA GAAGGTGAGC 


7260 


CTGCGCTTGA 


TCTACTCCAT 


TTTCAAGAAA 


ATGAGGAAGA 


ATATGAAGAT 


TAGTCACATG 


7320 


AAAAAAGATG 


AGTTATTTGA AGGCTTTTAC 


CTAATCAAAT 


CAGCTGACCT 


GAGGCAAACT 


7380 


CGAGCTGGGA 


AAAACTACCT 


AGCCTTTACC 


TTCCAAGATG 


ATAGTGGCGA 


GATTGATGGG 


7440 


AAGCTCTGGG 


ATGCCCAACC 


TCATAACATT 


GAGGCCTTTA 


CCGCAGGTAA 


GGTTGTCCAC 


7500 


ATGAAAGGAC 


GCCGAGAAGT 


TTATAACAAT 


ACCCCTCAAG 


TCAATCAAAT 


TACTCTCCGC 


7560 


CTGCCTCAAG 


CTGGTGAACC 


CAATGACCCA 


GCTGATTTCA 


AGGTCAAGTC 


ACCAGTTGAT 


7620 


GTCAAGGAAA 


TTCGTGACTA 


CATGTCGCAA 


ATGATTTTCA 


AAATTGAAAA 


TCCTGTCTGG 


7680 


CAACGGATTG 


TCCGAAATCT 


CTACACCAAG 


TATGATAAGG 


AATTCTACTC 


CTATCCAGCT 


7740 


GCCAAGACCA 


ACCACCATGC 


CTTTGAAACG 


GGCTTGGCCT 


ATCATACGGC 


GACCATGGTG 


7800 


CGTTTGGCAG 


ACGCTATTAG 


CGAAGTTTAT 


CCTCAGCTCA 


ATAAGAGCCT 


GCTCTATGCG 


7860 


GGGATTATGT 


TGCATGACTT 


AGCTAAGGTC 


ATCGAGTTGA 


CGGGGCCAGA 


CCAGACAGAG 


7920 


TACACAGTGC 


GAGGTAATCT 


TCTTGGACAT 


ATCGCTCTCA 


TTGATAGCGA 


AATTACCAAG 


7980 


ACAGTTATGG 


AACTCGGCAT 


CGATGATACC 


AAGGAAGAAG 


TCGTTTTGCT 


TCGTCATGTC 


8040 


ATCCTCAGTC 


ACCACGGCTT 


GCTTGAGTAT 


GGAAGCCCAG 


TCCGTCCACG 


CATTATGGAA 


8100 


GCAGAGATTA 


TCCATATGAT 


TGACAATCTG 


GATGCAAGCA 


TGATGATGAT 


GTCAACAGCT 


8160 


CTTGCTTTGG 


TGGATAAAGG 


AGAGATGACC 


AATAAAATCT 


TCGCTATGGA 


TAATCGTTCC 


8220 


TTCTATAAAC 


CAGATTTAGA 


TTAATAATTT 


AAGAAAAATG 


AGCATTTTTT 


AGGATAAGAA 


8280 


TGTTCGTTTT 


TTTATGTGAA 


TATGGTATAA 


TAAGTAAAAG 


ACAAAAATGA 


ATACTCTTCG 


8340 


AAAATCTCTT 


CAAACTAGGG 


TAGTATCGCC 


TTGTCGTATG 


TATATATGCA 


GGTATATTAC 


8400 


AGGGTTTGTC 


AGTTCTATTG 


ACAATCTCAA 


AACAGTGTTT 


TGAACCACCA 


GCGACCAGCT 


8460 


TTCTAGTTTG 


CTTTTTGATT 


TTTTGAATAA 


AAATGGAATA 


GGAAATAGAA 


ATGAAATTAA 


8520 


GAAGAAGTGA 


TCGGATGGTT 


GTCATTTCCA 


ACTATTTGAT 


TAATAATCCT 


TATAAACTAA 


8580 


CTAGTCTCAA TACTTTTGCT GAAAAGTATG AGTCTGCTAA ATCATCCATC TCAGAAGATA 


8640 


TCGTCATTAT 


CAAACGCGCC 


TTTGAGGAAA 


TTGAAATCGG 


TCATATCCAG 


ACAGTGACTG 


8700 


GGGCTGGCGG 


AGGTGTCATC 


TTCACACCGT 


CTATTTCGAG 


TCAGGATGCT 


AAGGAAATGG 


8760 


TTGAAGACTT 


GCGTACCAAG 


TTGTCAGAAA 


GTGACCGTAT 


CTTGCCAGGT 


GGTTATATCT 


8820 


ATCTGTCTGA 


TTTGCTTAGC 


ACACCAGCCA 


TCTTGAAAAA 


TATTGGTCGT 


ATTATTGCCA 


8880 


AAAGCTTTAT 


GGACCAAAAA 


ATTGACGCGG 


TTATGACCGT 


AGCAACTAAG 


GGTGTGCCAC 


8940 
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TTGCAAATGC AGTTGCCAAT GTCCTCAATG TCTCTTTTGT CATTGTGCGC CGTGACCTGA 9000 

AAATTACCGA AGGTTCAACT GTTAGCGTCA ACTATGTTTC AGGTTCAAGT GGTGACCGTA 9060 

TCGAGAAAAT GTTCCTTTCA AAACGTAGTC TTAAGGCAGG CAGCCGTGTC TTGATTGTGG 9120 

ATGACTTCTT GAAAGGTGGC GGAACGGTCA ATGGTATGAT TAGTCTCTTG CGCGAGTTCG 9180 

ACTCAGAACT GGCAGGTGTA GCGGTCTTTG CGGACAATGC CCAAGAAGAA CGTGAAAAGC 9240 

AGTTTGACTA CAAGTCACTC TTGAAGGTAA CCAATATTGA TGTCAAGAAC CAAGCCATCG 9300 

ATGTTGAGGT TGGCAATATC TTTGACGAAG ATAAATAAGA GATAGAACTA AAGGTTGGAA 9360 

CGATTGTCCC AGCCTTTCTT TGCAAACAGA ATAGAAGGAA GCTTATGAAA ACACCATTTA 9420 

TCAATAGAGA AGAGTTAGAA GCGATTGTTG CCGAGTTCCC GACTCCCTTT CACTTGTATG 9480 

ATGAGAAGGG GATTCGTGAG AAGGCAAGAG CCGTCAACCA AGCTTTTTCG TGGAACAAGG 9540 

GCTTTAAGGA ATATTTTGCA GTTAAGGCTA CTCCAACTCC AGCTATTTTG AAAATTCTCC 9600 

AAGAAGAAGG TTGTGGTGTG GACTGCTCTA GTTATGTAGA GCTTTTGATG AGCCATAAAC 9660 

TGGACTTTCT GGGTTCTGAG ATTATGTTCT CTTCCAACAA CACGCCAGAC AAGGAATACG 9720 

CCTATGCACG TGAATTGGGT GCGACCATTA ACTTGGATGC CTTTGAAGAT ATTGAACATC 9780 

TGGAGAGAGT AGCAGGCATT CCAGAAATCA TCTCTTGTCG TTATAATCCT GGAGGCGTTT 9840 

TTGAACTGGG GACAGACATT ATGGACAATC CTGGGGAGGC TAAGTTTGGC ATGACCAAGG 9900 

ACCAGCTCTT TGAAGCCTTT GCTATCTTGA AGGAAAAAGG AGCCAAGACT TTTGGGATTC 9960 

ACTCCTTCCT AGCGTCCAAT ACCGTGACCC ATCTCTATTA TCCAGAGTTG GCTCGTCAGC 10020 

TCTTTGAACT GGCTGTTGAA ATCAAGGAAA AGTTGGGCAT TTCGCTAGAC TTTATCAATC 10080 

TTTCTGGCGG TATTGGTGTT AATTATCATC CAGACCAGGA GCCGAACGAT ATCGCCTTGA 10140 

TTGGTGAGGG AGTTCGTAAG GTGTATGAAG AGGTTCTTAC GTCAGCAGGT CTTGGTCAGG 10200 

TCAAGATTTT CACCGAATTG GGTCGTTTTA TGCTGGCACC TCACGGTGCT CTAGTCACAA 10260 

GAGTCACTCA TAAGAAGGAA ACCTACCGTA CCTATCTAGG TGTGGATGCC TCAGCAGTCA 10320 

ACCTCATGCG TCCAGCTATG TACGGAGCTT ACCATCATAT TAGCAACGTG ACCCATCCAG 10380 

ATGGACCAGC TGAAGTGGTA GATGTGGTCG GTTCACTCTG TGAAAACAAT GATAAATTTG 10440 

CAGTTAATCG CGAACTGCCT CATACAGAAA TCGGTGATTT GCTGGTCATT CATGATACAG 10500 

GTGCCCACGG ATTTTCAATG GGCTACCAGT ATAATGCCAA ATTACGTTCT GCGGAAATCC 10560 

TCTATACCGA AGAAGGTAAA GCCCGTCAAA TCCGCCGTGC AGAGCGCCCT GAGGACTATT 10620 

TTGCAACCTT ATATGGCTTC GATTTTGAAG AATAATCTGA TAATAGATTG AAAATGAAAT 10680 

TGAAAAACAG ATTGCTTTCT AAAAAATAGG CAAAAATCTT GTTTTTCCTT CAAGTCGTGA 10740 
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TATAATAAAA CTATAAAACG TTTTCAAGGA AGGTAACGAT ATGTCTGAAG AAACAATTGA 10800 

TTATGGACAA GTGACAGGAA TGGTGCATTC GACAGAAAGC TTTGGGTCAG TAGATGGGCC 10860 

TGGTATTCGC TTTATTGTCT TTTTGCAGGG CTGTCACATG CGTTGCCAGT ATTGCCACAA 10920 

CCCAGACACT TGGGCTATGG AGTCCAATAA GTCACGTGAA CGGACGGTAG ATGATGTCTT 10980 

GACAGAGGCC TTGCGCTACC GTGGTTTCTG GGGAAATAAG GGTGGGATTA CAGTCAGTGG 11040 

AGGAGAAGCT CTCTTGCAGA TTGATTTCCT GATTGCTCTC TTCACCAAGG CTAAGGAACA 11100 

AGGAATCCAC TGTACCTTGG ACACCTGTGC TCTTCCTTTC CGTAATAAAC CACGTTACCT 11160 

TGAGAAGTTT GACAAACTCA TGGCTGTCAC TGACTTGGTT CTTTTGGATA TCAAGGAAAT 11220 

CAACGAAGAA CAGCACAAGA TTGTCACTAG CCAAACCAAT AAAAATATCT TGGCTTGTGC 11280 

CCAGTATCTA TCAGATATTG GAAAACCTGT CTGGATTCGC CACGTGCTAG TTCCAGGATT 11340 

GACAGACAGA GATGATGACT TGATTGAACT TGGTAAGTTC GTCAAGACCC TCAAAAATGT 11400 

TGATAAGTTT GAAATTCTAC CTTATCACAC CATGGGTGAG TTCAAGTGGC GTGAACTTGG 11460 

AATTCCATAT TCCCTCGAAG GAGTCAAACC ACCAACAGCA GATCGCGTCA AGAACGCTAA 11520 

ACAACTCATG GATACCGAAA GTTATCAAGA TTATATGAAA CGTGTACATG GATAGAAAAG 11580 

AAGCCTGATG GAAACATCGG GCTTTTGACT TGCAAAAAGA CTTAGCAAAT CAGCTAAGCC 11640 

TTTTTCTTCT TATCTCGAAC GTTGTTTTCC AGCGTTGCGA TTTTTGTGTT TTTTCTTGCT 11700 

TGTGATAGCA GTTGGTTGTT CAGGGGTAAC GTCTTTTCGT CCACTTGGTT TAGAGAAAGC 117 60 

ACTTGCTTTT GGTGGGTTCT TGGCTAGTTC TTCACGGACT TTTTTGCGAA GTTTTGGACG 11820 

AACGATATAG TTGACGATAA ACTGTTGGAG AATCATCATG AAACCACCGA CAACCCAGTA 11880 

AAGTGTGACA CTAGCTGGTG AGAAGAGGGA GAAGACGACG ATCATGAGTG GGCTCATGTA 11940 

AATCATTTTC TTGATTTGTT CTCTTTGCAT TTCATCTTCT ACTCCGTGAA GTGAAAGGAG 12000 

CGATTGAAGA TAGTAAAGGA CACCAGCACA GGCAACCAAA ATCATACTTG GAGAACCTAG 12060 

AGGAATGCCT AGGTAGCTTG CTTGAGCAAC CCCTTCAGTA TGTTGGGCAG CAAAGTAGAT 12120 

AGCAGAGAAG AAAGGCATTT GAAGGAGGAT AGGGAAACAT CCTACACCGC CAAACATGCT 12180 

GATACCGTGC TCTTTTTGAG CAGCAAAGAG AGCTTGTTGG GCTTCGAGTT TTTCTTCTTG 12240 

AGTAGTCGCT TCTTTGAGAC GCGTTTGGTG TGGCTCAAGG ACGTGCTTGA GGGCGTTCAT 12300 

CTTTTCAGAG TGAAGCGTTG CCTTCCATGA TTGGTAGATA CCAAGTGGTA AGATAATCAA 12360 

GCGTACGATA ATGGTTACGA TAATGATAGC GACACCAAAG CCTAGACCTT TATCAGTAGC 12420 

GAAGTACTTG ATGGCTTCAG CCATAGGCGC TCCGATCGTA TTCCAAATAA ATCCTGTTGG 12480 
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CTGACCTGTG GTTTTATCGA CATTGACACA GCCAGTCAAG ACAAGCAACA TAGCCACTCC 12540 



CATAGCCGAG AGTGCAAAAT CGGGGT 

(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5238 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



12566 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 

TGACACTCTG TAGGATTGTC GTTAATTGAT TGCTCGTACT CTCTACAATA ACCACCAAAG 60 

TAAAAACGAC ATAGAAAGAT AGCATCAGCT GTAGCCATAG CGCCTTTGAC ACCTTCTGGA 120 

TGATTATGAG TTACCTCTGC AGAAAGACTC GTAAGTCCTC TAGATGATGG CCATATACCA 180 

GTTTTCGCAT AAAAACCACA GTCCATGATC CAAGCACATG GAGAAATACG CATAGCTGAT 240 

CCATTCCCAA AGCTATTATA AGGCTCACGG TTATCGCTGT TTAGCCATGC ATTAAACCGA 300 

GCACCGTAAT CAGCATTCGG ATACATTCTG CCATATTTCT TCATCGCGTC AATGAAGTCA 360 

TCTTTTTGTC CACCATTCAT AATTGCTTCT GCAACAGCAC AGGTCATAAC CGTGTCATCT 420 

GTAAAAAAGC AGTCCTTCCG AAATAAAGGA AAGTCCTTTG TTTTGATATT GTTCCATTCG 480 

TAAACAGAAC CGACAATATC TCCAATAATT GCTCCAAGCA TCAGATTCCT CCTTGTTCAT 540 

TTTGATGCTT TTTATATTGG TTATCTACCA TATTTATTTT AGAAAATAAC ATCCTGTTGG 600 

ATTTTAAAAA TTTCATTTTT TTCAAAATAG GGTTTTACCA TTTCTTTCCA CCTAGCTCTA 660 

TGAAAATTGA TTGATTTTAA AGGAGATAGG CCATAATTTC CCAATGCATA ACCATCATTT 720 

ACTTCAACAA CAAGTGTTCT GCCATCGCGA GTAACACCGA TATCTAGTCC ATAAGCTATT 780 

GGCGCATCTT TCCAACATGA TATCGCTTCA TCAATTACAC TTGCATCAAA TTGTGCATGA 840 

TAATCACCTG TATAGGGTCG AACATCTAAT ACGCGACCAT CTAACACAAA ACAACGCCAT 900 

TCAGCTATGA ATTCTACAAC CTCACTAATC CATATAGGAT AGTCGAAAGG TAGACCAATA 960 

CCTATTAAAT CATGGGTTCC ATTAACAACT CTTCCAGTAA AGACTTTTGA ACCAGCTTTA 1020 

GGCTTAATAA ATTTTCCCCA ATTATCAGGT ATATTCACAA TCTCTCCTAA AATACCAGCA 1080 

TAAATCTTTC GACCATAAAA CTCTTTAAGC TCAATAGGAT AGTCATGAAC CGGAACGTTT 1140 

AAGCCCATCA TTTTTAGTAA TGCTCTAGTC TCCATTATAT AATCTACAAC TATATCTTCA 1200 

CTTGTTAACT CTTTTATTTC AGAAAAAGAT TGATATAAAA TAACTTCTTC TCCTTGTAAG 1260 

TAGGCACCTA CTTGAGCATT GTATTTATTA ATTGAAACCT CACTTGGTAA TTTACTTTGT 1320 
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CTAATATAAA 


CAACCATTTC 


ATCACTCCTA 


TATCACTAGT 


GTTACACCAA 


TTTGTAAAAA 


1380 


ATAATAGCAA 


TTTTGCTCTT 


ATTTTTTTGA 


GTAAATAGCC 


CCCATAATAT 


CATCGAAATA 


1440 


ATCAACGGTA 


TTTAGGAGTA 


ATTCAATAAC 


CTGGGACTTT 


GTTAGTCGCA 


TTCCCCTTCT 


1500 


ATCTCTAGCA 


TCTTCTACTA 


AATTTTCAAG 


TTTCTCTAGA 


TTTTTATCAT 


CCAAGCTAAT 


1560 


CATTATTCTA 


TTTTTATCGG 


TTGCCATTTT 


CATCACCTCA 


AGTTAATTCT 


ATCACAGGTG 


1620 


TAACACTAGT 


GTCAACTGGC 


TTTTATAATA 


CATTAGTTTA 


AAAGTGGAGA 


GGATTTTTAA 


1680 


CACAGTAACT 


TTAAATCTTT 


GGTATTAAAA 


AATTTTCACA 


ATATTTATAG 


AAATAAAATC 


1740 


TGTCTCAAAT 


CAGTTATCAA 


ATCTAGTATA 


AATTATGAGC 


GGCTACTCTA 


ATACTTTCCC 


1800 


TCTAAACAAG 


AAAAAGACTT 


ACACTCAAGG 


GTTTTCTTCC 


CCCCCTTCGT 


TATAACGTTT 


1860 


TGACTCTTTT 


ACTAGCAAAG 


GTATATACTC 


ACAAGGAACT 


TTGGTTGACT 


ATTGAATCTC 


1920 


TCCAACTTCT 


TCTTTAACAT 


ATCCTTCTAC 


ATCTTCAATC 


TCTACAAACA 


TTGGGTCTAA 


1980 


GTGACACAAG 


AAATGCCAAA 


CTTCGATCCC 


TTTTTTTCTG 


TAAAGAATCG 


CTTCACCGTC 


2040 


TTCACTTCCG 


AAAAAGCTTC 


TGTCGATTTC 


ATATCCGCGG 


CTTTCTAAGA 


AGTCTTTTGC 


2100 


TTTACGATAG 


TTCGTTTCTC 


TTGTTTCGAC 


ATAGGCTTTA 


ACTTCATGGT 


TGTTAACGAC 


2160 


ATATGCATCA 


ATTTTTGAAT 


ATCCTTCGAT 


CACTCTATCA 


TTTTTGAGGG 


ATAAATTTGA 


2220 


AATCTCTTTC 


CAAATAATGT 


TTACATTTTC 


CTCAGGATCG 


AACATAAATT 


TAGATAAAGG 


2280 


AACAATATTT 


CCGTTAAAAA 


TAATTTCCAT 


ATAATCCGGT 


ATGTTTTTAG 


GATTAAAATA 


2340 


CTCCACTTCA 


AAACCATCTT 


CTGTTTCCAG 


AGTGTATCCC 


GGGATTTGAG 


CTACAAAGGC 


2400 


TTTCCCATCT 


TCTATGGAAT 


CAAATGCTAC 


TAAATCTTTA 


GAATAATCAT 


TTTGGTACAA 


2460 


TTCCAATATA 


ACCATCGATA 


ATCTCTCCAT 


TTTCATTATC 


AGGCTAATGT 


AAATAAGCAC 


2520 


GTCACCTGAC 


CAATTCAGGC 


TCTCTGTATC 


ATCTCATCAT 


ATTTCCTACT 


TACTTTACGA 


2580 


GTCTTATACC 


CAGAACACAC 


CTTATCGACC 


TTCGGTCTCA 


CCTCGTCGCA 


TTGGCTGAAC 


2640 


ATCTACTTTT 


ACTTTGCTGA 


TGCTTCAACT 


CGTACAAGCA 


GTGATACCGC 


CTCAGCGTGA 


2700 


TGCGTCAGTG 


GGACTCAAAA 


GGTTCGGGGA 


ACCTTTTGAG 


GATTAACTAC 


GTTTCTCTAA 


2760 


TAAACTTACA 


CATTCAACTT 


GTTCATCATT 


GTCCAAACCT 


ATGTTGAGAT 


TTTCTTCTAT 


2820 


AATTGGTAGC 


TTAAAAGTAA 


TGGATTTTAG 


CCATTGTCCG 


TTAGATTGTT 


TTTCTTCATA 


2880 


AACTTGAATT 


TCAGAAATCA 


AAGCTGAAAT 


TAACTGCCTA 


CGCTCTACAT 


CATTCATGAC 


2940 


TTTATAGAGC 


TTATCAAAAT 


AGATCAGAAC 


CTTATATATG 


TTATCTCCTG 


TAAGCTTTTC 


3000 


AGCTTCAATA 


GTCTGTTTCT 


TTGCTTTCGC 


ATCAATTAGT 


GATGATTCTA 


ATTCATCTAG 


3060 
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TTTGTCATAC ATACGATATA GTCTATCATC TAAATCCTGT TTCCTTCTCT TATAATGCTT 3120 

ATCTTCAACA TCTAAATTAT CTATTTCCTC AATTAGCTTA AACTTTGTAG AATGACTCTT 3180 

TCTCAATTCC TTTTGGTAAT TATCTATTTC TTTTTCTATT TCAGAGGTAT CCACCTTCAT 3240 

GTTGATTTTT TCTTGCATCA TAGAAGCAAA TTTCGGATTA CTTACTATCT TGACAATCAC 3300 

CTCTGCAACA GCATCATCTA ACAATTCTTC TCTAATTTGC TTACTGAATG TACACTTATT 3360 

ACCTCTTATC ATCTGCCTAT GGTTACAACC ATAGTAATAA AAATCTTTAT ACTTTGTGCC 3420 

ATCTTTCTTT TTCTTGATAC ACTTGTTCCC AAACATTCCC ACTCCACATA TCGGGCATTT 3480 

TACAATTCCA GAAAGCAAGT GTGTGCGTGT ATCTTTTCCT TTATTCACAT GCTCATATTT 3540 

CTTTGCTTGA GATTTTAGCT TAACCTGAGC AGCTTGCCAA ACTTCATCGG AAACTATAGC 3600 

TTCATGTATC CCTTCAGATA TTAGATATTC ATCTTGTTCA ACCTGCTTAT ATTCATTTCT 3660 

TGTACCATGA ACTTTTTCTA AAGTTCTTCT TCCAAATGCT ATTTTCCCAT TATATACAGG 3720 

ATTCTTTAAT ATCTTTCTTA TAAGACCTGC ATCAAACAAA GGATTCTTAC CATTCTGTCT 3780 

TGGGATTTTT CTAATTCCAT GATTCTCTAA GTATTTAGAT ATCCCATTGG CTCCTATCGT 3840 

AGTATTTACA TACTGGTCGA AAATCGTTCT TATTGCAACT GCCTCTTCCT CATTTATAAA 3900 

CAGCTTGCCG TCTTCAAGTT TATATCCATA CGGAGCAAAG CCACCATTCC ATTTTCCTTC 3960 

CCCTGCTTTT TGAATGCGAC CTTCCATTGT TTGAATACTG ATGTTTTCTC TTTCTATTTC 4020 

AGCCACAGCT GATAAAACAG AAATCATTAG TTTCCCAGCA TCTTTAGATG AATCAATGCC 4080 

ATCTTCAACG CAGATAAGAT TAACTCCATA ATCCTGCATT ATATGAAGTG TAGAAAGAAC 4140 

ATCAGCGGCA TTTCTTGCAA ATCTTGATAA CTTAAACACA AGAACAAAAG ATACTCCATC 4200 

TTTTCCAGAT TTTATATCTT CCATCATTCG ATTGAACTGT ATTCTACCTT CAATAGACTT 4260 

GTCAGACTTC CCGGCATCTT CATACTCTCC AACAATTTCA TAATCGTTGT AAATAGCAAA 4320 

AGCTTTCATT CGTGATTTTT GTGCCTCTAA CGAATACCCC TCTATCTGTA TTGACGTAGA 4380 

TACTCGTGTA TAGAGGTATA CTTTTATTTT TTCTTTTGAC ATAGTATTAA CCTCAATATA 4440 

ATTTTTCTAT ATCATATATA ATTTTTTTAA TTTAAGTTTG GACTATCATT TCAAGTATAT 4500 

TATAACACTT TTATTAGTCC GTCTCAATTT GTGTTTTTGC CATGTCAAAA CTATTTTTCA 4560 

TCTCTTGATT TTTTGCTGGC GTTGGATCGG GTAGATTATC TAAATCTAAA GCACCAGCAT 4620 

ATTTTGCAAT CAGATTTGCT ATTAAATCAG CCAATCCATT CCAGTCATTG TCCAATATAT 4680 

ACCTCCTCTA AAGTTTTATA TCTAATAATT ATTTGTTTAA TTAAGTTTTT TGACATTGAC 4740 

AAGTGCTTTG GATTAGCAAC ATAGGAATCT CACTTCCGCC TCTATTCCGG ATGAGCCGGC 4800 

TTCAACCTTA GAAGTATCAT TACCCTCATT TTCTTCATAG CGGATAGGGT ATCCCTCCCT 4860 
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ATATTCAAAC TCTTACTTAT CGCTCACTTT CTTTTTGCTT AGCAGAACTT TTTTTGCCGA 4920 

ATTATTCAGC CGAAAGATCT TGACGGATAG GTTATTACGC TCCAAAAATA ATTAACGTCT 4980 

TGTCTTGGTC TATTCAATTG TTAAGGTTCA AAATTTATCG AGAGTTATTA ATCTTTTTAA 5040 

AATTTGACCA TCAGAAAATA TTTATCTTGA TGTAACAAAA TTCTATAAAT TACCCTCTTA 5100 

TACTTAACAG TGAAAAGAAG TCTTTCTTGG TAACCAATTT TGAAATAGAA TTTGCTTATA 5160 

* TAAAAAGGTC CAATTCCCAC TGCATAAATA GCAGTGAAAA TTAGACCCTC TTGGTAACTG 5220 

TCATCTAAAA GTCTTCTA 5238 



(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13425 base pairs 
<B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
<D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 

GACGATTTAC GAAGAATCGA ACAAGAACCT GCTCCTATCA ATTCCCAACC TCTATCTCTA 60 

AAATCTTGCA GTTCATGCTT ATACTTTTTT AAGAAATCTA GAATCATAGA TACGGTAGAT 120 

GACATCGTCT GGTTGACATT GGTCAAAATA GAACAAACCA AAACGACTCG TTCTATACCT 180 

CCAACCTTTC AAATGCATCT CATGTAAATG TTCTTCTTCC TTGTCCAAAT CAACAATGGT 240 

GAAAATCCGA AATTCTACTC TGCTATTCAT TGTCTTACCC CAAAATTAGA AAACATGCCT 300 

GGCGTTATTT ATTAGATAAT TCTTTCCACT TTTGACTCAA TCTCCAAAAA ATATAAGAAA 3 60 

TCTGAATCGC AAAAACTATC AATAAAACCC AATCTATTAT GAAAATCAAA AACACTTTCC 420 

AACTGAAAGA ACTACCTCCA GTGACAAACT TTGAGAAAAA CGGTAGTAGA GCTAAAAAGA 480 

GAAATAAAAT AGGAAGCATC CGCATTGTTA AAATCCGTTT GGCATAAAAA AATCTTTATT 540 

TAAACGAAAA TATTATGGCA AAATTTACGC CAGTTTTTGA ACGGCTGATG TAGATATTTT 600 

ATACTTTCAA AATGTTTAAA TGTGATTATT TATTTTTGAA AAATAGATCA CCAGCCCGAC 660 

TGAAAGTGCT TATAGAATGA TAATAAGTCG CCTGCCGAAA ACAGCGAAAA ATAGCGGTGT 720 

TATGCGGAGA TAATCTGACG CGATGCGAAA GTATATTGCA TACTTATTTT CAACAATTTA 780 

GCAGAGTATT TTTATAAGTG TGATATAATA GAAGTATAAT TTGTTCTGAT AGTTTATTTT 840 

ATGGAGAAGT AGATTTTTAG AATGCGGAGG GTTCAATATG GTTGAGTTTA TAAAGTCTAA 900 

GAAAGAAATG AGTGAGGAGG ATATTAAAGC AAATTTCATC ACTCCTGCTA TTGTATCCAA 960 
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AGGATGGAAA AATGGTGAGC ATATCGCTTA CGAAGAATAC TTCACTGATG GTCGAATTGA 1020 

AGTTAGAGGA GATAAGGCTC GTCGTAAAGA AGGAAAAAAA TCAGACTATT CACTGTATTA 1080 

CCAATTTGGA ACTCGAATTG CAATTGTTGA GGCAAAGGAT AATAAACACA GCGTTCGAGC 1140 

AGGATTACAA CAAGCTATTG AATATGGAGA GATTTTAGAT GTTCCATTTG TTTATTCTTC 1200 

GAATGGTGAT GGCTTTATTG AACACGACCG TATCACGAGA GAAGAACGTG AGCTGGAGTT 1260 

AGACGAATTC CCTACTCGTG AAGAATTATT TTCTCGTATG ACGAAGGAAA AAGGATTGAC 1320 

GTACGAAATT ACAGAAGCTA TCTCAACTCC ATACTATACA GACGCCTTCT CAATGAAAAC 1380 

GCCACGCTAT TATCAGCAAA TAGCTATCAA CCGTACTATT GAAACAGTTG CCAGAGGACA 1440 

AAAACGAGTA ATGTTTGTGA TGGCAACAGG AACGGGGAAA ACGTTCATGG CTTTTCAAAT 1500 

TATTCATCGC CTTCGAAAAG CTGGTTTGGC TAAACGAGTT TTATTCTTAG CAGATAGAAA 1560 

CATCTTAGTA GACCAAACGA TGGCTGAAGA CTTTAGGCCA TTCGAAAAGG TAATGACGAA 1620 

AATTACACCA AAACTTTTGA CTGCTCCTGA AAAATTAAAT TCTTTTGAAA TTTATCTAGG 1680 

GCTTTATCAG CAACTAACTG GTGAAGATGG AACTGAAACA CATTATCAAA AATTTGACAA 174 0 

AGACTTCTTT GATTTAATCG TAATTGATGA AGCGCACCGT GGTTCAGCTA AGGAAAACAG 1800 

TAACTGGCGT AAGGTAATTG ATTATTTCAG TTCTGCGACA CAGATTGGGA TGACCGCTAC 1860 

TCTTAAAGAA ACCAAGAATG CTTCCAATAC GGAATACTTT GGTGAGCCAA TCTATACTTA 1920 

TAGTTTAAAA CAGGGAATCG AGGATGGTTT TTTGGCTCCA TATCGTGTTA TGAGGGTTAA 1980 

TTTAGATGTG GATGTGGATG GTTATCGTCC AGAAACTGGA AAAGTTGATG CTAACGGACA 2040 

ATTAATAGAA GATAGGTACT ACGGCAGGAA AGATTTTGAT AAAACCATTG TCATTGATGA 2100 

TAGAACGCAA AGAGTTGCCA AGTTTGTTTC TGATTATATG AAGCAAAACA ATGCACGATT 2160 

TGATAAAACA ATTGTTTTTT GTGTTGATAT TGACCATGCC GAGCGAATGC GTGCTGCACT 2220 

TGTAAAAGAG AATCTAGACT TAGTCCAAGA AGACTATCGT TATGTCATGC AAGTAACTGG 2280 

TGACAACGCT GAAGGAAAAG CTCAACTGGA TAACTTTATG GATGTCAATT CTAATTTTCC 2340 

CGCTATTGTA ACAACGTCTA AATTATTAAC GACAGGAGTT AATGCTAAAA CATGTCGTTT 2400 

GATTGTTTTA GACTCTAATA TCCAATCCAT GACTGAATTT AAACAAATTA TTGGTCGTGG 2460 

CACACGTCTT TATCCTCAAA AGGGGAAAGA ATTTTTTACG ATTATTGATT TTCGAAATGT 2520 

TACCAATTTG TTTGCTGACC CTGATTTTGA TGGTGATCCA GTGAAGGTGC TAGAAACAGG 2580 

TGCGAAAACA GTCAGTGGTT CTACGCCCGG TTTCGTAGAT GAGGAAGGTG ACCCAGTAGA 2640 

AAAATATATC GTTACAGACA AGCAGGTTAC CATTCTTAAT TCTACTGTTC AAGTATTGGA 2700 

TGAAAACGGG AAACTGATTA CCGAAAGCCT GACCGACTAC ACTCGAAAGA ATATCTTAGG 2760 
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TAGCTACGCC ACTTTGAACG ATTTTATCAC AGTTTGGCAT ACGGCAGATA AGAAGAAGCT 2820 

TATCTTAGAC GAACTTTATA AAAAAGGAGT TTATCTAGAT GCTATTCGAG AGTCGGAGGG 2880 

AATATCAGAA CAAGAAATCG ATGATTTTGA TTTACTCCTA AAACTTGCCT ATGGTCAAAA 2940 

AGAATTAACC AAAACGGAAC GTATCAATAA ACTCAAACAA AGCGGATATT TATATAAATA 3000 

TAGTGAGGAA GCGCGTGCTG TTTTGGAAAT TTTACTGAAC AAATACATGG ATAAAGGTAT 3060 

- TGGAGAACTC GAAAGCATTG AAACATTAAA ACTTCCAGAA TTTCAGATAT ATGGTGGAAC 3120 

t CTTCAAAATC ATCAATACTT ATTTTGGAGA TAAAAAACGA TATTTACAAG CAATTAAAGA 3180 

ATTGGAGCAA GAGCTATTTA CAGTAGCTTA ATGAAAGGAA AGTATGTCAA TTACATCATT 3240 

TGTAAAAAGA ATTCAAGATA TCACTCGAAA CGATGCTGGT GTTAATGGTG ATGCTCAACG 3300 

TATTGAGCAA ATGTCTTGGT TATTATTCTT AAAAATTTAT GATAGCCGTG AAATGGTTTG 3360 

GGAATTAGAA GAAGACGAGT ATGAGTCAAT TATCCCAGAG GAATTAAAAT GGCGAAATTG 3420 

GGCTCATGCT CAAAATGGGG AACGGGTATT GACAGGCGAT GAATTACTTG ATTTTGTCAA 3480 

TAACAAGTTA TTCAAAGAGT TGAAAGAGCT TGAAATAACT TCAAATATGC CT ATT CG AAA 3540 

AACGATTGTT AAATCAGCTT TTGAAGATGC GAACAACTAT ATGAAAAATG GCGTCTTGTT 3600 

ACGCCAAGTC ATCAATGTTA TTGATGAAGT TGATTTCAAT AGCCCTGAAG ATCGTCATTC 3660 

GTTTAATGAT ATTTACGAAA AAATTCTTAA AGATATTCAA AATGCTGGGA ACTCAGGAGA 3720 

ATTTTATACG CCACGTGCAG CGACTGATTT TATTGCCGAA GTTCTTGACC CAAAACTTGG 3780 

AGAATCAATG GCAGACCTTG CTTGCGGAAC AGGAGGCTTC TTGACTTCGA CTCTGAACCG 3840 

TTTAAGTAGT CAACGTAAAA CTAGTGAAGA TACCAAAAAA TATAATACAG CTGTTTTTGG 3 900 

TATTGAAAAG AAAGCATTTC CTCATCTTTT AGCAGTTACA AATCTGTTTC TTCACGAAAT 3960 

TGATGACCCT AAAATTGTTC ATGGAAATAC TTTGGAGAAA AATGTTCGTG AATATACGGA 4020 

TGATGAAAAA TTTGACATTA TTATGATGAA TCCACCTTTT GGAGGGTCAG AATTAGAAAC 4080 

AATAAAAAAT AACTTTCCAG CAGAATTACG GAGTTCTGAA ACAGCTGATT TATTTATGGC 4140 

TGTCATTATG TATCGTTTGA AAGAAAATGG TCGTGTTGGA GTTATTTTAC CTGATGGTTT 4200 

TCTATTTGGT GAAGGTGTAA AAACTCGCTT GAAACAAAAA CTGGTAGATG AGTTCAACTT 4260 

GCATACGATT ATTAGGTTGC CTCATAGTGT CTTTGCACCG TATACAGGAA TCCATACGAA 4320 

CATTCTTTTC TTTGATAAAA CAAAGAAAAC AGAAGAAACT TGGTTTTATC GTTTAGATAT 4380 

GCCAGATGGT TATAAAAATT TCTCGAAAAC TAAGCCGATG AAGTCAGAAC ACTTCAATCC 4440 
TGTTCGTGAC TGGTGGGAAA ATCGTGAAGA GATTCTGGAA GGTAAGTTCT ACAAATCTAA ' 4500 
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ATCATTTACA CCTAGTGAAT TGGCTGAGTT GAATTATAAT TTAGACCAGT GTGACTTTCC 4560 

AAAAGAGGAA GAGGAAATCT TAAATCCCTT TGAGTTGATT CAGAATTATC AAGCGGAAAG 4620 

AGCAACTTTA AATCATAAGA TTGATAATGT ATTAGCTGAT ATTTTGCAGT TGTTGGAGGA 4680 

CAAATAATGA CACCAGAACA ACTTAAAGCA AGTATTCTCC AAAGAGCGAT GGAAGGGAAA 4740 

TTAGTGCCGC AAAATCCCAA TGACGAACCT GCAAGTGAAT TATTAAAGAG AATTAAAGCT 4800 

GAAAAAGAAA AACTTATCAG TGAAGGAAAA ATCAAACGAG ATAAAAAGGA AACTGAGATA 4860 

TTTCGTGGTG ATGATGGGAA ACATTATGGG AAGTTTGCTG ATGGAAGCAC TCAAGAAATT 4920 

GATGTTCCTT ATGATATTCC TGATACTTGG GAGTGGGTGA GGTTTTCTAC ATTGGTTGAA 4980 

ATTGTCAGAG GTGGCTCTCC ACGACCAATC AAaGATTATC TTACTTCTGA AGTAGATGGA 5040 

ATAAATTGGA TAAAAATAGG TGATACTGAA AAGGGTGAAA AGTATATAAA TAATGTTAAA 5100 

GAAAAAATCA AAAAATCAGG GCTTAACAAA ACTAGATTTG TAAAAAAAGG TACATTTTTG 5160 

TTAACTAATT CTATGAGTTT TGGTAGACCT TATATTTTGA ATGTTGATGG TGCAATACAC 5220 

GATGGATGGT TGGCTATTTC GAACTATGAA AACTCATTAA ATAAAGATTA CCTATTCTAT 5280 

ATTCTTTCAT CAAATGTAGT TTATTCTCAA TTTCTATCTC TAATTAGTGG AGCTGTTGTG 5340 

AAAAACTTGA ATAGTGATAA AGTTGCTTCT ATTCTTATCC CTCTCCCCCC ACTATCCGAA 5400 

CAACAACGAA TAGTAGAAGC AATCGAATCA GCTTTAGAAA AAGTAGATGA ATATGCTGAA 5460 

AGTTATAATA GACTAGAACA GCTAGATAAA GAATTTCCAG ATAAACTAAA AAAATCTATT 5520 

CTTCAATATG CTATGCAAGG AAAATTAGTT GAACAAGACC CAAATGATGA ATCAGTCGAA 5580 

GTTTTACTTG AAAAAATACG AGCAGAAAAA CAAAAACTCT TTGAAGAAGG CAAGATTAAA 5640 

AAGAAAGATT TGGACATTTC TATTGTTTCC CAAGGAGATG ATAACTCTTA TTATGGGAAT 5700 

ATACCTATGA ATTGGGTTGT TATAAAAATA AAAGATATTT TTTCAATAAA TACAGGTCTT 5760 

TCTTACAAGA AGGGCGATTT AAGCATTAAT AATAAAGGTG TTAGAATTAT ACGTGGTGGT 5820 

AATATTAAGC CTTTAGAATT TTCTCTGTTG GATAATGATT ACTACATTGA TACACAATTC 5880 

ATCTCCTCTG AGCAAGTTTA TTTAAAACAT AATCAGCTAA TAACACCTGT ATCAACCTCT 5940 

TTAGAACATA TTGGAAAGTT TGCAAGAATC GATAAAGACT ATGATGGTGT TGTGGCTGGT 6000 

GGATTTATTT TCCAATTAAC ACCATTCGAA AGTTCAGAGA TTATTTCAAA ATTTCTATTA 6060 

TTTAACTTGT CCTCTCCGTT ATTTTATAAA CAATTGAAAG CAATAACTAA ACTATCAGGT 6120 

CAAGCTTTAT ATAATATTCC TAAAACTACA CTGAGCGAGC TATTAATTCC GTTAGCTCCT 6180 

TTTGAGGAAC AGGAACTTAT TACTCAAAAA GTTGAGAAAC TTTTTGAAAA AGTAAATCAA 6240 

CTTTGAAAAT GATTCTTTTC ATCTCTTCAT GATTAGAAAT AGGGATTAAT AATTCGGAGA 6300 



i 



WO 98/18931 



PCT/US97/19588 



1013 

TACTGGTACT ATTTAATGTT TTCCCTTTGA TAGCATCTTT TGAATCACCT AAAGTAGAGA 6360 

TAAGTGGCAA AAATATCATT AAGTAATCTC TGATAATATT TTCTTTATTA GCATAGGGGA 6420 

ATATCGATAT AATGGCTTCA TTATGAGTGG CAGGAATATC CAATATGGCA ACTTTTCCAA 6480 

TAGATAATTT AAAACTCATT AATAAAGTTC CTTTAGGTGA AATGTCTATT TTCTTTGATT 6540 

TTAATGCTAA TTTAGAAATA GATTCTCTCG CATTAGTTAC ATAACCAGAT ATAGGCATAT 6600 

-CTGATATAGA TACCCAAGGT ATTTCAGTTC CCCAAAAAGT AGCTTCACTG CGTGGAGGAG 6660 

TTTTTCCTAT TCTGAAGTTA ACTAGGCTAG CAAATTTAAT ATATCTCCAT GCTTCTGGGA 6720 

TTTCATATAT AGGATAAGAG GTTGTTTCGT CTTTGTTCCC ATAATAAGAG CCATAATCAC 6780 

AAAAATAGCA GGTAGTCAGT TTGACCACCT GTTATTTTTT ACCAATTAAC AATTTTATCT 6840 

ACAATATTTT GTTGTTCAGT AGCTGTTTTC CTTAGATAAA TTCGAGTAGT TTCTATACTT 6900 

TCGTGTCCCA TCAAATCTGC AAGCAAGGCA ATATCATTAT ACTTCGCTAA AAAATTCTTA 6960 

GCAAATAAAT GCCTAAAAGA ATGAGGGTAA ATTACGTTAG GATTCATTTT GTATTTATCA 7020 

GCATAATTTT TTAACTGTTG AGCAACTCCT CTTGCTGTAA TTGGTTCGTT AAATTTATTC 7080 

AAAAATAAAT AACCACTTCG GCGATTTTCT GATTCTAACC AACTAAGACA ACTATTTCTT 7140 

AATTTTTTAG GAATGTACAG TCTACGAATT TTACCACCTT TTGAGTAAAT GTCAAAATAA 7200 

CCGATTTCTA CATGCTCTAC TTTTAGTTTA ATAAGTTCAC TTACACGAGC CCCAGTTGCA 7260 

CCTAAAAACC AAACGACAAA ATGCCATTTT AAAATACCAT CTTTTTTCAA ACTACGTTTA 7320 

AGAAAAAGGT AATCAGCATG GCTAATGACA TCTTCTAAAA ACGGTTTTTG CTGTACTTTG 7 380 

ACAAATTTTA ATTTCAAATC ATCATGACCA ATAAAAGCCA GATATTTATT TACTCCTTGT 7440 

AGTCGCAAAT TGACAGTTTT AGGTTTAAAA TTGTCTAATA AATATCCTTT GTATTCAAAT 7500 

AAATCTTCCA TTTTGAGTTC GTAATTCTCC AAGAAAAATC GAACACCATA AAGGTAGGAA 7560 

CGCACAGTAT TTTCAGCTAA ACCAGCTTTC TTCAAATGTA ATTCAAAATC TTTCAACGTA 7620 

AAACTCCTAT CTTATGTTTG ATAGAAATTC CACCGCACGT AAAACTATTA TACTAAATTA 7680 

GTGCGTCAAT ATGGGCGAAA AATTGTTCGA TTTTATCAAC GATTCTGGAT TGTTCAGGAA 7740 

GGGGTGGGAG GGGGATTAAA TATTCTTTTA TAGTTTTCGT TAATAATTCT TTTTGTTTTG 7800 

TACTACCCGA CGCTTTTTCT TCAATAACTG ACTGAACAAT AGGAGAGGAA AGAAAATTAT 7860 

AGATGAAATG GCAATTAATA ACCCCCGATA AGACTCTTAT AACTGTAACA TGGCTATCTG 7920 

CAACAGCCCA GCCATAAGGA TTTTTATTTT CATGGTAAAT AGCTAATCGT CCTAACGTAC 7980 

CTAGACCTGT TGAATTCCAC ATTAAATCAC CATCTCTTAG TAATCTTTCT TTCTGGTAAC 8040 
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TATGAACTGT TTCGGGATCA ATAAATCTTG CTAAGTCAAT AGAAAAGCCA GACCATTGAT 8100 

TACATTTCTG AGCAATCACA GGGTATATAG GAATATTTGA ATATTTTGGA GACTTCCCTC 8160 

TTTGAATGTA GGAGGTTATA TCGTTTAACC TCACCCATTC CCAACTTTCT GGTATTTCAC 8220 

AAGGTACTTC CTCATAATAA GAGTTATCAT CTCCTTGGGA AACAATAGAA ATGTCCAAAT 8280 

CTTTCTTTTT AATCTTGCCT TCTTCAAAGA GTTTTTGTTT TTCTGCTCGT ATTTTTTCAA 8340 

GTAAAACTTC GACTGATTCA TCATTTGGGT CTTGTTCAAC TAATTTTCCT TGCATAGCAT 8400 

ATTGAAGAAT AGATTTTTTT AGTTTATCTG GAAATTCTTT ATCTAGCTGT TCTAGTCTAT 8460 

TATAACTTTC AGCATATTCA TCTACTTTTT CTAAAGCTGA TTCGATTGCT TCTACTATTC 8520 

GTTGTTGTTC GGATAGTGGG GGGAGAGCAA TTAATAATAG ATTAAAATTA TAATCATTGA 8580 

TTGCAGGATA ACTTGTTCCA GTAGATTTAT TATTAACACG ATTGATAAAA TTATCTGATA 8640 

ATAAATAATA TTTCAAATAT GTTTCGTTAA GTAAAGTATC CAAAACAATA AATGCTGTAC 8700 

TAGCTATCAA ATACTCTTTA AGTTCTCTAA CTACAGCAAT ATTTTTTAGA TATGGTCTAA 8760 

CTGTTGAAAA TAAGACACTA TTCTGCGAAA CTAATTTTCT AGCACGGGAA GGCGCTTGTT 8820 

CAGGTGAAAG ATATTGTAGA TTTTTGTAGT TGATTATGTT CTTTTTTCTA TCAATACTAG 8880 

ACGTATCTAT ATACCTAAAG GATTTCTCTG GCTTATTTTG CCCAAAATTC CAATAAATTG 8940 

ATTTTATCCT CACCCACTCC CAAGTATCAG GAATATCATA AGGAACATCA ATTTCTTGAG 9000 

TGCTTCCATC AGCAAACTTC CCATAATGTT TCTTATGTGC TTCAAGTATA TAAAAAGGCG 9060 

TAAAAATACG CCTATAGATA ATGGGGTTGA AATAGGTTTA TTGTTGATGA GATTGTAGAT 9120 

AATTCAATTT TTTACTTCCA ATCGAATATT CAAATCCTCC ACCTTTTCTG CCTGTAATTG 9180 

TTCATCATAA AATTCAATAT CTTCAGGATT TTCCCCTTGG CAACCTCGGC AGAAATATTC 9240 

TTCCGCTCGA TCAGGATTCA AAAATCGACA AGCACAAACA AAACAGTCGC CATCATCATT 9300 

TATTGAGATA ATATAGTAGA TTGAAATAAG ATGTAAACAA ATCGATTAGG AAAGTTAAAT 9360 

TAGTTTCTAG AAATTTTTAG CAGATGTAGT GTACTATTCT AGTCTCAATT TACTATGGCT 9420 

TCAAATATAT CTTTCGAAAA AATATTTACA GATGTGTAAT TTTGAAGCTT GCAAAAGTTA 9480 

GTAAACTTGT AGATTTCGAT TTGAAGTAAC TTGTTTTCTT GCCCGATATT GTTTTTGAAA 9540 

TTGAATTTTT CCATAGTGAC TCCTTAATTT TCTTCTACAC GTCTGATGAT AAATCTAATT 9600 

CGCAAAAGAG TCAAGAGGAT TTTTCGAAAA ATAAATAGCG ACCGAAATCG CTATTTTAAG 9660 

GGTTATAGGT ATTTGATGGC TTAGACTGCT GTGTGACTGT TTACCCACAG GCAATCTTTC 9720 

TTCTATATTA GTATTAGTAA AGGTCTAAAT AATTATCAAT TTCCCATTGT GAAACGAAGG 9780 

TTGCATAACT TGCCCATTCG ATTCGTTTGG CTTCAAGGAA GCTAGTATAG ATGTGATCTC 9840 
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CGAGAGCAGC TTTAACCACT TCATCTTCTG TCAAAGCTTT CAAAGCGTTG TGAAGAGTTG 9900 

ATGGAAGGTC TGTAATACCA GCTTCCTTGC GCTCTTCTGC TGTCATGATG TAGATATTTT 9960 

CTTCGATAGG AGCTGGTGCT TCGATTTTAT TTTCAATACC ATACAAACCA ACTTCCAAAA 10020 

GAACAGCCAT AGCAACGTAA GGGTTCGCCA TTGGATCCAC TGAACGCAAC TCAAGACGAG 10080 

TTCCCATACC ACGTGAAGCA GGTACGCGCA CAAGTGGCGA ACGGTTACGA CCAGCCCAAG 10140 

" CAATGTAAAC AGGCGCTTCA TAACCTGGAA CCAAACGTTT GTATGAGTTA ACTGTTGGGT 10200 

TCATGATGGC AGTATAGTTG TAAGCATGCT TGATCAAACC GCCTAGGAAA TGGTAAGCTG 10260 

TTTCTGACAA CTGCATTCCT TTTGGATCAT TTGGATCAAA GAAGGCGTTA TTTCCTTCTG 10320 

CATCAAACAA GGACATATTA CAGTGCATAC CTGATCCAGC AATACCAAAT TTTGGCTTCG 10380 

CCATAAATGT TGCGTAAAGT CCGTGTTTGC GAGCAATGGT TTTAACAACA AGCTTAAAGA 10440 

TTTGAATCTT ATCACAAGCA CGGAGAACTT CATCGTACTT AAAGTCAATC TCATGCTGTC 10500 

CAACCGCAAC CTCGTGGTGA CTCGCTTCTA CTTCAAATCC CATTTTGGTC AAGACATTCA 10560 

CAATCTCACG ACGTGTGTTG TCCGCAAGGT CAGTAGGTGC CAAGTCAAAG TAGCCACCCT 10620 

TGTCATTCAC TTCAAGTGTT GGGTCCCCAT TTTCATCCAA CTTAAATAGG AAGAATTCTG 10680 

GCTCTGGACC AAGGTTGAAG GATTTGAATC CAACTTCTTC CATGTGACGA AGAGCTCGTT 10740 

TCAAATTACC ACGAGGGTCA CCCGCAAATG GTTCACCTTC TGTTGTATAG ACATCACAGA 10800 

TCAGACCTGC AACACTTCCA TTTTCATCTC CCCAAGGGAA GACTGTCCAT GTATCCAAGT 10860 

CCGGGTACAA GTACATATCC GACTCATTGA TACGTACAAA ACCTTCAATA GAAGATCCAT 10920 

CAAACATAAC CTTGTTCGAC AAGACCTTAT CTAACTGTTC ATCTGTAGCA GGAATTTCGA 10980 

CGTTTTTCAT GGTTCCCAAA ATATCTGAGA ACATAAGACG AATAAAGGTA ACATTTTTTT 11040 

CCTTGACTTC ACGACGAATA TCTGCAGCTG TGATTGGCAT AAGTTTTCTC CTTAATCTAT 11100 

GACTACTTGC GGTTGCCTAA CCGCGACCAA AAGGTGACTG TACTGAAGCA AAACGCCCCT 11160 

GTTGGAGGAG TTCATTGTGA AGTGCACGAC GTACTTCAGT CTGACTAACC GCTTTCTTGG 11220 

ATTTCGCTTC ACGTTCAGCA TATTTTTTCT TAATGGCAGC GATATTATAA CCTTCAGAGA 11280 

TATAATCTTT GATTTCAAGC AGACGATCCA TGTCATTCAA GGAATACATG CGACGATTTC 11340 

CTTCGTTTCG ATCGGGCTTG ATCAACTCTT GATCTTCATA ATAACGAATC TGACGCGCCG 11400 

ATAGATCGGT CAACTTCATA ACACTGCCGA TAGGAAAAAC AGCCATATTT CGGCGAAATT 11460 

CTTTTTCCTT CATTTACAAT TTCCTTCTTT CTGTCTATTA TAGTCTAAAA AAAGACAAAC 11520 

GTCAATTGAT AATGTTATAA AATGTAACAT TATTTTTCTT TTTTCTCTAA AAAGAGACGA 11580 
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ATACGATCAA TATCGTAATT TACGATAATT GCGACAAAAA CTCCCATAAA CGTTTCTAAT 11640 

ACACGCACAA ACACGTACAA AATTGTCTCA CCACTTGGAA TTGATAGGGT AATGATTAAC 11700 

ATAGCTGCTA CACCACCAAT AACCCCTGCT TTGTTATTCA TGGCTACATT TGTCATAATG 11760 

GTTAACATGG TGCAGATTGG AACAACTACC AAGGTCACCC AAAAGGCTTC GTGGAAAAAG 11820 

GTATTTAATA AGAAGAAGAC CAAGGCATAG AGTCCACCGA TACTATTTCC TAGAATACGC 11880 

GAAGTCCCAA AATGAACACT CTCATCAAAA CTCTCCCTCA GGCTAAAAAC GGCTGTCAAA 11940 

GCACCAATTT GAAGACCTTT CCAGCCAAAA AAGCCAAAAA TCAAGAGAAC TAGAAAAACA 12000 

GCAATACCTG TTTTAAAGGT TCGCATACCA AGTTTGAACT GGGATTTATC GAATTTATAT 12060 

TTTTTAAAAT AACTCATAAT CTCAACTTTC TATTTCCATT TTATCATAAA TCGGTGATTT 12120 

TTATGAGTAA TAGTTGAGAG GAAGCGTTTT TATTTTAAGC AAAAGAAAAG AGGAACTTTC 12180 

ATCCCTCTCT TCTTTGATTT ATTTATAAAA TCTTATTTTT CTGTCAAGGC TGCAAGTCCT 12240 

GGAAGAACCT TACCTTCAAG AAGTTCCATT GATGCTCCAC CACCCGTACT AATCCATGAG 12300 

AACTTGTCTG CACGGCCAAG GTTAATCGCT GCGGCAGCTG AGTCACCACC ACCGATGATT 12360 

GATTTAACTC CTGGTTGTTT CACGATAGCG TCCATCACAC CGATTGTACC AGCTTGGAAA 12420 

TCTGGGTTTT CAAATACACC CATAGGTCCG TTCCATACGA CTGTTTTGGC ACCAGTCAAA 12480 

GCTTCGTCAA ATTTGGCGAT AGATTTTGGA CCGATGTCAA GACCAAGGAA GCCTTCAGAA 12540 

ACTGCTTCAC CTTCAGTGTC ACGCACTTCA GTGTAACCAG CAAATGCGTT AGCTTCTTTT 12600 

GAGTCAACTG GCAAGATCAA TTTACCATTT GCTTTTTCAA GAAGAGCTTT CGCAACATCC 12660 

AATTTGTCTT CTTCTACAAG TGAGTTACCG ATTTCGATAC CTTGTGCTTT GTAGAATGTG 12720 

TAAGTCATCC CACCACCGAT AAGGACGTTA TCAGCTTTTT CAAGCAAGTT TTCGATAACA 12780 

CCGATCTTGT CTGAAACTTT TGAACCACCA AGGATAGCCA CGAATGGACG TTCTGGAGTT 12840 

TCAACTGCTT CTTGGATGTA GGCAATTTCG TTTTCAAGAA GGAAACCAGC AACTGCTTTT 12900 

TCAACGTTTG CTGAGATACC AACGTTAGAT GCGTGTGCAC GGTGAGCTGT ACCGAATGCA 12960 

TCGTTTACGA AGATACCATC TCCAAGTGAT GCCCAGTATT TACCAAGTTC AGGATCGTTT 13020 

TTAGATTCTT TCTTGCCGTC AACATCTTCG TAACGAGTGT TTTCAACCAA GAGAACTTGT 13080 

CCATCTTCAA GAGCGTTGAT TGCCGCTTCT AATTCAGCAC CACGAGTGAC ACCTGGGAAA 13140 

ACAACATCTT GACCAAGTTT TGCTGCCAAG TCAGCTGCTA CAGGAGCAAG TGATTTACCA 13200 

GCTTTATCAG CTTCTTCTTT CACACGTCCA AGGTGAGAGA AAAGAATTGC ACGTCCACCT 132 60 

TGTTCGATGA TGTACTTAAT AGTTGGAAGA GCTGCTGTGA TACGGTTATC GTTAGTGATT 13320 

ACGCCATCTT TCAATGGTAC GTTGAAGTCA ACACGAACGA GGACTTTTTT ACCTTTCAAG 13380 
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TCAACGTCTT TAACAGTAAG TTTTGCCATG TTACAAAAAC TCCGG 13425 
(2) INFORMATION FOR SEQ ID NO: 152: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 905 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 

GATTTATCCT ACCGGnGAAT TTCCGGAGGG GTTCTAGCAG CAATCTTAGG AATCTATGAA 60 

CGAATGATTG GCTTTCTGGC CCATCCCTTT AAAGACTTTA AAGAAAATGT TTTGTACTTT 120 

ATTCCAGTTG CCATCGGTAT GCTTCTGGGA ATCGGCTTAT TTTCCTACCC GATTGAATAC 180 

CTGCTTGAAA ATTATCAGGT TTTTGTATTA TGGAGCTTTG CGGGAGCTAT TATCGGTACA 240 

GTTCCTAGCC TCCTCAAAGA ATCAACTCGA GAATCTGACC GAGACAAGAT TGATTTAGCT 300 

TGGTTATGGA CAACCTTTAT CATTTCTGGA TTAGGACTCT ATGCCTTAAA TTTTGTCGTT 360 

GGAACCTTAA GCGCCAGCTT TCTTAACTTC GTCCTAGCAG GCGCACTATT GGCCCTTGGC 420 

GTCTTGGTTC CTGGCCTCAG CCCATCAAAT TTACTTTTGA TTTTGGGACT CTATGCTCCT 4 80 

ATGTTGACTG GTTTTAAAAC TTTTGATTTC TTGGGAACCT TCTTTCCGAT TGGAATTGGT 540 

GCAGGTGCAA CTCTCATCGT TTTTTCAAAA TTGATAGATT ATGCCTTAAA CAACTACCAC 600 

TCACGCGTCT ATCATTTCAT CATCGGTATC GTCCTATCAA GTACCCTTTT GATCTTAATT 660 

CCAAATGCAG GAAACGCTGA AAGTATCCAA TACACAGGAC TTTCACTTGT CGGTTATGTC 720 

ATCATCGCCT TCTTCTTTGC GCTGGGAATC TGGCTTGGTA TTTGGATGAG TCAATTGGAG 780 

GATAAATATA AATAATGGCA AAAAAAGTTA AAATCAAAAA AACATTGGTG GAACAAATCC 840 

TATCTAAAGC AGCTATCCCT CATCAGGGGA TTCAAATCAA TGCCCTAGAA GGAGAGCTTC 900 

CTCAA g05 
(2) INFORMATION FOR SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4278 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: 
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CTTGAATTAA ATAAAAAACG TCATGCGACT AAGCATTTTA CTGATAAGCT TGTTGATCCC 60 

AAAGATGTGC GTACGGCTAT CGAAATTGCA ACCTTAGCGC CAAGCGCCCA CAACAGCCAG 120 

CCTTGGAAAT TTGTGGTGGT ACGTGAGAAA AATGCTGAAC TGGCAAAGTT AGCTTATGGT 180 

TCCAATTTTG AACAGGTATC ATCAGCGCCT GTAACCATTG CCTTGTTTAC AGATACGGAC 240 

TTAGCCAAAC GTGCTCGTAA GATTGCCCGT GTTGGTGGTG CTAATAACTT TTCTGAAGAG 300 

CAACTTCAAT ATTTTATGAA AAATCTGCCA GCTGAGTTTG CCCGTTACAG TGAGCAACAA 360 

GTCAGCGACT ACCTAGCTCT CAATGCAGGT TTGGTTGCCA TGAACTTGGT TCTTGCATTG 420 

ACAGACCAAG GAATTGGTTC TAACATTATT CTTGGTTTTG ACAAATCAAA AGTTAATGAA 480 

GTTTTGGAAA TCGAAGACCG TTTCCGCCCA GAACTCTTGA TCACAGTGGG TTATACAGAC 540 

GAAAAATTGG AACCAAGCTA CCGCTTGCCA GTAGATGAAA TCATCGAGAA AAGATAGAAA 600 

GAAGAAAAAA TGACAGCAAT TGATTTTACA GCAGAAGTAG AAAAACGCAA AGAAGACCTC 660 

TTGGCTGACT TGTTTAGCCT TTTGGAAATC AATTCAGAAC GTGATGACAG CAAGGCTGAT 720 

GCCCAGCATC CATTTGGGCC TGGTCCAGTA AAAGCCTTGG AGAAATTCCT TGAAATCGCA 780 

GACCGCGATG GCTACCCAAC TAAGAATGTT GATAACTATG CAGGACATTT TGAGTTTGGT 840 

GATGGAGAAG AAGTTCTCGG AATCTTTGCC CATATGGATG TGGTGCCTGC TGGTAGCGGT 900 

TGGGACACAG ACCCTTACAC ACCAACTATC AAAGATGGTC GCCTTTATGC GCGCGGGGCT 960 

TCGGACGATA AGGGTCCTAC AACAGCTTGT TACTATGGTT TGAAAATCAT CAAAGAATTG 1020 

GGTCTTCCAA CTTCTAAGAA AGTTCGCTTC ATCGTTGGAA CAGACGAAGA ATCAGGCTGG 1080 

GCAGACATGG ACTACTACTT TGAGCACGTA GGACTTGCCA AACCAGATTT CGGTTTCTCA 1140 

CCAGATGCTG AATTTCCAAT CATCAATGGT GAAAAAGGAA ATATCACGGA ATACCTCCAC 1200 

TTTGCAGGAG AAAATACAGG TGTTGCCCGT CTTCACAGCT TTACAGGTGG TTTACGTGAA 1260 

AATATGGTAC CAGAATCAGC AACAGCAGTC GTTTCAGGTG ACTTGGCTGA CTTGCAAGCT 1320 

AAACTAGATG CCTTTGTTGC AGAACACAAA CTTAGAGGAG AACTCCAAGA AGAAGCTGGC 1380 

AAATACAAGG TGACGATCAT TGGTAAATCA GCCCACGGTG CTATGCCTGC TTCAGGTGTC 1440 

AATGGCGCAA CTTACCTTGC CCTCTTCCTC AGCCAGTTTG GCTTTGCTGG TCCAGCCAAA 1500 

GACTACCTTG ACATCGCAGG TAAAATTCTC TTGAACGATC ATGAGGGTGA AAATCTTAAG 1560 

ATTGCTCATG TGGATGAAAA GATGGGTGCT CTTTCTATGA ATGCCGGCGT CTTCCACTTC 1620 

GATGAAACAA GTGCTGATAA TACCATTGCC CTCAACATCC GCTATCCAAA AGGAACAAGT 1680 

CCAGAACAAA TCAAGTCAAT CCTTGAAAAC TTGCCAGTTG TTTCTGTTAG CCTGTCTGAA 1740 

CACGGTCACA CGCCTCACTA TGTGCCAATG GAAGATCCAC TTGTGCAAAC CTTGTTGAAT 1800 
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ATCTATGAAA AACAAACTGG CTTTAAAGGT CATGAACAAG TCATCGGTGG TGGAACCTTT 1860 

GGTCGCTTGC TAGAACGCGG AGTTGCCTAC GGTGCTATGT TCCCAGACTC GATTGATACC 1920 

ATGCACCAAG CCAATGAATT TATCGCCTTG GATGATCTTT TCCGAGCAGC AGCAATTTAT 1980 

GCCGAAGCTA TTTACGAATT GATCAAATAA AACGATAGAA GTCTGAGATC TTATGCTTGG 2040 

ACTTCTTTTT GGAGGGAAAG TAGATGTCTC AAATCGAAAG AATCAAACAG GCTATCATGG 2100 

rGGATTCGCA GAATGCCAGC TATACAGAGC GTGGCATTGA GCCTCTCTTT GCAGCGCCAA 2160 

AAACTGCTCG CATCAATATC ATCGGTCAGG CTCCGGGACT TAAAACTCAA GAAGCAGGCC 2220 

TTTACTGGAA AGATAAAAGT GGTGACCGCT TGCGGGACTG GCTAGGTGTG GATGAAGATA 2280 

CCTTTTACAA TTCAGGTTAT TTTGCTGTTT TGCCTATGGA TTTCTACTTT CCAGGACATG 2340 

GCAAGTCGGG TGATCTTCCG CCTCGTACAG GTTTTGCAGA AAAATGGCAT CCGCAGGTCT 2400 

TACAGGAATT GCCTGATATT CAGTTAACCC TCTTGATTGG GCAATATGCC CAAGCCTACT 2460 

ATTTACAGGA GAAAATCAGT GGGAAGGTAA CGGAGAGGGT GAAACACTAT AAAGACTATC 2520 

TGCCAGCCTA TTTTCCGCTA GTTCACCCAT CACCACGAAA TCAAATCTGG ATGGCCAAAA 2580 

ATCCTTGGTT TGAGGCAGAA GTAGTGCCAG ATTTGAAAAA AAGAATTAAA ACCATTTTAT 2640 

AGTCAATGAA AATCAAAGAG CAAACTAGGA AGCTAGTCGT AGGCTGCTCA AAGTACAGCT 2700 

TTGAAGTTGC AGATAAAACT GACGAAGTCG GTAACATACG CACGGTAAGG CGACGCTGAC 27 60 

GTGGTTTGAA GAGATTTTCG AAGAGTATTA GAAGAAAAAG AATGAAAGAA ATAGCCTTTG 2820 

ACGCATTTTA CCAGCTTTAC CAAAACGACC AGCTTTCTTT AGTGGATGTG AGAGAAGTGG 2880 

ATGAGTTTGC AGCTCTTCAT TTAGAAGGTG CCCACAACCT ACCGCTTAGT CAATTGGCTG 2940 

ATAGTTATGA TTAATTGGAC AAAGATCGCT TGCATTATAT TATTTGCAAA TCTGGAATGA 3000 

GATCGGCGCG TGCTTGCCAA TTCCTATTAG AACAAGGTTA TAATGTTATC AATGTCCAGG 3060 

GTGGCATGTT AGCCTTTGAA GAACTTTAAA ATTTTGCATT TCTCCTACTT GGTGTGGACT 3120 

GGGTAGGAGA GTTTTATTTT TAGATAATTC TTATTTTTAA GAAAATTGAA AACATTTAAT 3180 

ATTTGCCTCG TGATGCTTTT TTCAGACTCC TAATCGTGGT ATACTAGGTC AGTATTTTAT 3240 

AAATATGAAG GAGATTTTTA TGGCTAAAAA AGGTACCCTA ACAGGTTTGC TCCTGTTTGG 3300 

AATATTTTTT GGTGCGGGGA ACTTGATTTT TCCGCCTTCT CTAGGTGCTC TATCTGGAGA 3360 

ACATTTTCTT CCTGCCATCG CAGGTTTTGT CTTTTCAGGC GTTGGTATCG CCGTCTTGAC 3420 

CCTTATTATT GGAACGCTAA ATCCTAAAGG ATATATCTAC GAGATTTCAA CGAAGATAGC 3480 

GCCTTGGTTT GCGACTCTTT ACCTCTCAGT TCTTTACTTG TCAATCGGTC CATTCTTTGC 3540 
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TACCCCACGT ACTGCTACAA CAGCTTACGA AGTAGGGATT AGCCCCCTTT TGTCGGATGC 3600 

AAATAAAGGA CTTGGCTTGA TTGTATTTAC GGTTCTGTAT TTTGCGGCAG CCTATTTGAT 3660 

TTCGCTTAAT CCATCAAAAA TCTTAGACCG CATTGGACGT ATTTTAACGC CAGTCTTTGC 3720 

AATTTTGATT GTTATCTTGG TCGTTCTGGG AGCTATCAAA TATGGTGGAA CAAGTCCTCA 3780 

AGCTGCTTCA CTGCTTATCA AGCTTCTGCC TTTGGTACAG GTTTCCTAGA AGGTTACAAT 3840 

ACCTTGGACG CCCTTGCCTC AGTGGCCTTT AGCGTAATCG CAGTTCAAAC CTTGAAACAA 3900 

CTTGGATTTT CAAGTAAGAA AGAATACATT TCAACTATTT GGGTTGTTGG TATCGTTGTT 3960 

GCCCTTGCCT TCAGCGCTCT TTACATCGGT TTAGGTTTTC TTGGAAATCA TTTCCCAGTA 4020 

CCAGCTGAAG CGATGAAGGG TGGAACACCA GGTGTTTACA TCTTGTCACA AGCCACTCAA 4080 

GAAATCTTTG GCTCAACAGC TCAACTCTTC CTTGCAGCTA TGGTTACCGT AACCTGCTTC 4140 

ACAACGACTG TTGGTTTGAT TGTGTCAACA GCTGAGTTCT TTAATGAGCG CTTCCCACAA 4200 

ATCAGCTACA AGGTTTATGC GACAGCCTTT ACCTTGATTG GATTTGCTAT TGCCAATTTG 4260 

GGTCTTGATG CGATTATC 4278 
(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1953 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 



ACCCGATCAA 


ATGACAAAAG 


CTAACTTTGG 


TGTCGTAGGT 


ATGGCCGTAA 


TGGGTCGTAA 


60 


CCTTGCCCTT 


AATATTGAAT 


CTCGTGGTTA 


CACAGTTGCT 


ATCTACAACC 


GTAGTAAAGA 


120 


AAAAACGGAA 


GATGTGATTG 


CTTGCCATCC 


TGAAAAGAAC 


TTTGTACCAA 


GCTATGACGT 


180 


TGAAAGTTTT 


GTAAACTCAA 


TCGAAAAACC 


TCGTCGTATC 


ATGCTGATGG 


TTCAAGCTGG 


240 


ACCTGGTACA 


GATGCTACTA 


TCCAAGCCCT 


TCTTCCACAC 


CTTGACAAGG 


GTGATATCTT 


300 


GATTGACGGA 


GGAAATACTT 


TCTACAAAGA 


TACCATCCGT 


CGTAATGAAG 


AATTGGCAAA 


360 


CTCTGGTATC 


AACTTTATCG 


GTACTGGGGT 


TTCTGGTGGT 


GAAAAAGGTG 


CCCTTGAAGG 


420 


TCCTTCTATC 


ATGCCTGGTG 


GACAAAAAGA 


AGCCTACGAA 


TTGGTTGCGG 


ATGTTCTTGA 


480 


AGAAATCTCA 


GCTAAAGCAC 


CAGAAGATGG 


CAAACCATGT 


GTGACTTACA 


TCGGTCCTGA 


540 


TGGAGCTGGT 


CACTATGTGA 


AAATGGTTCA 


CAATGGTATT 


GAGTACGGTG 


ATATGCAATT 


600 


GATCGCAGAA 


AGCTATGACT 


TGATGCAACA 


CTTGCTAGGC 


CTTTCTGCAG 


AAGATATGGC 


660 
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TGAAATCTTT 


ACTGAGTGGA 


ACAAGGGTGA 


ATTAGACAGC 


TACTTGATTG 


AAATCACAGC 


720 


TGATATCTTG 


AGCCGTAAAG 


ACGATGAAGG 


CCAAGATGGA 


CCAATCGTAG 


ACTACATCCT 


780 


TGATGCTGCA 


GGTAACAAGG 


GAACTGGTAA 


ATGGACTAGC 


CAATCATCTC 


TTGACCTTGG 


840 


TGTACCATTG 


TCACTGATTA 


CTGAGTCAGT 


GTTTGCACGC 


TACATTTCAA 


CTTACAAAGA 


900 


AGAACGTGTA 


CATGCTAGCA 


AGGTGCTTCC 


AAAACCAGCT 


GCCTTCAACT 


TTGAAGGAGA 


960 


- CAAGGCTGAA TTGATTGAAA AGATCCGTCA AGCCCTTTAC 


TTCTCAAAAA 


TCATTTCATA 


1020 


CGCACAAGGA 


TTTGCTCAAT 


TGCGTGTAGC 


CTCTAAAGAA 


AACAACTGGA 


ACTTGCCATT 


1080 


TGCAGATATC 


GCATCTATCT 


GGCGTGATGG 


CTGTATCATC 


CGTTCTCGTT 


TCTTGCAAAA 


1140 


GATTACAGAT 


GCTTACAACC 


GCGATGCAGA 


TCTTGCCAAC 


CTTCTTTTGG 


ACGAGTACTT 


1200 


CTTGGATGTT 


ACTGCTAAGT 


ACCAACAAGC 


AGTACGTGAT 


ATCGTAGCTC 


TTGCGGTTCA 


1260 


AGCAGGTGTG 


CCAGTGCCAA 


CTTTCTCAGC 


AGCTATTACT 


TACTTTGATA 


GCTACCGTTC 


1320 


AGCTGACCTT 


CCAGCTAACT 


TGATCCAAGC 


ACAACGTGAC 


TACTTTGGTG 


CTCACACTTA 


1380 


CCAACGTAAA GACAAAGAAG 


GAACCTTCCA 


CTACTCTTGG 


TATGACGAAA 


AATAAGTAGG 


1440 


TCAGCCATGG 


GGAAACGGAT 


TTTATTACTT 


GAGAAAGAAC 


GAAATCTAGC 


TCATTTTTTA 


1500 


AGTTTGGAAC 


TCCAGAAAGA 


GCAGTATCGG 


GTTGATCTGG 


TAGAGGAGGG 


GCAAAAAGCC 


1560 


CTCTCCATGG 


CTCTTCAGAC 


AGACTATGAT 


TTGATGTTAT 


TGAACGTTAA 


TCTGGGAGAT 


1620 


ATGATGGCTC 


AGGATTTTGC 


AGAAAAATTG 


AGCCGAACTA 


AACCTGCCTC 


AGTCATCATG 


1680 


ATTTTAGATC 


ATTGGGAAGA 


CTTGCAAGAA 


GAGCTGGAAG 


TTGTTCAGCG 


TTTTGCAGTT 


1740 


TCATACATCT 


ATAAGCCAGT 


CCTTATCGAA 


AATCTGGTAG 


CGCGTATTTC 


GGCGATCTTC 


1800 


CGAGGTCGGG 


ACTTCATTGA 


TCAACACTGC 


AGTCTGATGA 


AAGTTCCAAG 


GACCTACCGC 


1860 


AATCTTAGGA 


TAGATGTTGA 


ACATCACACG 


GTTTATCGTG 


GTGAAGAGAT 


GATTGCTCTG 


1920 


ACACGCCGTG 


AGTATGACCT 


TTTGGCGACA 


CGG 






1953 



(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6474 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 
CCGGCAGTAC ACGAGCTTGG GGAACAGCCA CTGGAACGAT GAGGTGTGAG CTCAAAATAT 



60 
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CCTCCAGTTA 


TGTTTTTCCT 


AATAGTATAC 


CGGAAGAGTC5 


AAA CCX A TTTT 

mnVJUn X X X X 


AfaiTnpipp 

A 1 AA 1 VjVjHVjL. 


1 Of) 


GGTTACAAAG 


AACCTACTTT 


CTATTAAACA 


GTATAPTATP 

w X « \ X /*V_ X *» X VJ 


AAAATPTHAA 


A ATTTA 1PST 
AA1 1 I AAL A 1 


i nn 

ItJU 


TTTTTTGTAC 


AAATTTTATA 


AATTATTGCC 


TTTTTA ATAT 
x x x x x AA x nx 


PA ATAHTTA A 
LAft 1 AVj 1 1 AA 


1 L IV 1 1 A 1 LL 




AGATCCCCCT 


TGTGTAAACT 


TTATCTTTAT 


AAGPTTPAAP. 


PPPPPTATPH 
VjV-*^.V_V_ lAlLL 


PITPT&TT'PP 
LA 1 L 1 A 1 1 1 Vj 


inn 
J UU 


CAACAATTAG 


ATCACTTTGT 


TTTfST AAA T A 

X X X X X J* 


HTTP A A A ATT 
oil UWift x x 


LIU lLAfl 1 A 


Ax lALVaTTAT 


360 


CTATACTAAC 


GTTTAAATTT 




PT B. A A ZL TTTT 
L InAAAl lit 


Inl ALLIjALA 


a rno a a m a ^fn/n 

A 1 lAATAGTT 


420 


CATTAATTAT 


ACTTAAAATA 


G CTfi APTCTT 


TPTA ATT ATP 


rnpi ATTATAT 
KoAAi 1A1A1 


1 ILAxLLLlA 


480 


ATTTATATAT 


TCCTACTATC 


TTTfSf^PTTTP 

X X X VJFVJV^ X X 1 ^ 


PTTPr'Zl AT AT 


1 I Vj 111 AAL 1 


ft TV*' ft ft «wimm 
AluAALlul 1 


540 


TTCTATTTGT 


GTTTGAAATA 


TPAATPGPTT 

X I. An X Vr V7v* X X 


PTATPAPmr* 

L 1 A i v_ AV_ I VjLj 


uuLAi 1 1A1 1 


1 L I AT AAATT 


600 


CTTTTTTTAA TTGTTTAGTA 


TPTTTf^PP A A 


papa at at/"* p* 


1 LLAAA1LLA 


A ft AO ft A^/^A rn 
AAAJbAAGGAT 


660 


TATTATAAAA 


ATTTCCAATT 




AAPAAAPAPP 
AAL AAA^»AL L 


1 1 1 1 A 1 1 At- A 


ftP'PTPiPPi'p 


720 


TTAAGCTTCT 


CCTCTCAGCA 


AAAGAATfTA 

nnnwin x ^* x n 


(2TTP ATTAR A 
ul 1 \~r\ x 1 AAA 


anafipiapftp 

AAAo L AAL AL 


A/"* AOPTft ft 
LAsA(jA(jL 1 AA 


/oU 


GAATGTGTTA 


GAAAAAAGCT 


T A A TTfiPTT P 


TPPTTPAHTA 


r.finraa apt a. 

wAVjAAAV, 1 A 


A A rp A A /*• A mm 
ALA 1 AAL A TT 


840 


TTTAATATTG 


GCAGTACTAT 


GAGTAPTAAT 

vftVJ x nv> x nn 1 


PP A A A CZC1 A & r* 


A A PTPTPJ^ & a 
AAL 1L1 uvAA 


TTTTTCTTCC 


900 


TTCAACTGTC 


TCATCTCCAA 


PAAPTATP.PP. 


AL 1 luun x A 1 


AAA 1 1A1LA1 


ft m a m a /-^ a a rs ■» 
A i A r ACi AAC a 


960 


ACCTTCTCTC 


AAAAATTCAG 


GGAPJVAAAAT 


f2 AT ft TTTT TT 
»A1A1 11 1 1 i 


Vj 1 A 1 tAAALA 


LaL L TTTTTAA 


1020 


TTTGTTTGAA AAGCCGATCG 


RAAPTPTTY^A 
w AAL 1 vj 1 x \sn 


LI 1 1 AAAAX A 


A 1 L 1 1 1 LLA 1 


m a /^/rnmrrvn a /~i 

TAGGTTTTAC 


1080 


CCTCAGAATC TTCGATACCG 


' H 1 ™ 1 "/STTfR A T* 
X I lul l^unl 


TTJ^ATATPTft. 
1 ILAlAlblA 


mm A A A A r^rn a r> 
I x AAAaLTAC, 


CAATTTTCTC 


1140 


ATCATAATCT 


GTCGGAAGCG 


PA ATA Rfpama 
LAA1 AA1 KlA 


ITftl'PPl AT ft 
A1AA1 


TTATTTTTAA 


rrvT\m/^ a a a a h 

i 11 CAUAAAA 


1200 


TGTATCAAAA 


AAAGTAATAT 


TT A A f^TT A TT 
X X /V\\3 X x ^» x x 


PTTT'P A A A & A 
L l^uUAAAAA 


A A PTt<P ATA A 
AAL. 1 1 LA 1 AA 


GCTCTTCATT 


12 60 


TTTAGATGGA 


AGAATGCCCT 


TTTTTAAATT 
x x x x xnnnx x 


X mmm A rprnmfTwp 
nl 1 Inl 1111 


ALAuAA ILIA 


m a pp/~> a rn a tr>f*r* 
1A1LA1A1\iL> 


1 1 in 


AACAACTTTA 


TATTTAGATG 


CAAATAGTAA 


CGCGTAGGCC 


APPPP A AP JIT 
n\J L LLAALA 1 


uLLLLAAALL 


lion 


AATTACTGCT 


ATATTCATAA 


AACTACTTCC 


TTATTTCTTA 


ATCCAAAATC 


TAATAGAA r i'A 


1440 


AGCTGCCCCA 


TTCCTTAAAT 


ACAACTCTTT 


AATATTGTTT 


AAAAGTTTTT 


CAACTGATTT 


1500 


CCAGATTATC 


AAAATCTGAG 


ATTTATAGCA 


CAATATTGAT 


GATATTCTAT 


CAATATAATT 


1560 


TTTTTCATCA 


AGTTCCTCTT 


GATACATTTT 


TAATTCTTTA 


GTTTTTCCCA 


TATAACTAAC 


1620 


CATACTACTA 


TCACTTACAT 


ATGGGAAGTC 


CTCATAATAT 


ATTACTTTAT 


AACGCATAAA 


1680 


TTCAAGCGCC 


CTTCCAATAC 


TATTCACAAA 


AACATGAGCA 


ACATGGTCAC 


CAAGTGAAAG 


1740 


CGGACAATAT 


ACGACACATT 


TGTCGTCTAA ATGCATTAAC 


AGCTCTTTTA 


TGATATCATT 


1800 


CTTTAATGTG 


TCCTCATTTT 


TTAATTCACT 


ATAGATATGA 


CGGTATAGAA 


AATTGCCATT 


1860 
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TCTATCTTTC 


CTATAGAGAC 


ATTCATAGTA CGATAAGTGT 


CTAAAATCAC 


ATTGTAGACG 


1920 


TTCACAAGCT 


AACCTGTCTT 


CTTTCTTCCT TTCTTCAATC 


GGATATTTCC 


CAAGGTTACA 


1980 


CAACTTATGA 


AATTGCTTAG 


CAGAGGGCTG TAGCTGTTGG 


CTCAAAGGGT 


AACCAGAAAA 


2040 


TATAGTAATA 


ACAAGTACAA 


TTTCTCCTTC TGAAGTTAAT 


TTTGAAATAT 


AATCACCACA 


2100 


GGAAAAAATT 


GCGTCATCTA 


AATGTGGAGA TAAAAAGATA 


TACTTAGTAT 


TGTTACTCAT 


2160 


AACCATTCCC 


TCTACAATTT 


ATCTAAAAAC TCACTAAGTG 


TCTGATTAAA 


TTCCACATCA 


2220 


TCAAAAAAAT 


TCACCTTATT 


CTTAATAATG AATATTTCGT 


TAAATAAACA 


TATATATAAA 


2280 


TATTTCAATA 


TCCTTTCAAT 


ATCATCCTCT AAATTCTCCT 


CAATATTTTG 


TATCAGCCCA 


2340 


TTTACAATCT 


TATTAAAAAA 


GATAAGCTCT TTATCTCTAA 


AATTAAATAT 


TTTCATACAA 


2400 


CTGTTGTATC 


GAAAAATATA 


TAAAATAATT TTTACTAATG 


TTTGAATATT 


TAAACAACTA 


2460 


AATAAATGAG 


TTGTACCCGG 


GACACTATTT ATGTTATCAA 


GAACACTATC 


TTGAAACCTC 


2520 


AACTCACAGT 


TCTTTTTGTG 


AAATTCTTTT TTATCGTTTA 


GATCTGATAT 


TTTTTTAGAC 


2580 


ATTTCAACAA 


TCTCAGACAT 


TTTATATGGA TATCTAGGAT 


GAATGCCAAA 


ACTATGCAAA 


2640 


ATGAACTGCA 


CCCCAAAAGT 


TAGACAGAAT AAATCTAACT 


TTTGGGGTGC 


AGTTCATAAG 


2700 


ATTGGGATAT 


TTTTTTTTAG 


CTAGAACTAG TAGAAATATA 


TAGTCAAATA 


ACAGATACCT 


2760 


TAAGGGTTTC 


TCATCTACAT 


AAAAAAATGA TACTTTTTTC 


TCTTCAGTAA 


TTACCTCATA 


2820 


AGCTTCACAA 


TAGAATCTCA 


TGTTTCCCTC CCCTATATTC 


TTAAATAAAA 


TCCTTTGGAA 


2880 


ATTGATATAT 


CTTAGTAAAA 


TATTGTTTAA GTTCCGGATG 


CGGAGCATGG 


GTAACAATAA 


2940 


TGACAGTCAA 


ATCCTCTCTA 


TCTAATATCT TACGTTCAAT 


CGCTAACGAA 


GTTCTC CT AT 


3000 


CGATAGCAGA 


AGTTCCCTCG 


TCAATTAATA CTATTTTCTT 


ATTTCTAATT 


AGCCCTCTAG 


3060 


CTAAAGTAAT 


TTTTTGTTTC 


TGCCCTCCTG ACAGTAATCT 


CCCATCATCA 


CCAACATAAT 


3120 


AATCTAAAAT 


GTTATTAGGA AAATCTTTTA CACTCAAACC AACTTGCTCT 


AAAGACTGTA 


3180 


GTATTTCTTC 


ATCAGTATAA 


TTTTCTTCCA ATAAAATATT 


ATCTCTAATC 


GTACCTTCAA 


3240 


ACAAATAAGC 


TTTTTGATCT 


ACATATAGAA CATTCGAAAC 


CATATTTAAA 


TAGGAGGTTT 


3300 


TTTTTATATC 


ATCCCCGCAG AATCGCAATT CTCCACTATA ATCTCTCAAA AAGCCATTCA 


3360 


ATAATTTTAA 


TAATGTAGAT 


TTCCCGCTTC CACTTTCACC 


TAAAATTAAA 


TACTTTTCAT 


3420 


TACGTTGAAA 


ACAAAAATTT 


AAGTTTTTTA ATATTTCTTT 


ATCTCCATAC 


TTATAGCAAA 


3480 


TATTTTTTGC 


TTCATATAAC 


GGAAAATCTC TATTCACCTC 


ATTTGGTTCG 


ATATCATTCA 


3540 


TTTTATTTGA 


CTCAATTGGA 


TTAATTGAAT ACAATTTTAA 


AAAAATAGGC 


TTCGTACCAA 


3600 
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TAATAGAGGA TAATTGACCT CCTAATTCAC CTAGCGCTGT AAAAATAACA CCTGTTAGTG 3660 

CTCCTATTGC TTCAATAGTA CCAATTTTCA CTATTCCTTT TATTGCAAGA TAGCCTGTTA 3720 

AAAAAACGAG AGATATCTGA AAAAAAATAT TGAGAAAGAA GCTAATAGCG CCTGCTAACG 3780 

TTTCTACAGT TGTCTTTCTT TGTATAACCA TCTTTAATAA AATTCCTGCT TCTTTAATTT 3840 

TCTTAGGCAA TACATATAAA AGATTCAAGG ACGCTAACAC ATCAAATCCA TTCAATATAG 3900 

TCTCACTAGA TTTTAAAAAA GCTTCATTTT GGTTAGTTAA ATTTAGACTA ACTTCTCGCA 3960 

TTTTCGATGC AAAGATTTTT GGTACAAGTA GCATAATCAT TAATGAAAAC AAGGTGGCTA 4020 

CAGTCAATGA CCAATGATAG TGATTAAGAG TCACAACTGC AAATATAGTA CCAGAAATTC 4080 

CTTTTATTAC TAAAAAAAGT TGTTTAAACG CCTGATCATT TAAAGTCTGA ACATCATTAT 4140 

TTAGCCACGA AAGATATGTT CCTGATGATT TACTATGAAA TTCTTGATAG GTAGAGTTAG 4200 

AGATGTCTGT GGCAACTCTA TTTCGAATCT CTAGATTAAA CTCTTGGATC ACTTCAACCT 4260 

GATAATTTTT CACTACCCAG TCAAGGAATA TTATCCCACA CCAGACAATC ATTTGGTAGA 4320 

TTGACAATTT CAAAAACCGC TCTAAATTCA TCGCAATTAA TTCATTCAAC ACCAGAGCAT 4380 

TAATAGTTGC TGCATAAATT AGCAATAATT GACCAGCAAC AATAAATATC GTTAATAAAC 4440 

TAAATTTTTT TATATTTGAT TTTATAATAG TATACACAAT AGTTTCTCAC TTTCTAAATT 4500 

TTAATTGAAC ATAGTTTTCA TATATACAAT AGAAAAAACC AAAATGATAT AATAACATAT 4560 

ATTTCAAAAA AGAAATTCGT TAAAAATTTT TTCTTCTCTT GCCTTCTTGA TTACTTTTAA 4620 

AGCCTTGCAT TTGTCTCCTA TTAATAGTAA CCGCTTTATG TTTAAAGAAT AATATTTCTT 4680 

TGTAACCAAT ATTCTCTCGT TGAAACTCAA TAAATTAAAA TATTTCCTAC AGTAATTATA 4740 

ATATTCTTCA TCTGCATTAA TTGTTTTTTG TGTCACTCCA GTGATACCGT TTTCTTTACT 4800 

GTGAGCGTAG TAATTCACCA AGAATTCTCG CACTATATCA ATTTGGTATC CTTGAACAAG 4860 

TAGTTTTAAT AAAACAACAC CGTCCTGATG TGAATCTATT TTCTCAAAAC CATTAATTAA 4920 

TTCTAGCACC TCTTTTTTAC ACAACCAAAA TGACGTACCT GCTATATTGT GAACCATTTG 4980 

AACAAACAAG GGATTTCCAA CAAAATCGGT CTTCTCCTCT TCTCGTGTAC CATTTGGATA 5040 

AATTATTATT CCATAACTAC AAACTAAAGC TAAATTCTTC ATTCTACTCT TTTTAAAACA 5100 

AGCCATCAAC TTTAAAATTC GATCTGGCAT ATATTCATCA TCATCGTCTA AAAATGATAT 5160 

ATACTTACCT CTAGAATTTT TGATACCTAT GTTTCTGGCA TTAGTTGCAC CTAAATCTTC 5220 

ATTACTTAAA ATTAACTTAA TTCTATGATT GGTATAGCCA AATTGATGGA TAATTTTATT 5280 

TCTTAAATTT ACATTACTAT AATTATCATC AATAATTATA ACTTCGATAT TTTTATAACT 5340 

TTGATGTAAA CAACTTTTCA CAGCTCTAAT CAGAGATTCA TACCTATTAT GTGTTGGTAT 5400 
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TATAATACTT 


ACTAATTCTT 


GATCTATATT 


CCTATCCATG 


ACTACTCTTC 


TCTAATAATT 


5460 


CATCATATAC 


TCTCATGGTT 


TCTACAAACA 


TTTTTTGCAC 


AGAAAAATGT 


TTTCTTATTT 


5520 


TTGATTTACT 


ATTCTCACCT 


ATATATTTCA 


AATACTCAGA 


ATCATTGAGT 


AAAAAATTAG 


5580 


CACAAGCACA 


CACTCCCTCA 


ACATCTTCCT 


TCTCAAATAA 


AAATCCATCA 


ACCCTATGTT 


5640 


CAATAATTTC 


ACTTAACCCG 


CCAACATTAC 


TAGCTAAAAC 


CGGAGTTCCT 


TGTGACATTG 


5700 


-ACTCTAAAAC 


ACACATAGGT 


ATTCCTTCTG 


TATCAGAAGG 


AATATACAAT 


AAATCCGATA 


5760 


TTTGGTAAAC 


TATAGTAGCT 


GGATAGATTT 


CACCAAGTAA 


CCTGAAATTA 


TCTCTACATT 


5820 


TCAAATGGCA 


AATTTTTTCT 


TTCAAAGCAG 


CCCACATACT 


ACCATTTCCA 


GCCATAATAA 


5880 


AAATCACATC 


TTCTCTGACT 


AAAAATAATT 


TTTCTGCAAA 


TTCAAGGAAT 


CTATCCGGCC 


5940 


TTTTTTCTGG 


ATCCAACCTT 


CCAACATAAC 


AAATGATTTT 


TTGTTATTTG 


GAATACAAAA 


6000 


TTCTTTTTTA 


AAGTCTTGAA 


CACCTACTAC 


ATCTAAATCG 


CTATTTGATA 


CATTAATTCC 


6060 


GTTATTTATT 


GCAACTATCT 


TCTTATTTTT 


TATTATACTC 


TCCAATCTTT 


TTTTTCATAG 


6120 


TTTCAGATAC 


ACAAATAAAA 


GCATCTCCCA 


TAGAATATGT 


CCAAAAATCA 


AAATAAGTCA 


6180 


AGAATTTCTT 


TTTTAAGTTA 


TATTCAACCC 


ATCCATGGCA 


TGTTATCACT 


GTCTTAACCT 


6240 


TTCCAAATCC 


ATTCTTGTCA 


AGTTTTTTTA 


ACATATATAA 


AAAATAATTA 


GTTGAGTAGC 


6300 


CATGACAGTG 


TATAAGTTGG 


ATTTTTAATA 


ATTTTAAAAT 


ATTTTTAACG 


TGTAAGGCAG 


6360 


TTTCAAAATT 


ATTTGAACAT 


TGAGTACAAT 


CAACATAGGC 


AATATCTAAA 


TTTTTATAAT 


6420 


CATCAATAAC 


CTTTGAATCT 


CTAGATACAA 


TTATCAAAAT 


AGGGAATAGA 


GACA 


6474 



(2) INFORMATION FOR SEQ ID NO: 156: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4792 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(XX > SEQUENCE DESCRIPTION: SEQ ID NO: 156: 

TATTTAACGA TTTTTTTCAT GTCATTTCCT CCAAAATAGA ATACCTTATA ATCTTAACAG 60 

AAAAAGAGCA TTTACGCCAT TATATGATAT CTATCTCTGT GATAAGTTTT TTTTATGGGT 120 

AATTTAAAAG ACCAAACGCA AGATGGCAAT CAAGACCACT CCAAAGAGAA CTGTTCCGAC 180 

TAGATTGCGG TAGCGAAAGG CTACCCAAGC TGTTGGAAAG ACGGCTAAGA AGTCCAGTCA 240 

TTTGATTTGA GGAAGACTGC CAACCTTACC TGTCACTACG CTTGAAAGAA TCAGGGCAAA 300 
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GATAATGGAA ACAGGCAAAA ACTTCAAAAA 


ACGCTCAACA 


ATCGCAGGCA 


GGCCCTTATA 


360 


CTTGACCAAG ATGAAGGGAA TCATACGGGG 


AATCCAAGTC 


ACCAAGCCAG 


AGAAAATAAC 


420 


TGCTAATAAA AGATACTTAC TGACCATCTA 


AAACCACCCC 


CATGCTACAA 


CCAAGTAGCG 


480 


TCGCAAACAG AACAGCTAGT GACTGAGACA TCACTGTCAA GAGCAAAAAG 


AAGGACACCG 


540 


CAACAACTGC TAGGATAATG AGCAGATTGC 


GGACAGGAAT 


CCGTCTTTGC 


ATAATCTGAA 


600 


ATTGCGAAGC AAAATACCAA TAAACATCCC 


AACCAGGGCA 


AAATCCAAGC 


CAAAGATTTC 


660 


TGGATTTGGT AGCAGGCCAC CCAGAGCCGT 


TCCGACTACT 


GTCCCCACAA 


ACCAAGCCAC 


720 


ATAGCTGTTA AGATTGTTTC CGTGCATCCA 


CATAGGATTT 


ACCTTGTCTG 


TATGGGCCAA 


780 


TTCACCCATC AAAACGCCAT AGGTCTCATC 


TGTCAAGATA 


CTAGACATAC 


CGATATTGTA 


840 


CCAAAGACTG GTATGACGGA AATAAGTCGA TGCGTGTAAA CTCAACAAAA AGAGACGCAA 


900 


GTTGATTAGA AAAACCGTCA TAGCAATAGC 


TGCCACAGGA 


GCTTGAACCA 




960 


CAACATGGCA AACTGGGCAC TCCCAGCATA 


AACAAAGAGA 


CTCATCAAGC 




1020 


AGGTGTCACA TAGGGCGCAC CGATAATTCC ACAGGCCAGG 


CCGATACTGA 


CATAGCCAAG 


1080 


AGCCGTTGGC ATGGCTGCCT GCGCCCCCTC 


CTAAAATCCT 


TTTTCTTTCA 


TCTTTCTCCT 


1140 


CATATTGTCT TAATAATACT CAATGAAAAT 


CAAAGAGCAA 


ACTAGGAAAC 


TAGCCGCAGG 


1200 


TTGCTCAAAA CACTGTTTTG AGGTTGCAGA 


TAGAACTGAT 


GAAGTCAGCT 


CAAAACACTG 


1260 


TTTTGAGGTT GTGGATAGAA CTGACGAAGT 


CAGCTCAAAA 


CACCGTTTTG 


AGGTTGTGGA 


1320 


TAGAACTGAC GAAGTCAGTA ACCATACCTA 


CGGCAAAGTG 


AAGCTGACGT 


GGTTTGAAGA 


1380 


GAGTTTCGAA GAGTACAAGT AGGCTGAAAA 


GAATCCAACC 


ACAGCATGGA 


CTATTATATA 


1440 


GCAGATTGAA ATAAGATGAG AACAAATCGA TTGGGAAAGT AAAATTAATT 


TCTATAAATG 


1500 


TTTTAGCAAT TGTTTCGTAC TATTTTAGAT 


TCAGTCTATT 


ATAACACATT 


CAGAAAAGAG 


1560 


AAAAAAGTCT GTTGATTTTG ACCATCATAA 


AAAGACTGGC 


AATCCAGTCT 


CAAACATATA 


1620 


TTATAGAAAT TCTCCACTAA ATACTTTCAC 


GAATATTCAG 


AAGCATAACA 


AAGGCAACTA 


1680 


GAAGAAATAG CAATAAAACA AAGCTAACTG 


CCAGAGTTCC 


AAAGCTAGTA 


GCAATGGTTA 


1740 


CCAAAGCTAT TGTAAATAAG CTAGGTAAAA 


CAACCGTAAT 


GGCACCGATA 


GAGGATTGAA 


1800 


CTGCTCCCAT TGACTCCTCA GGTATTTGTT 


TAAAAACGAG 


TTCTTGCAAT 


CTAGGAGAGA 


1860 


GAACACCTGC GAAAAAGGCA TCCAAGGTAC 


TAAAGATGAG 


AATCCAGTCA 


AAACGAACTG 


1920 


TGGCAAATCC TACTAGAAGA AGCAACTGGA 


TGACAAGTGA 


GGCATAGAGA 


GCTGTTTTTA 


1980 


TGGAAATGGT ATGTTGCAGA TAGCCACTTA 


CAAGGCTTCC 


GACAATCAGG 


GCTGATAATT 


2040 


CTAGTGTGGC TAACAAGGCA AGAGATTGAC 


CAGTTTGTAA 


ATTCAAAAAG 


GGCTGGTTCC 


2100 
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TTAAAAATAG AGTGGAAATA GGAACCGTAA CATTTATCAC TGCTTGACTA GTAGAGATAA 2160 

TAAACAAAAC CAAGAGCACC TTATTCATAT TCCATATCAA TTTCGATGAT TGGAGCAAAT 2220 

GCTGGCAAAA GGATTTTACA GAGAGTCCTT CTTGATAGCT AATCGTTTTT TCTACTTTCA 2280 

AGAGGTCAGT TTTTATGAAG AGGATACCTA AAAATGCGAT TAAAAAGGTA AGAGCGTTCA 2340 

GTAAGGAAAT AAACTGGATG GATAGAATGC CTAGTAAGAC TCCTCCTAGG ATATTACTGA 2400 

-TTGTTTTCAC TAAACTAACA GTTGACTGTT TAAAGCCAAT AGCTTCTGCC AGATGGTCTT 24 60 

GCCCAATAAT TCTAATGAAA ATCGGAGTGA GCATGGCGCC TGAAAAATAA CTCAATGTGT 2520 

CAGACAAGAG GTTAATCAGA CAAATAAATG CTACTAGCAA CAAGGAGAAA GACTGCCCTG 2580 

AAAGTGATAA AGACACTATA GAGTAAAGCA AAAATTTTGC AAAACTAATG ACTGTGTATT 2640 

TCAAGACACG ATGATGTTGA AAATCCGCCA AAACTCCCAG AAAGATTTGT AGAACTTGGG 2700 

GCAGGGTTTC TGAAATCGTG ATGAGTAAAA TCGCCAAAGG GGCAAAAGAT GCATCTGCCA 2760 

CATAATTCAG GAAGGCCAGA TAAAAAATCG TATCCCCAAG CGTTGAAATC CACTGGTTGA 2820 

TAGTTAATTG CCTAAAATCT CTATTTTGAA GAAATACTTT CATCACAACT CCTTCTTAAG 2880 

TTCAAATGGG AATCTTTCCC CAAGGATAGA CCGCGATACT ACTAACAACC AAAATTACAG 2940 

TAACATCAAA AGCTGACCAA TGCCATTGTA GACTATATGC AGTCCAATAG GCCAATAAAT 3000 

TGACTTTGTC ATTCTAAATA AGACTGCAAA TATAAGACCT CCACCCATAT AGAAGACAAA 3060 

GTCTGTCAAG ACCCAACCGT GATTACTAAT GTGCGAGACC CCAAATAAAA CAGCGGAACC 3120 

AAGTACATCT AGCCCCCATT TCTTTCCTTT TTCCAGAGCA GTCATCACTA ATCCACGATA 3180 

AATCATGTCT TCAAAAATGG GACCTGCAAT CACAGGATAA AAAAAATACA TCAAAAATGC 3240 

TGTAGCCCCT GTAAAAGTCG GAGCAGCATG TTGATAAGAA ATTTCATTTC GAGTAGGTGG 3300 

GAAAAGAAAA AAGGTAACGA AATTCCAAAC AACAAAAGCA AGCAGAGCTA GGAAGGAATA 3360 

GAAAAGATAG GATCCTTTAA ACTTTCTACT ATTGATTTTC TGCCATTTCC CCGACCAAAT 3420 

CATAGCAATA AGAGCAAATA AAACCACAAG AAAATTCAAC ATCATATCCG ACAGATAATA 3480 

GGCAAAGTCA GATAGCCCAG TAACAAGGTC GCTGCGTAAA ACTAGAACAC TGAACTTCTG 3540 

GTCAGCAATA ACTAGTAGAA AAACTATAAT AAAGTAGCGG TGTGAGATTA TCTTTTTCAT 3600 

ATATCACCTT TCTAATATCC AAATACCAAT AAAGTAACAA TGAGTAAGAA ACTATTCCAT 3660 

GAAGCATGCA GAGCTATAGC CCAATAGATG GATCGGGTGT AGCGAAACAT CATACAAAAT 3720 

ATCAAGCCCA TTCCAAAATA CTTTATGAAA TCTGTCGTTA TCCAACCATA CTGCAAAACA 3780 

TGCATAGCGC CAAATATGGC AGCGGAAACA AGAACATCAA GATAGTATCT CTTAACTTTA 3840 
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GATAAACTTG TCATCAAAAG ACCACGACAA ACAACCTCTT CTGATACAGG TGCGATAATA 3900 

CTAGTATAAA GTATTCGCGT AACAAAATAG CTAATTCCTG TTAAATTGGT GGCTACTTCT 3960 

ACGACTGTAC TTCCATTCTG GGTACGAGGA AAGATATAGG TTGTTAGATT TGCCCACACG 4020 

AACAATAAGA AAAAAGAAAG AAGGAAAACA CCCAGGTAAG ACCAACGAAA CTGGAAACGA 4080 

CCACACTCTT TCCAATGTTC ACTTTTGACA AAAGCAATTG TAGCTATAGT TCCCAGAATA 4140 

AGTACCAATA AAACTTGGAA CACATAGTAC ATATTATCAG ACAAAGCAAC CATAAAATCT 4200 

AAGTCTGATG TGACATTAAA AATGAGGTAA TAAGTCAAAA TCAACAAGCC AGTTGCTAGG 4260 

TGAAATTTCA CTTCTTTCAT TTTCTTCATC CTATTATCTC CTATAAGAGC CTATCTTCTA 4320 

CGGCGGCCAA ACAATCCATC TGCTAAATCT ATAGTCCAAT CAAAAGCTCC ACGATTAGGA 4380 

CTCATCCCTT GATTGCCCCA ACCAGGGTAA ATTCCTGGGA CGCCCCAACC AGATATACCA 4440 

CTTCTTCCAC CACCTCCCAT AGAATTTACG AGGTTGCCTC CTCTAACATC TTGCAACTCA 4500 

GCTTCTGTCA ATTCCATTGT TTCTGCAAAT TGTAAATTTA ACATCTTTTA CACTCCTTCA 4560 

ATTATCTTCA TTTGTAAACC ACTTCTGCGA CCTAGGATTT GCTTCAAGTG CTTTACAAGT 4620 

ACAGTATAAC ACGAACATTG GCTTATTTTA GAAAATCGCA TATTTGATAT TTTTTCTTAT 4680 

AGAAATTTCA GATTTGCGAT TTTGGTGAAT TTGATTACTT CTCTGGTATA ATAAAGTTAC 4740 

TACTAATGAG GAGTGGAGAA ATATGAAGAA ACAAATTTTA ACATTATTGA AA 4792 
(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2156 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 

CCGTTCTCGG CGACGGCCAT CTGATGAAGC TATTTATGAG GGAAACTGGC AAGCTGGAGA 60 

GTCAGAGTAT CTAGTCTTTC ACCGATTGCT GTGGCAGCAG ATGTGCAGGG AAAAGGAGTT 120 

GCTCAAACCT TCTTAGAGGG CTTGATTGAA GGTTTTGATT ATCTTGATTT TCGCTCAGAT 180 

ACGCATGCTG AAAACAAGGT TATGCAACAT ATTTTTGAAA AACTTGGTTT TAAACAAGTC 240 

GGTAAGATGC CAGTAGATGG CGAACGCTTG GCCTATCAAG AATTAAAGAA ATAATGCAAA 300 

AGAAGTATGT AAAAATCCTC TACTCCTCAC CAATTGGTAT TCTATCACTT GTAGCTGATG 360 

ACCATTATTT GTATGGAATT TGGGTTCAGG AGCAGAAGCA TTTTGAGAGG GGACTAGGAG 420 

ATGAAACGAT AGAAGAAGTT GTTAGTCATC CTATTTTAGA CCCAGTTATT GCTTGCTTAG 480 
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ATGATTACTT 


TAAAGGCAAG 


CCTCAGGATT 


TATCCAACTT 


GCTCTTGGCG 


CCAATCGGAA 


540 


CGAATTTTGA 


AAAGAGAGTT 


TGGGACTATT 


TACAGGGCAT 


TCCTTATGGT 


CAGACAGTGA 


600 


CCTATGGACA 


AATTGCTCAA 


GACCTGCAAG 


TGGCTTCTGC 


TCAAGCAATT 


GGTGGAGCAG 


660 


TGGGACGCAA 


TCCTTGGTCT 


ATCCTAGTAC 


CTTGTCATCG 


TGTGTTGGGA 


GCAGGCAAGC 


720 


GTCTGACAGG 


TTATGCTGCA 


GGAGTGGAAA 


AGAAAGCTTG 


GCTCTTGGAG 


CATGAAGGAG 


780 


. TAGATTTTAA 


AGATAGAAGC 


AATAGAAGGA 


GAAGCACATG 


TTAGAATTTA 


TCGAATACCC 


840 


CAAATGTTCA 


ACTTGTAAAA 


AAGCAAAACA 


AGAATTAAAT 


CAATTAGGTG 


TGGACTATAA 


900 


AGCCGTCCAT 


ATCGTGGAAG 


AAACACCTAG 


CCAAGAAGTC 


ATTTTGAATT 


GGCTAGAAAC 


960 


CTCAGGATTT GAATTGAAGC AATTTTTCAA CACCAGTGGT 


ATCAAATACC 


GTGAATTAGG 


1020 


GCTAAAAGAT 


AAGGTAGGAA 


GTTTGTCAAA 


CCAAGAAGCG 


GCTGAGTTGC 


TAGCAAGTGA 


1080 


CGGTATGTTG 


TTAAAACGGC 


CCATTTTAGT 


AGAAAATGGA 


ACTGTTAAGC 


AAATCGGTTA 


1140 


TCGAAAATCT 


TATGAGGAAC 


TGGGACTGAA 


ATAGTTTTTA 


TCTATCTCTT 


TGATAGATAA 


1200 


AATATATAAC 


TTCCCTGTTT 


CAAAGTATGA 


TAAACTAGTA 


GGTAGACAAA 


GTCTGTATCT 


1260 


GACCGTAGCA 


AATAATTTCA 


TTGACGGCAG 


AAGCATGGTA 


GCATGAATCA 


TTATCAGAAG 


1320 


AGGATGTTTT 


TATGAATGTT 


ACAACGATTT 


TAGCATCAGA 


TTGGTACCAA 


AACTTGATGC 


1380 


AATTGATTCC 


GGATGGCAAG 


CTGTTTAGCC 


TACGTTCGGT 


CTTTGATGGA 


ATCCCTAGAA 


1440 


TTGTCCAACA 


ACTTCCAACA 


ACAATTATGT 


TGACAATTGG 


TGGTGCCCTT 


TTTGGCTTGG 


1500 


TTTTGGCGCT 


TCTTTTTGCC 


ATTGTGAAGA 


TCAATCGTGT 


CAAGATTTTA 


TATCCCTTGC 


1560 


AGGCCTTCTT 


TGTTAGTTTC 


TTAAAAGGGA 


CACcGATTTT 


GGTGCAACTC 


ATGTTGACCT 


1620 


ACTACGGAAT 


CCCTTTGGCT 


TTGAAAGCCC 


TCAATCAGCA 


ATGGGGAACT 


GGTCTCAATA 


1680 


TCAATGCGAT 


TCCAGCTGCA 


GCTTTTGCGA 


TTGTCGCCTT 


TGCCTTTAAT 


GAGGCAGCTT 


1740 


ATGCTAGTGA 


AACCATTCGT 


GCAGCCATTC 


TCTCAGTTAA 


TCCTGGTGAG 


ATTGAGGCGG 


1800 


CACGCAGTCT 


GGGTATGACC 


CGAGCGCAAG 


TTTATCGACG 


AGTGATTATT 


CCTAATGCAG 


1860 


CGGTGGTAGC 


TACTCCAACC 


TTGATTAATT 


CCCTCATCGG 


TTTGACCAAG 


GGAACATCTC 


1920 


TAGCTTTTAG 


TGCGGGTGTT 


GTGGAAGTCT 


TTGCCCAAGC 


TCAGATTCTA 


GGTGGAGCTG 


1980 


ATTATCGCTA 


TTTTGAACGC 


TTCATCTCCG 


TTGCCCTTGT 


TTATTGGGTA 


GTCAATATCG 


2040 


GAATTGAAAG 


CCTCGGTCGT 


TTCATCGAGA GAAAAATGGC 


TATTTCTGCA 


CCTGATACAG 


2100 


TGCAACAGAT 


GTGAAAGGAG 


ACCTTCGTTA ATGATTAAGA 


TTTCGAATTT 


AAGCAA 


2156 


(2) INFORMATION FOR SEQ ID NO: 158: 
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U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3140 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 





SEQUENCE DESCRIPTION: SEQ ID NO: 


158: 








' ACATGTCTTC AATCGATTTT GTTGTCCTCC 


! AATTTAATTC 


' CTTATATGCT 


60 




TTGCATAACA AGTTGCAACG TCTCCTGAAC 


GTCTTGGAAC 


TATTTTATAA 


120 




TCTTATTAAC ACTTTCAAAT GTATTTACAA GTTGTAATAC 


ACTAGTGCCT 


180 


TCTCCCGAGC 


CTAGGTTATA GATATAAACA TCTGTTTTTT 


CAGATACTTT 


TTCTAAAGCT 


240 


TTTATATGTC 


CTATTGCTAA ATCTACTACA TGGATATAAT 


CACGCACACC 


AGTACCATCA 


300 


AGCGTATCAT 


AATCATTTCC GAACACACTT AGCTCTGATA 


GCTTACCTAC 


CGCTACTTGT 


360 


GCAATATAAG 


GCATCAAGTT GTTAGGAATT CCTGAGGGAT 


CTTCCCCAAT 


CAAACCAGAC 


420 


TCATGAGCAC 


CAATTGGATT GAAATAACGA AGCAACGCAA 


TACTCCATTC 


TGAATCTGCC 


480 


ACATGAACAT 


CTTTTAAAAT TTGCTCAAGC ATCACTTTCG 


TATACCCATA 


AGGATTTGTC 


540 


GCACTTGTTT 


GCATCGTCTC AATTAGAGGT GACTGATTGT 


TAATTCCATA 


TACAGTCGCA 


600 


CTTGAAGAAA 


AGACAATCTT TTTAACATTA AATTCTGACA 


TCACTTCAAC 


AAGTGCCAAT 


660 


GTACTCATAA 


TATTATTTTT GTAGTACATC ACAGGCTTTT 


GCACGGATTC 


TCCGACAGCT 


720 


TTATAACCTG 


CAAAATGAAT TGCAGCATCA ATCGATTCTT 


GTTCAAATAC 


CTTTCTCAAT 


780 


GCTTGTTTAT 


CACAAACATC TAATTCGTAA AACACGGGAC 


GTATTCCTGT 


AATTGCTTCA 


840 


ATACGGTCTA 


GCACCAAGAT GCTAGAGTTC GAAAGGTTGT 


CGACAATGAT 


AACTTCCTTT 


900 


CCTAAATTTA GTAATTCTAC TACGGTATGG CTACCAATAT 


AACCAGCTCC 


GCCTGTTACC 


960 


AATATTGCCA 


TCTGGGTTTC CTCCTAATTA ATTCCAACCG 


ACTTAACAAA 


TCTCATAAAC 


1020 


GCTTCATGCC 


CAGACGGTGT ATTCTTATAA ACTCCTGCAT 


CTTCCAGAAC 


TCTCGCAAAC 


1080 


ACTTGTCCTG 


CTTCGTGTTG AACTACGCTA TTAACCTCTT 


CTTTATTAAT 


GCGAGGATAT 


1140 


TTTTCTTTCA 


ATTGGTCGGC CCATTCTAAA TGATAATCCG 


CAATTGGATT 


ATCCTCTCCT 


1200 


AAAAGATATT 


TTCCAACTTC TTCTAACTCT GGTTTCAAAC 


GAGGTGGTAA 


TATCGCAAGT 


1260 


CCCATCACTT 


CGATTAACCC GATATTTTCC TTTTTAATAT 


GTTGTACATC 


TTGATGAGGA 


1320 


TGGAAAACAC 


CATCTGGGTA TTGTTCAGTA GTATGATTAT 


CTCTTAGAAC 


AATATCTAAT 


1380 


TCGTATCTCC 


CGTCCACTTT ACGAGCAATA GGAGTCACCG 


TATGGTGTGG 


GACATCTTCA 


1440 


GTCATAGCAA 


TGATGTCTAC TTCTAAATCT GAATATTCTC 


TCCACTTATT 


TAGAATTTTA 


1500 



A 
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GTAGCTAAAT CTAACAAGCG ATTTTTATTT TCACTTTGTA 


ACCTAATTAC 


TGACATTGGC 


1560 


CATTTTACAA TACCAGCATT AACATCCTCA AAGTCTTTAA 


AACAAAATTC 


ACTCTCAAAT 


1620 


TTTGCTTTTT CCATTGGGAA AATATGTTTC CCTCCCTGGT 


AGTGGTTATG 


ACTAAGAATG 


1680 


GAGCCTCCTG AGATAGGAAG ATCAGAATTT GAACCAGCAA 


AATATCCTGG 


CAAAATATCA 


1740 


ACAATCTCCA ATAATTGTTC AAATGTTTTA GAGGTAATAG 


CCATTGGTAC 


ATGTTGACTA 


1800 


TTCAAAAATA TCGCATGCTC ATTAAAGTAT GAGTAGGGAG 


AATACTGGAA 


TCCCCATACT 


1860 


TCGTCACCAA GTTTCAACCG AATAATTCTA TGATTCGAAC 


GTGCTGGATA 


ATTTATTCGC 


1920 


CCCTGATATC CTTCATTTTC CATACATAGT AAACATTTGG 


GATAATTAGT 


TGCTTTTACT 


1980 


AATTTTTCAG CAGCAATTGT TTTTGGATCT TTTTCGGGTT 


TTGACAAATT 


TATCGTAATC 


2040 


TCTAGCTCTC CGTATTTAGT TGATGCTCGA AACTCAATAT 


TCTTAGCAAT 


AGCAGAAGTT 


2100 


TTAATATAAT CACTATCTTT ACTTAACTTA TAAAACTCTT 


CAACTGCTTC 


TTGAGGTGAT 


2160 


ATATCATATG AACTCCAAAA AATATCATTT AATCGACTAG 


GTAAAGGAAC 


TATGAAATTC 


2220 


ATTAACTCTG CTCCTAAACA TTCCTTTTCC TCGATTAAAT 


CTTTAATTTT 


ACCGTTTTTT 


2280 


AAGGCGATTT CCACTAAGTA ATCTTTTATT TGTTTCAGGT 


CATTTTCATC 


GGAAATGCGA 


2340 


TCAATTCCCT CCTCACCTAT TAACGCTAGT ACTCTATTTT 


TCACATATAT 


TTTGTCAATT 


2400 


TCATTATACA TTCCGTATTC AATTACTCTA TCAACAAAAT 


TATCAATAAT 


TGTTTTCATA 


2460 


TATTTTTCTT TCTAATTTAT GTTCCCATAT TTTCTATACA 


TTATCCATTT 


ATAAATTGCT 


2520 


TGCGTAGTAT GAGCAATTTT ATCAAGGTGA TGAATAATAT 


CTAAAGCACT 


AATTACTTCA 


2580 


GAAACGTTCC CATCATCTTC AAATATGTAA TTCATTATTT 


TCTTTTCCAT 


ATTTATACTA 


2640 


AGCTCTTCTA TCTCATTCTG TTTTTGTATA ACAACCATAT 


CTAAACATCC 


AGATTGTTCC 


2700 


TCTCTATAAC AAGATATAGC CCTATTCATA TGCAGTCCGA 


TAACTTCATG 


AAGTATTTTT 


2760 


ATTTTTGAAA TAATTTTCTT CAAAATTTCA TTATTTTGAA 


GAATCTGTAG 


ATTTTTTAAA 


2820 


ATTTCAACAA TTCTATCCCC AATACGTTCA ATGTCAGTTG ATATTTTTAT TACACTAATA 


2880 


ATTCTTCTTA AGTCATATGA AACAGGATGT TGTAAACAAA 


TTAACTCATA 


TCCTTTTTTA 


2940 


TCAATATTTA GAACTGACTC ATTTATGATT AAATCTTCTT 


TAATCAATTC 


TACTCGTTCT 


3000 


TCATTTGATA AATATTCAAA TAACTTCTCA TATTTATCAA 


GCACAGATAC 


CCAAATGGTC 


3060 


TCTAAATTAT TTGATAATTC TATAATTTCA TTTTCTAAAT 


ATAACCTTAA CATTTAGGTA 


3120 


CCTCTTCTTA ACAAAGTTCG 






3140 


(2) INFORMATION FOR SEQ ID NO; 159: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9048 base pairs 

(B) TYPE: nucleic acid 
(Ci STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 



CCGGATGATT 


TCCTGGTCAG 


ATAGGGGGAA 


AGTGACTTCC 


TCAGCAATCG 


CGCGTAGAGT 


60 


AGGATTCCCT 


TCACGGATAA 


TATCGTTCAT 


ATCAATTAAG 


TGAGCAGCTT 


TTGTAATACG 


120 


TTCTATTGCA 


GACATTTTCT 


CTCCTTATAT 


TATGTTTAGT 


GCAGTTAGCT 


ACTGCCAAAG 


180 


CCCAAGTGGT 


ATACTTGGAA 


TAAGCCACTG 


TGGATTAGTT 


CATTTTCTTT 


CATTACCTCT 


240 


ACATGATATC 


ACAAAATGAC 


AAGAATTGAA 


AGCATTATGG 


CATTTAGGAT 


TTATAGAAAA 


300 


TAGATAGGAA 


GTTCAATTCA 


ATTGTGAAAG 


AAATACTTAT 


CTGTGATATA 


ATAAAAAGAA 


360 


AAGGCTTGCA 


TAAGAAAGTA GGGAGAACGA AGATACAAAG 


AAGACAAAAT 


CGAAATCAGG 


420 


GTGGTTTAGC 


TTTTCGTTTT 


ATGAAGGGCT 


TGGTAAACTT 


TTTAGGAGTT 


ATCGCAAGTG 


480 


GAGCAATAAG 


GGATTTGTGG 


CGATACTCTT 


GCTAGCAGTT 


GGTTTATCAA 


TGGGCTTGGT 


540 


CTTGTTGTTT 


GAAAGCTTCC 


AAGGAATCCC 


TTGACTAGTC 


AAAAACGAGA 


TACTATTTCT 


600 


CAAGAGGGGA 


CTAAGCAAAA 


GTCTCAGGAG 


TAGGAAGAGG 


AAAAAACTGC 


CAGAATTATG 


660 


GCCCACGGGG 


ATTTGCTCTA 


CCACGATGGA 


CTTTTCTTTT 


CAGCTAAAAA 


AGAAGACGGT 


720 


ACCTATGACT 


TTCATGAAAA 


TTTTGAGTAT 


GTGACTCCTT 


GGCTCAAGCA 


AGGGGACTAA 


780 


GCAGCAGATT 


TAGCTATTGG 


TGATTTTGAA 


GGAACCATTA 


ATAAGGATCA 


TTATTTAGCG 


840 


GGTTATCTTC 


TCTTTAATGC 


TCCTGTTGAA GTTATGGATG 


CTATTAAGGA 


GGCAGGTTAT 


900 


CATGTGCTGG ATTTAGCTCA TAATCATATT TTGGATTCGC AAATTGAGGG AGTTATTTCA 


960 


ACGGCCGATA 


TTATTGAGAA 


AGCTGGAATC 


ACTCCAATCG 


GAGTTTATAC 


GCACGAACCA 


1020 


CGTGATCAGG 


CTCCGCTGGT 


CATTAAGGAA 


GTGAATGGTA 


TCAAGGTTGC 


ATTGTTAGCC 


1080 


TATTCCTATG 


GTTTCAATGG 


AATTGAGCAG 


TATATTTCTC 


AGGAAGACTA 


TAATCGTTAT 


1140 


CTTTCAGATT 


TAAACGAAGA 


TAAGATGAAG 


GTTGAAATTG 


AACGGGCAGA 


GAAGGAAGCA 


1200 


GATATCACCA 


TTATCATGCT TCAGATGGGT 


GTTGAGTATC 


GATTGGAACC 


AACTGAAGAA 


1260 


CAAAAAGCTC 


TTTATCACAA GATGATCGAT 


TTGGGAGCGG 


ATATTATCTT 


TGGAGGGCAT 


1320 


CCTCACGTTG 


TTGAACCATC 


TGAAACGGTT 


GAAAAAGATG 


GAGATAAGAA 


ACTCATTATC 


1380 


TATTAAATGG GGAACTTCAT TTCCAATCAA CGAATTGAAT CTATGGGAGA TGAAGAGAAT 


1440 


GCTAAGTGGA 


CTGAACGTGG 


TGTTCTCATG 


GATGTCACCA 


TCAAGAAGAA 


GGATGGAAAA 


1500 
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ACAACTATCG 


GAACAGCTAA 


AGCTCATCCT 


ACTTGGGTCA 


ATCGAACACC 


AAAGGGAACC 


1560 


TTTTCACCAG 


AAGGATATCC 


CTTGTATCAT 


TACCAAACTT 


ATATTTTGGA 


AGATTTTATA 


1620 


GAGGATGGCA 


GTCATCGTGA 


CCAGTTAGAT 


GAAGCGACTA 


AGGAACGAAT 


TGATACAGCC 


1680 


TATAAAGAAA 


TGAATGAACA 


TGTGGGATTG 


AAGTGGTATT 


AGCTTGAATC 


CAGAGGAAAG 


1740 


TAAATGATGA 


TTAAGGTAAT 


TGCGACAGAT 


ATGGATGGGA 


CCTTGCTGGA 


TGCTAGAGGT 


1800 


CAGCTTGATC 


TCCCACGATT 


GGAAAAGATT 


TTAGATCAGT 


TGGATCAAAG 


GGGCATTCGT 


1860 


TTTGTCATTG 


CGACGGGCAA 


TGAAATTCAC 


CGCATGAGAC 


AACTACTGAG 


TCCCTTGGTG 


1920 


GATCGAGTGG 


TTCTGGTTGT 


TGCTAATGGC 


GCTCGTATTT 


TTGAAAACAA 


TGAATTGATT 


1980 


CAGGCTCAGA 


CATGGGATGA 


CGCCATTGTC 


AACAAGGCTT 


TGACTCATTT 


CAAGGGTCGA 


2040 


GCGTGTCAGG 


ACCAGTTTGT 


TGTAACGGGG 


ATGAAGGGTG 


ATTTTGTCAA 


GGAAGGTACG 


2100 


ATTTTTACAG 


ATCTTGAAAG 


TTTTATGACT 


CCAGAAATGA 


TTGAAAAATT 


CTACCAACGG 


2160 


ATGCAATTTG 


TGGATGAATT 


AACATCTGAC 


CTCTTTGGTG 


GTGTGCTCAA 


GATGAGCATG 


2220 


GTTGTTGGTG 


AGGAACGTTT 


GAGTTCGGTT 


TTGGAAGAAA 


TCAATGCTCT 


CTTTGATGGC 


2280 


CGTGTCCGAG 


CTGTATCCAG 


TGGCTATGGT 


TGCATTGATA 


TCCTCCAAGC 


TGGGATTCAT 


2340 


AAAGCATGGG 


GCTTGGAGGA 


ATTACTCAAG 


CGCTGGGACT 


TG AAATCCC A 


AGAAATCATG 


2400 


GCTTTTGGTG 


ATAGTGAAAA 


TGATGTTGAA 


ATGCTTGAAA 


TGGCTGGAAT 


TGCCTATGCG 


2460 


ATGGAAAATG 


CTGATGAGAA 


AGCCAAAGCT 


GTGGCGACTG 


CTCTAGCACC 


AGCCAACAGC 


2520 


CAAGGAGGAG 


TTTATCAAGT 


CTTGGAAAAC 


TGGTTAGAAA 


AAGGAGAATG 


AAGTGGCAGT 


2580 


ACAGTTATTA GAAAATTGGC 


TCCTAAAGGA 


ACAAGAAAAA 


ATTCAAACTA 


AGTATCGTCA 


2640 


CCTAAATCAC 


ATTTCTGTTG 


TAGAACCAAA 


CATTCTTTTT 


ATTGGGGATT 


CCATTGTCGA 


2700 


GTATTATCCT 


CTACAGGAGC 


TATTTGGGAC 


TTCAAAGACG 


ATTGTCAATC 


GAGGAATTPR 


2760 


TGGCTATCAG 


ACAGGACTGT 


TACTAGAGAA 


CCTTGATGCT 


CATCTATATG 


GTGGAGCAGT 


2820 


AGATAAAATT 


TTTCTTCTGA 


TTGGGACAAA 


TGATATCGGA 


AAGGATGTTC 


CTGTGAATGA 


2880 


GGCTCTCAAT 


AATCTCGAAG 


CTATCATTCA 


ATCCGTTGCT 


CGCGATTATC 


CATTGACAGA 


2940 


GATTAAATTG 


CTTTCCATTT 


TGCCTGTCAA 


TGAGAGAGAG 


GAGTACCAGC 


AGGCAGTCTA 


3000 


TATCCGCTCG 


AATGAAAAAA 


TTCAGAACTG 


GAATCAAGCC 


TATCAAGAGC 


TTGCATCTGC 


3060 


CTATATGCAG 


GTGGAATTTG 


TGCCAGTATT 


TGATTGTTTG 


ACAGACCAAG 


CAGGCCAACT 


3120 


CAAAAAAGAA 


TATACAACTG 


ATGGACTGCA 


CCTCAGTATT 


GCTGGTTATC 


AGGCTTTGTC 


3180 


AAAATCCTTG 


AAAGACTATC 


TTTACTAAAT 


AGCTAAATAA 


TGTTAAATTT 


GAGCATAATA 


3240 



A 
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TCTTGTAAAA AATTCTAAAA TCCTTTAAAA TAAAAAGTGA CGGAGGAATT TATGAATGTA 3300 

AATCAGATTG TACGGATTAT TCCTACTTTA AAAGCTAATA ATAGAAAATT AAATGAAACA 3360 

TTTTATATTG AAACCCTTGG AATGAAGGCC TTGTTAGAAG AATCGGCCTT TCTGTCACTA 3420 

GGTGACCAAA CGGGTCTTGA AAAGCTGGTT TTAGAAGAAG CTCCCAGTAT GCGTACTCGT 3480 

AAGGTAGAGG GAAGAAAAAA ACTAGCTAGA TTGATTGTCA AGGTGGAAAA TCCCTTAGAA 3540 

ATTGAAGGAA TCTTATCTAA AACAGATTCG ATTCATCGAT TATATAAAGG TCAAAATGGC 3600 

TACGCTTTTG AAATTTTCTC ACCAGAAGAT GATTTGATTT TGATTCATGC GGAAGATGAC 3660 

ATAGCAAGTC TAGTAGAAGT AGGAGAAAAG CCTGAATTTC AAACAGATTT GGCATCAATT 3720 

TCTTTAAGTA AATTTGAGAT TTCTATGGAA TTACATCTCC CAACTGATAT CGAAAGTTTC 3780 

TTGGAATCAT CTGAAATTGG GGCATCCCTT GATTTTATTC CAGCTCAGGG GCAGGATTTG 3840 

ACTGTGGACA ATACGGTTAC CTGGGACTTA TCTATGCTCA AGTTCTTGGT CAATGAATTA 3900 

GACATAGCAA GTCTTCGCCA GAAGTTTGAG TCTACTGAAT ATTTTATTCC TAAGTCTGAA 3960 

AAATTCTTCC TTGGTAAAGA TAGAAATAAT GTTGAATTGT GGTTTGAAGA AGTATGAAGT 4020 

GGACCAAGAT TATTAAAAAA ATAGAAGAAC AAATCGAGGC AGGGATTTAT CCCGGAGCCT 4080 

CTTTTGCGTA TTTTAAGGAC AATCAATGGA CAGAGTTCTA TTTAGGCCAG AGTGACCCAG 4140 

AGCATGGCTT GCAGACTGAG GCAGGACTAG TTTATGACCT AGCTAGTGTC AGCAAGGTTG 4200 

TTGGGGTTGG CACAGTTTGT ACCTTCTTGT GGGAAATAGG TCAATTAGAT ATTGATAGAC 4260 

TGGTAATAGA TTTTTTACCT GAGAGTGATT ATCCAGACAT CACTATTCGC CAGCTCTTGA 4320 

CTCATGCAAC AGACCTTGAT CCTTTTATTC CTAATCGTGA TCTTTTAACA GCCCCTGAAT 4380 

TAAAGGAAGC GATGTTTCAT CTCAACAGAC GAAGTCAGCC AGCCTTTCTT TATTCGGATG 4440 

TCCATTTTTT GCTGTTGGGC TTTATTTTGG AAAGAATTTT TAATCAAGAT TTGGATGTGA 4500 

TTTTAAAGGA TCAAGTCTGG AAACCTTGGG GAATGACGGA AACTAAGTTT GGGCCAGTTG 4560 

AGCTTGCTGT TCCAACAGTT AGAGGTGTAG AGGCAGGCAT AGTGCATGAT CCCAAGGCfC 4620 

GTCTCCTGGG TAGACATGCT GGGAGTGCTG GTTTATTTTC GACTATAAAG GATTTACAAA 4680 

TCTTTTTAGA ACACTATTTA GCAGATGATT TTGCAAGAGA CTTAAATCAA AATTTTTCTC 4740 

CTTTGGATGA CAAGGAACGT TCTTTAGCAT GGAATTTGGA AGGAGATTGG CTAGACCATA 4 800 

CGGGCTATAC AGGTACCTTT ATCATGTGGA ATCGTCAGAA GCAAGAAGCC ACTATTTTCC 4860 

TATCGAATCG TACCTATGAA AAGGACGAGA GAGCTCAATG GATATTAGAC CGCAATCAAG 4920 

TGATGAACTT GATTCGCAAA GAAGAGTAAG GAGAGACATG TCAAATAGTT TAAAAGGGAC 4980 

TTTACTAACA GTTGTGGCTG GTATTGCTTG GGGGTTGTCA GGAACGAGTG GCCAATACCT 5040 
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AATGGCACAC 


GGAATTTCGG 


CTCTGGTCTT 


GACTAACTTG 


CGTCTTTTAA 


TCGCTGGTGG 


5100 


AATTCTCATG 


CTCTTGGCTT 


ATGCTACTGC 


AAAGGATAAA 


ATACTGGTCT 


TTTTAAAGGA 


5160 


TAGAAAGAGT 


TTGCTGTCTC 


TTCTTATTTT 


TGCTCTGATT 


GGTCTTTTTC 


TCAACCAATT 


5220 


CGCCTATCTG 


TCTGCTATTC 


AGGAGACCAA 


TGCGGGAACA 


GCGACGGTGC 


TTCAGTATGT 


5280 


TTGTCCTGTC 


GGAATTTTAA 


TTTATAGCTG 


TATCAAGGAT 


AGGGTGGCAC 


CGACACTGGG 


5340 


- AGAGATAGTT 


TCCATCATAT 


TCGCCATCGG 


AGGAACCTTC 


CTGATCGCAA 


CACATGGGCA 


5400 


GTTGGACCAG 


TTATCCATGA 


CACCTGCTGG 


TCTGTTCTGG 


GGTCTCTTTT 


CTGCCTTGAC 


5460 


TTATGCTCTG 


TATATCATTT 


TACCCATAGC 


CTTGATTAAA 


AAGTGGGGGA 


GCAGCTTGGT 


5520 


CATTGGTGTG 


GGAATGGTCA 


TAGCAGGTTT 


GGTCGCCCTT 


CCTTTTACAG 


GGGTTCTACA 


5580 


GGCCGATATC 


CCGACTAGTC 


TTGATTTTCT 


CCTTGCGTTT 


GCAGGCATTA 


TCCTTATCGG 


5640 


GACTGTCTTT 


GCCTATACAG 


CTTTCCTTAA 


AGGAGCCAGT 


CTGATAGGAC 


CGGTCAAGTC 


5700 


AAGCTTGTTG 


GCTTCAATTG 


AGCCAATATC 


GGCGATTTTC 


TTTGCCTTCT 


TAATAATGAA 


5760 


TGAACAATTT 


TATCCCATTG 


ATTTTCTTGG 


TATGGCAATG 


ATATTGTTTG 


CTGTAACTTT 


5820 


GATTTCTTTG 


AAAGATTTAT 


TCTTAGAAAA ATAAAAAAGA 


CTCTTTGTCC 


GTGACAGAGA 


5880 


GTTTTTGCGT GGTAATCTAA TTATTTTCAA GATAAAATTC AAAGCGTTCG CCTACATATT 


5940 


GACTTTTTAC 


GTATTCAAAA 


GCAGTACCAT 


CTTCTAGGTA 


GGAAACCTGG 


GTCAATCCAA 


6000 


GAATAGCATG 


TCCTTTTTCA 


ACTTCCAAAT 


AGTGGGCAAT 


CTTTTCTTTA 


GCAAGGCGAG 


6060 


CATAGATGGT 


CTGTTGAGAT 


TTGCCGATAC 


GATAGCCATG 


TTTTTGCAAG 


GTTTGGAAGA 


6120 


AATGACTGGT 


GATTTCTTCT 


TTTTTAAAGT 


CCTTAATGAA 


TTTTTCAGGA 


ATAGAAGCAA 


6180 


CTTCATAAAC 


TAGGGGAACT 


TGGTCGGCAT 


AGCGGACCCG 


CTCCATTCGG 


ATAATATTGT 


6240 


CCGTTGGAAA 


AATTCCTAGC 


TTGGCAACTT 


CTTGCTCATT 


GGGAATGGTT 


TTTTTGTAGG 


6300 


AAATGAGCTG 


GCTAGAGGGA 


ACTTTACCTT 


GGGATTTGAC 


AATTTCAGTA 


AAACTGGTTG 


6360 


TCCCTCGCAT 


CTTTTCTTGT 


ACTCGAGTAC 


TGGAAACAAA 


GGTGCCGCTT 


CCTACACGGC 


6420 


GCTCTAAGAC 


GCCTTCTTCG 


ACTAATAGAG 


ATACGGCTTG 


GCGGAGGGTC 


ATGCGACTGA 


6480 


CCGCAAACTG 


CTCAGCTAAA 


TCTCTTTCAC 


TGGGAAGCCT 


CTCACCAATA 


GCCCAACGGT 


6540 


ACTCGTCAAT 


ATCCTTTTTT 


ATCTGATCAT 


GGATTTTTAT 


ATAAGCAGGT 


AGCATATTTT 


6600 


TCACTTCATT 


TCTATCTTTT 


CTCTATTGTA 


CCCCAATAAA 


CTAGAAAAAG 


TCAAACTTCG 


6660 


CCTTGTTTAG 


TTGGTAATTC 


GCCCTTATTT 


GTGATAGAAT 


ATTGAGAAAA 


GATATTTCTT 


6720 


TTGAGAAAGG 


AAAAAGATGA 


GCAACATTTC 


AACTGATTTG 


CAAGATGTAG 


AAAAAATCAT 


6780 



A 
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CGTATTGGAC TATGGTAGCC AGTACAACCA GCTGATTTCA CGCCGTATCC GTGAGATTGG 6840 

TGTTTTTTCA GAACTAAAAA GCCATAAAAT TTCAGCTGCT GAAGTTCGTG AAGTCAATCC 6900 

TGTAGGAATT ATTCTATCAG GTGGTCCAAA TTCTGTATAT GAAGATGGTT CATTTGATAT 6960 

TGACCCAGAA ATCTTCGAAC TCGGAATTCC AATTTTGGGA ATCTGTTATG GTATGCAGTT 7020 

ATTGACCCAT AAACTTGGAG GAAAAGTTGT TCCTGCAGGT GATGCTGGAA ATCGTGAATA 7080 

CGGTCAATCA ACCCTAACTC ACACACCATC AGCGCTTTTT GAATCAACAC CTGATGAACA 7140 

GACTGTTTTG ATGAGCCATG GTGATGCGGT TACTGAGATT CCTGCTGACT TTGTTCGTAC 7200 

AGGTACATCA GCTGACTGCC CATACGCAGC CATCGAAAAC CCAGATAAAC ACATTTACGG 7260 

TATCCAATTC CACCCAGAAG TTCGTCATTC TGTATACGGA AATGATATCC TTCGTAACTT 7320 

TGCCCTTAAC ATTTGTAAGG CTAAAGGTGA CTGGTCAATG GATAATTTCA TTGACATGCA 7 380 

GATCAAAAAA ATTCGTGAAA CCGTCGGTGA TAAACGTGTC CTTCTTGGTC TATCAGGTGG 7440 

TGTTGACTCA TCTGTCGTTG GGGTTCTTCT CCAAAAAGCG ATTGGCGATC AATTGATCTG 7 500 

TATCTTCGTA GACCACGGTC TTCTTCGTAA AGGCGAAGCT GATCAAGTTA TGGACATGCT 7560 

CGGTGGTAAG TTTGGTTTGA ATATCGTCAA AGCAGACGCT GCTAAACGTT TCCTTGACAA 7620 

ACTTGCTGGC GTTTCTGACC CTGAACAAAA ACGTAAAATC ATCGGTAACG AGTTTGTCTA 7680 

TGTATTCGAT GACGAAGCAA GCAAGCTCAA AGATGTGAAA TTCCTTGCTC AAGGTACTTT 7740 

ATATACAGAT GTTATCGAGT CTGGTACGGA TACAGCTCAA ACTATCAAGT CACACCACAA 7800 

CGTGGtGGTC TTCCAGAAGA TATGCAGTTT GAATTGATTG AACCACTCAA TACTCTTTAC 7860 

AAGGATGAAG TTCGTGCTCT TGGTACAGAG CTTGGTATGC CAGACCATAT CGTATGGCGC 7920 

CAACCATTCC CAGGACCAGG ACTTGCTATC CGTGTCATGG GTGAAATCAC TGAAGAGAAA 7980 

CTTGAAACCG TTCGTGAATC AGACGCTATT CTTCGTGAAG AAATCGCTAA AGCTGGACTT 8040 

GACCGCGATA TTTGGCAATA CTTCACTGTT AACACAGGCG TTCGTTCAGT CGGTGTTATG 8100 

GGTGACGGTC GTACGTATGA CTACACGATT GCAATCCGTG CTATCACTTC TATCGATGGT 8160 

ATGACTGCTG ATTTTGCCAA AATTCCATGG GAAGTACTTC AAAAAATCTC AGTACGTATC 8220 

GTAAATGAAG TGGATCATGT TAACCGTATC GTCTACGATA TTACAAGTAA ACCACCTGCA 8280 

ACAGTTGAGT GGGAATAATC GCAAAAAAAT TAAAAGCTTT GTAAAATCAA CGGTTACAGA 8340 

GGATTAAAAA CTGTAACTGG GATTAAAACG GGAACATTTG CTAAAAAGAA TAAATTGAAT 8400 

AATAGTTCCA AGTGGTTTAC ATTTGGACAA AAAATTAGAC CGTAGTTTTC AAGCTGCGGT 84 60 

CTTTTGATAT ATATAATGAG AATTAATGGC TCTTTGTCAA CTGTAGTGGG TTGAAGTCAG 8520 

CTAAGCTCGA GAAAGGACAA ATTTTGTCCT TTCTTTTTTG AT ATT C AG AG CGATAAAAAT 8580 
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CCGTTTTTTG AAGTTTTCAA AGTTCCGAAA ACCAAAGGCA TTGCGCTTGA TAAGTTTGAT 8640 

GAGATTATTG GTCGCTTCCA ATTTGGCGTT AGAATAGTGT AGTTGAAGGG CGTTGACGAT 8700 

TTTCTCTTTG TCCTTTAGAA AGGTTTTAAA GACAGTCTGA AAAAGAGGAT GAACCTGCTT 8760 

TAGATTGTCC TCAATGAGTC CGAAAAATTT CTCCGGTTCC TTATTCTGAA AGTGAAACAG 8820 

CAAGAGTTGA TAGAGCTGAT AGTGATGTTT CAAGTCTTGT GAATAGCTCA AAAGCTTGTT 8880 

- TAAAATCTCT TTATTGGTTA AATGCATACG AAAAGTAGGG CGATAAAAAT GTTTATCGCT 8940 

GAGTTTACGA CTATCCTGTT GTATGAGCTT CCAGTAGCGC TTGATAGCCT TGTATTCATG 9000 

AGACTTTCGA TCCAATTGAT TCATGATTTG AACACGCACA CGACTCGG 9048 
(2) INFORMATION FOR SEQ ID NO: 160: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10399 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 

GTACCTTTAT TGATGAATGG ACTGTTTAAA TCAGTAGCAC GCCAACCAGA TATGCTTTCT 60 

GAGTTTCGTA GTTTGATGTT TTTAGGTGTT GCCTTTATTG AAGGAACTTT CTTTGTAACT 120 

CTTGTCTTCT CATTTATTAT CAAATAAATA CATGGAACGA GAAGAAAAGG GAGGATTTTA 180 

GATGGAAGAA AGTATTAATC CAATCATCTC TATTGGTCCT GTTATCTTCA ATCTGACTAT 240 

GTTAGCCATG ACTTTGTTGA TTGTGGGAGT TATTTTTGTC TTTATTTATT GGGCAAGCCG 300 

CAATATGACC TTGAAACCCA AAGGAAAGCA AAATGTACTT GAGTATGTCT ATGACTTTGT 360 

TATTGGATTT ACAGAACCTA ACATTGGTTC GCGCTACATG AAAGATTACT CACTCTTTTT 420 

CCTTTGTTTA TTCCTTTTCA TGGTGATTGC CAATAACCTT GGCTTAATGA CAAAGCTTCA 480 

AACGATCGAT GGGACTAACT GGTGGAGTTC GCCAACCGCT AATTTACAGT ATGACTTAAC 540 

CTTATCTTTT CTTGTCATTT TGTTGACACA TATAGAAAGC GTTCGTCGTC GTGGATTTAA 600 

AAAAAGTATA AAATCTTTTA TGAGTCCTGT TTTTGTCATA CCGATGAATA TCTTGGAAGA 660 

ATTTACAAAC TTCTTATCTT TGGCTTTGCG GATTTTTGGG AATATCTTTG CAGGAGAGGT 720 

CATGACGAGT TTGTTACTTC TTCTTTCCCA CCAAGCTATT TATTGGTATC CAGTAGCCTT 780 

TGGAGCTAAT TTGGCTTGGA CTGCATTTTC TGTCTTTATT TCCTGCATCC AAGCTTATGT 840 

TTTTACTCTT TTGACATCTG TGTATTTAGG GAATAAGATT AATATTGAAG AGGAATAGAA 900 
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AGGAGTAACT GATGCACGTA ACAGTAGGTG AATTAATTGG TAATTTTATT TTAATCACTG 960 

GCTCTTTTAT TCTTTTGCTA GTCTTGATTA AAAAATTTGC ATGGTCTAAT ATTACAGGCA 1020 

TTTTCGAAGA AAGAGCTGAA AAAATTGCTT CAGATATTGA CAGAGCTGAA GAAGCCCGTC 1080 

AAAAAGCAGA AGTATTGGCT CAAAAACGCG AAGATGAATT GGCTGGTAGC CGTAAAGAAG 1140 

CTAAGACAAT CATTGAAAAT GCAAAGGAAA CAGCTGAGCA AAGTAAGGCT AATATCTTAG 1200 

CAGATGCTAA ACTAGAAGCA GGACACTTAA AAGAAAAAGC CAATCAAGAA ATTGCTCAAA 1260 

ATAAAGTAGA AGCTTTACAG AGTGTTAAGG GTGAGGTCGC AGATTTGACC ATCAGCTTAG 1320 

CTGGTAAAAT CATCTCACAA AACCTTGACA GTCATGCCCA TAAAGCACTC ATTGATCAGT 1380 

ATATCGATCA GCTAGGAGAA GCTTAATGGA CAAGAAAACA GTAAAGGTAA TTGAAAAATA 1440 

CAGCATGCCT TTTGTCCAAT TGGTACTTGA AAAAGGAGAA GAAGACCGTA TCTTTTCAGA 1500 

CTTGACTCAA ATCAAGCAAG TTGTTGAAAA AACAGGTCTG CCTTCTTTTT TAAAACAAGT 1560 

GGCAGTAGAC GAGTCGGATA AGGAAAAAAC AATTGCTTTT TTCCAAGATT CTGTGTCGCC 1620 

TTTATTACAA AACTTTATCC AGGTTCTGGC CTACAATCAC AGAGCAAATC TTTTTTATGA 1680 

TGTGCTTGTA GATTGCTTGA ACCGACTTGA AAAAGAAACA AATCGATTTG AAGTGACGAT 1740 

TACGTCTGCT CATCCTCTAA CTGATGAACA GAAGACTCGT TTGCTCCCTT TGATTGAGAA 1800 

AAAAATGTCT CTGAAAGTAA GGAGTGTAAA AGAACAAATC GATGAAAGTC TCATTGGTGG 1860 

TTTTGTCATT TTTGCCAATC ACAAGACAAT TGATGTGAGT ATTAAACAAC AACTTAAAGT 1920 

TGTTAAAGAA AATTTGAAAT AGAAAGTGGT GTTCTTTTGG CAATTAACGC ACAAGAAATC 1980 

AGCGCTTTAA TTAAGCAACA AATTGAAAAT TTCAAACCCA ATTTTGATGT GACTGAAACA 2040 

GGTGTTGTAA CCTATATCGG GGACGGTATC GCGCGTGCTC ACGGCCTTGA AAATGTCATG 2100 

AGTGGAGAGT TGTTGAATTT TGAAAACGGC TCTTATGGTA TGGCTCAAAA CTTGGAGTCA 2160 

ACAGACGTTG GTATTATCAT CCTAGGTGAC TTTACAGATA TCCGTGAAGG CGATACAATC 2220 

CGCCGTACAG GGAAAATCAT GGAAGTCCCT GTAGGTGAAA GTCTGATTGG TCGTGTTGTG 2280 

GATCCGCTTG GTCGTCCAGT TGACGGTCTT GGAGAAATCC ACACTGATAA AACTCGTCCA 2340 

GTAGAAGCAC CAGCTCCTGG TGTTATGCAA CGTAAGTCTG TTTCAGAACC ATTGCAAACT 2400 

GGTTTGAAAG CTATTGACGC CCTTGTACCG ATTGGTCGTG GTCAACGTGA GTTGATTATC 2460 

GGTGACCGTC AGACAGGGAA AACAACCATT GCGATTGATA CAATCTTGAA CCAAAAAGAT 2520 

CAAGATATGA TCTGTATCTA CGTCGCGATT GGACAAAAAG AATCAACAGT TCGTACGCAA 2580 

GTAGAAACAC TTCGTCAGTA CGGTGCCTTG GACTACACAA TCGTTGTGAC AGCCTCTGCT 2640 

TCACAACCAT CTCCATTGCT CTTCCTAGCT CCTTATGCTG GGGTTGCTAT GGCGGAAGAA 2700 
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TTTATGTATC 


AAGGTAAGCA 


TGTTTTGATT 


GTATACGATG 


ATCTATCAAA 


ACAAGCGGTA 


2760 


GCTTATCGTG 


AACTGTCGCT 


CTTGCTTCGT 


CGTCCTCCAG 


GTCGTGAAGC 


CTTCCCAGGG 


2B20 


GATGTTTTCT 


ATCTCCACAG 


CCGTTTGCTT 


GAGCGCTCAG 


CTAAAGTTTC 


TGATGAACTT 


2880 


GGTGGTGGAT 


CAATTACAGC 


CCTACCATTT 


ATCGAGACAC 


AAGCAGGAGA 


TATCTCAGCC 


2940 


TATATCGCAA 


CCAACGTGAT 


TTCTATCACT 


GATGGACAAA 


TCTTCCTTGG 


CGATGGCCTC 


3000 


TTCAATGCAG 


GTATTCGTCC 


AGCCATCGAT 


GCGGGTTCAT 


CTGTATCTCG 


TGTAGGTGGT 


3060 


TCTGCACAAA 


TCAAAGCCAT 


GAAGAAGGTT 


GCTGGTACAC 


TTCGTATCGA 


CCTTGCTTCA 


3120 


TACCGTGAGT 


TGGAAGCCTT 


TACTAAGTTT 


GGTTCTGACT 


TGGACGCAGC 


AACACAGGCT 


3180 


AAGTTGAACC 


GTGGACGTCG 


TACCGTTGAG 


GTCTTGAAAC 


AACCTGTTCA 


CAAACCATTA 


3240 


CCTGTTGAGA 


AACAAGTAAC 


CATTCTTTAT 


GCTTTGACAC 


ATGGTTTCTT 


GGATACTGTT 


3300 


CCAGTAGATG 


ATATTGTTCG 


TTTCGAGGAA 


GAGTTCCATG 


CCTTCTTTGA 


TGCTCAACAT 


3360 


CCAGAGATTT 


TGGAAACCAT 


TCGTGATACA 


AAAGACTTGC 


CAGAAGAAGC 


AGTCTTGGAT 


3420 


GCTGCGATTA 


CAGAGTTTCT 


CAATCAATCT 


AGCTTCCAAT 


AAGAATAGAG 


GTGTCAGATG 


3480 


GCAGTATCTC 


TAAATGATAT 


TAAAACAAAA 


ATCGCCTCAA 


CAAAAAATAC 


GAGTCAAATC 


3540 


ACTAATGCCA 


TGCAAATGGT 


ATCGGCTGCT 


AAGCTAGGTC 


GTTCTGAAGA 


AGCTGCTCGC 


3600 


AACTTCCAAG 


TTTACGCTCA 


GAAAGTGCGT 


AAACTTTTGA 


CAGATATCCT 


TCATGGTAAT 


3660 


GGAGCTGGTG 


CTTCAACTAA 


TCCGATGTTG 


ATTAGCCGTT 


CTGTGAAGAA 


GACAGGCTAT 


3720 


ATCGTTATCA 


CTTCAGACCG 


CGGTTTGGTT 


GGAGGTTATA 


ATTCCTCTAT 


TTTGAAAGCT 


3780 


GTTATGGAGT 


TGAAAGAAGA 


ATACCACCCA 


GACGGTAAAG 


GTTTTGAAAT 


GATCTGTATC 


3840 


GGTGGGATGG 


GAGCTGATTT 


CTTTAAGGCT 


CGCGGTATTC 


AACCACTTTA 


TGAATTACGT 


3900 


GGCTTGTCAG 


ACCAACCTAG 


CTTTGATCAA 


GTTCGTAAGA 


TTATTTCAAA 


AACTGTTGAA 


3960 


ATGTACCAAA 


ATGAACTCTT 


TGATGAGCTT 


TATGTTTGCT 


ACAACCACCA 


TGTCAATACG 


4020 


CTAACCAGTC 


AAATGCGTGT 


GGAACAAATG 


CTTCCGATTG 


TTGACTTGGA 


TCCAAATGAA 


4080 


GCGGATGAAG 


AGTACAGCTT 


GACTTTTGAA 


TTGGAAACCA 


GCCGAGAAGA 


AATTCTGGAG 


4140 


CAGTTGTTGC 


CTCAGTTTGC 


AGAAAGTATG 


ATTTACGGTG 


CCATTATCGA 


TGCCAAGACA 


4200 


GCTGAGAATG 


CTGCGGGCAT 


GACAGCCATG 


CAAACAGCGA 


CAGATAATGC 


TAAGAAAGTC 


4260 


ATCAATGATT 


TGACAATTCA 


GTATAACCGT 


GCCAGACAGG 


CGGCGATTAC 


ACAAGAAATT 


4320 


ACAGAAATCG 


TAGCAGGTGC 


TAGTGCCTTA 


GAATAGGCTC 


TAGTCCAGCT 


CGTATGAAAA 


4380 


TGAACTTAGG 


ACCTAGTTGA 


GCTAGGAACC 


GACAGTATCT 


TATATAGAAT 


AGGAGAAGGA 


4440 



WO 98/18931 



PCTYUS97/19588 



1040 

GATGAGTTCA GGTAAAATTG CTCAGGTTAT CGGTCCCGTT GTAGACGTTT TGTTTGCAGC 4500 

AGGGGAAAAA CTTCCTGAGA TTAACAATGC ACTTGTCGTC TACAAAAATG ACGAAAGAAA 4560 

AACAAAAATC GTCCTTGAAG TAGCCTTGGA GTTAGGAGAT GGTATGGTTC GTACTATCGC 4620 

CATGGAATCA ACAGATGGGT TGACTCGTGG AATGGAAGTA TTGGACACAG GTCGTCCAAT 4680 

CTCTGTACCA GTAGGTAAAG AAACTTTGGG ACGTGTCTTC AACGTTTTGG GAGATACCAT 4740 

TGACTTGGAA GCTCCTTTTA CAGAAGACGC AGAGCGTCAG CCAATTCATA AAAAAGCTCC 4800 

AACTTTTGAT GAGTTGTCTA CCTCTTCTGA AATCCTTGAA ACAGGGATCA AGGTTATTGA 4860 

CCTTCTTGCC CCTTACCTTA AAGGTGGTAA AGTTGGACTT TTCGGTGGTG CCGGAGTTGG 4920 

TAAAACTGTC TTAATCCAAG AATTGATTCA CAACATTGCC CAAGAGCACG GTGGTATTTC 4980 

AGTATTTGCT GGTGTTGGGG AACGTACTCG TGAGGGGAAT GACCTTTACT GGGAAATGAA 5040 

AGAATCAGGC GTTATCGAGA AAACAGCCAT GGTCTTTGGT CAGATGAATG AGCCACCAGG 5100 

AGCACGTATG CGTGTTGCCC TTACTGGTTT GACAATCGCT GAATACTTCC GTGATGTGGA 5160 

AGGCCAAGAC GTGCTTCTCT TTATCGATAA TATCTTCCGT TTCACTCAGG CTGGTTCAGA 5220 

AGTATCTGCC CTTTTGGGTC GTATGCCATC AGCCGTTGGT TACCAACCAA CACTTGCTAC 5280 

GGAAATGGGT CAATTGCAAG AACGTATCAC ATCAACCAAG AAGGGTTCTG TAACCTCTAT 5340 

CCAGGCTATC TATGTGCCAG CGGATGACTA TACTGACCCA GCGCCAGCAA CAGCCTTCGC 5400 

TCACTTGGAT TCAACAACAA ACTTGGAACG TAAGTTGGTA CAATTGGGTA TCTACCCAGC 5460 

CGTTGACCCA CTTGCTTCAA GCTCACGTGC CTTGGCACCT GAAATCGTTG GAGAAGAGCA 5520 

CTATGCAGTT GCTGCTGAAG TAAAACGTGT CCTTCAACGT TACCATGAAT TGCAAGATAT 5580 

CATTGCTATC CTTGGTATGG ATGAGCTTTC TGATGAAGAA AAGACCTTGG TTGCTCGCGC 5640 

CCGTCGTATC CAGTTCTTCT TGTCACAAAA CTTCAACGTT GCGGAACAAT TTACTGGTCA 5700 

GCCAGGTTCT TATGTTCCAG TTGCTGAAAC TGTACGTGGC TTTAAGGAAA TCCTTGATGG 5760 

TAAATACGAC CACTTGCCAG AAGATGCCTT CCGTGGTGTA GGTTCTATCG AAGATGTGAT 5820 

TGCAAAAGCT GAAAAAATGG GATTTTAAGA GGTGATCTAT GGCTCAGTTA ACTGTCCAGA 5880 

TCGTGACACC AGATGGTCTC GTCTATGATC ACCATGCCAG CTATGTATCG GTTCGAACTC 5940 

TGGATGGTGA GATGGGGATC TTGCCACGAC ATGAAAATAT GATTGCGGTT TTAGCAGTTG 6000 

ATGAAGTAAA GGTAAAACGT ATCGATGATA AAGATCACGT GAACTGGATT GCAGTAAACG 6060 

GTGGCGTTAT TGAAATTGCC AATGATATGA TCACAATCGT CGCTGACTCT GCAGAACGTG 6120 

CTCGTGATAT CGATATCAGT CGTGCAGAAC GTGCCAAACT TCGTGCAGAA CGTGCAATTG 6180 

AAGAAGCACA AGACAAACAT TTGATTGACC AAGAACGTCG TGCTAAGATT GCTTTGCAAC 6240 
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GTGCTATTAA CCGTATTAAT GTCGGAAATA GACTATAAGA AAAAATGAAC TTGAAAATAC 6300 

CAAGTTCATT TTTTATGGTG TTTTAAGGAG CAAAACGGAT GCAGACTGCT TCGGGAACAT 6360 

GGAAGTCGTT GGAGAGTTCT GCTAGACGAC CATTGTCACA ATTACGTTTA AAGACAGTTG 6420 

CATTGTCAGA GTCTTGATGG ACAACAATGA GAAATTTTTG GTCGGGTGTC AAATCAAAAT 6480 

CACGTGGAGT CTGACCATGC GTTGGAACGA TTTCTAATAA CTCTAAGCTA CCGTCCGCAA 6540 

•GGATGGTATA TACTGCGATA GAATCATGGC CACGGTTAGA AGCGTAGAGG TATTTACCGT 6600 

CTTTAGAGAG ATGAATAGCA GCGGTTCCAT TAAAGCCTTC GTAAGCTTCC GGTAAAGTTG 6660 

AAATGACCTG CATACGTTCA AATTCGCCAA CGCCATCGTA GATTAAAACT TCGATAGTAC 6720 

TATTGAGTTC ACAAATGAGA TAAGCGATTT TATAGTGGTT ATGGAAAATG ATATGGCGTG 6780 

AGCCTGCTCC TGGCTTGCTG TGATAGGTAT AGAGCTTAGA TAATTTTCCT TCTTGATCGA 6840 

GGTCATAGGT GATGACTTGG TCAGTTCCCA AGTCGCAGGT CACTAGATAG TGGTCAGGTG 6900 

TTAAATCTGT ATAGTGAACA TGGGGGGAAG CTTGATTTTC ATGTGGACCT TGGCCACTGT 6960 

GTTGATCCAT ATCACTAAGT AGAAGACTAC CATCTTCCTG GCGTTTATAA ACAAGGACTT 7020 

GTCCCTTGTG ATAGTTAGCT GCGTAAACCA AATCACGCTT TTCATCGACA GCAACATAAC 7080 

AGTGGGGAGC TCCTTCTTCA ACAACATGAT TTAACACAGT CCCGTCAGTT TGATAGGCTG 7140 

CAATTCCCCC CTTATCGTCT TGGCTACCAA CAGTGTATAA ATGTTGGTGC TGGTCAAAGG 7200 

CAAGGTAGGT TGGACTTGGC TCAGCTGCAA AAAGTTCTAG ATTTGAAAGC TGACCAGTTT 7260 

CTGTATCAAA GTCTGCCTTG TAAATCCCTT GAGAAGTACG ACGTGTATAA GTTCCAAAAT 7320 

AAACAGTTTC TTTCATTACT ATACCTCTGT GTAAAGATAA GACTATTATA TCACAAAAAC 7380 

AAGTAAATTA AAGATATCCA ATT AG AT GT A AGCACTTTAA AAAAGAGTTA TTTTGTTTCA 7440 

AAAATGGTAT AATGAGAGAA CAATAGAAAG GAAGTATTTA TGGAGCAAAA AGAGAAACAT 7500 

TTTAGCCTAT CTTGGTTTTT CAAGTGGTTT TTAGATAACA AGGCAATTAC GGTATTTTTA 7560 

GTAACCTTAT TATTGGGACT GAATCTTTTT ATTTTAAGTA AGATTAGTTT TCTATTTTCA 7620 

CCTGTTTTAG ACTTTTTAGC AGTTGTGATG TTGCCAGTCA TTTTGTCTGG TTTGTTATAT 7680 

TATTTGTTGA ATCCTATTGT TGATTGGATG GAGAAGCATA AGGTTAATCG TGTTATAGCT 7740 

ATCACTATTG TCTTTGTTAT CATCGCTCTC TTTATCATTT GGGGCTTGGC AGTCGCCATT 7800 

CCAAATCTGC AACGTCAGGT TTTGACCTTT GCAAGAAACG TTCCTGTTTA CTTAGAAGAT 7860 

ATAGATAGGA TTGTTAATGG ATTGGTAGCC CAGCACCTGC CAGATGATTT CAGACCTCAA 7920 

TTAGAGCAAG TTTTGACCAA TTTTTCTAGC CAGGCTACAG TTTTGGCAAG TAAGGTTTCA 7980 
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TCTCAGGCAG TCAACTGGGT GAGTGCCTTT ATTAGCGGGG CTTCTCAAGT GATTGTTGCC 8040 

TTGATTATCG TTCCTTTCAT GCTCTTTTAT CTCTTGCGTG ATGGGAAAGG CTTGCGTAAC 8100 

TATTTGACCC AATTCATTCC AAGAAAATTG AAGGAACCTG TTGGACAAGT TTTATCAGAT 8160 

GTGAATCAAC AGTTGTCCAA CTATGTTCGA GGGCAAGTGA CAGTGGCTAT TATTGTAGCA 8220 

GTAATGTTTA TCATCTTCTT CAAGATTATT GGTCTACGCT ATGCGGTTAC GCTGGGGGTT 8280 

ACTGCTGGTA TTTTAAATCT GGTCCCTTAT CTTGGTAGCT TTCTAGCCAT GCTTCCTGCT 8340 

CTAGTATTGG GTTTGATTGC TGGTCCAGTC ATGCTTTTGA AAGTAGTGAT TGTCTTTATC 8400 

GTAGAACAAA CTATTGAAGG CCGTTTTGTC TCTCCATTGA TTTTGGGAAG TCAATTAAAC 8460 

ATCCACCCTA TTAATGTTCT CTTTGTTTTG TTAACTTCAG GATCTATGTT TGGTATCTGG 8520 

GGAGTTTTAC TTGGTATTCC GGTTTATGCC TCTGCTAAGG TTGTCATTTC AGCCATTTTC 8580 

GAATGGTATA AGGTAGTCAG TGGTCTATAT GAATTAGAGG GTGAGGAAGT CAAGAGTGAA 8640 

CAATAGTCAA CAGATGTTAC AGGCTTTGGA GGAGCAAGAT TTAACTAAGG CTGAGCATTA 8700 

TTTCGCCAAA GCTTTAGAAA ATGATTCAAG TGATCTTCTG TATGAATTGG CAACTTATCT 8760 

TGAAGGGATT GGTTTCTATC CTCAGGCCAA GGAAATTTAC CTGAAAATTG TAGAGGATTT 8820 

TCCAGAGGTT CATCTTAATC TAGCTGCAAT TGCTAGCGAG GATGGTCAAA TAGAAGAAGC 8880 

CTTTACCTAT CTTGAGGAAA TCCAAGCTGA CAGTGACTGG TATGTCTCGT CTTTGGCTCT 8940 

GAAGGCAGAC CTTTACCAGC TGGAAGGTTT GACAGATGTG GCACGTGAGA AATTATTGGA 9000 

GGCCTTGACC TACTCAGAGG ATTCTCTCTT GATATTGGGT TTGGCAGAGT TGGATAGTGA 9060 

GTTGGAAAAT TACCAAGCGG CTATTCAAGC CTATGCCCAG TTAGATAATC GCTCGATTTA 9120 

TGAGCAAACG GGCATTTCCA CCTATCAACG AATTGGCTTT GCCTATGCTC AGTTAGGGAA 9180 

ATTTGAAACG GCTACTGAGT TTTTAGAAAA AGCCCTGGAG TTAGAATACG ATGACTTAAC 9240 

AGCTTTTGAG TTGGCCAGTC TTTATTTTGA TCAAGAAGAA TATCAAAAAG CCACCCTCTA 9300 

CTTTAAGCAG CTTGATACCA TTTCTCCTGA CTTTGAAGGC TATGAGTATG GGTACAGT CA 9360 

GGCTTTACAT AAGGAACATC AAGTTCAAGA AGCCCTGCGT ATCGCTAAGC AAGGATTAGA 9420 

GAAAAATCCC TTTGAAACTC GCCTCTTGCT AGCTGCTTCA CAATTTTCTT ATGAATTGCA 9480 

TGATGCTAGT GGTGCAGAAA ATTATCTCCT TACTGCAAAA GAAGACGCTG AGGATACAGA 9540 

AGAAATCTTG CTTCGTTTAG CCACTATTTA TCTGGAGCAG GAGCGTTATG AGGATATTCT 9600 

AGAATTGCAG AGTGAGGAGC CAGAAAATCT TTTGACCAAG TGGATGATTG CTCGTTCTTA 9660 

TCAAGAAATG GACGATTTGG ATACTGCTTA TGAGTATTAT CAAGAGTTGA CAGGAGATTT 9720 

GAAGGACAAT CCAGAATTTC TGGAACACTA TATCTATCTC TTGCGTGAAT TGGGACATTT 9780 
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TGAAGAAGCA AAAGTCCATG CTCACACTTA CTTAAAACTG GTTCCAGATG ATGTGCAAAT 9840 

GCAAGAACTG TTTGAGAGAT TGTAAGAATG TTTAACCCAA ATCATTCATA CCTCTCTCAA 9900 

CTAGATGTAA CTTACAAAAC CCCTGACCTC ATGAGCCACT TTCTTCCTCC TCATGAGGTC 9960 

AGTTTTACTT TCTGCTGTTC CAGTATCGTT TTTCCTCGCT AGATTTCCTC AAAAGGGCAG 10020 

ACTCCTCCCT TGGTGCGTCA CACGATTTTT TCATCTCGAC TGTTCTTTAA TGCATCATTA 10080 

. ACGACGCTTT TCTTCTAGGT GGTTCATAAG GAACAGGAAG ATTCAGGTTG ACTTTTCTAA 10140 

TCCTAGAATA AAGTGCTGAA AACAATTCGG AATAGGCATA GAGACTAGAC AATTTGAGGA 10200 

GCTGCTTGCG TCCTGTTCGA ACAGATTTTC CCACCACGTG AAGAAAAAGA TGGCGGAAGC 10260 

GTTTGATTGT TAAAGTTTGG AAGTCACCTC CAGCTAGATG TTTGAGAAAA AGATAGAGAT 10320 

TGTAGGCGAT ACAGCTCATC ATCATACGAA TTCGTTTTTG ATTAAGGTTG AACTATCCGT 10380 

TTTATCGCCA AAAAATCGG 10399 

(2) INFORMATION FOR SEC; ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9409 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 

GATAAGATTA AGTTAGAAAA GAAAGAACTA GGACATATCT ACCAGATTCA GGTTTTTAAT 60 

AGCTATGGGC AGGAAGAAAT CTATCGTGTG ATTTTGATGG AGACCAATAT TAGTTCGGTT 120 

TCAACCAATA TCAAGTATGC TGCTGTCTTG ATTAATACCA GTCAGTTGGA ACAGGCTAGT 180 

CAAAAGCATG AGCAATTGAT TGTGGTCGTG ATGGCTAGTT TCTGGATTTT GTCTTTACTT 240 

GCCAGTCTCT ATCTAGCTAG GGTCAGTGTT AGGCCCCTGC TTGAGAGTAT GCAGAAGCAA 300 

CAGTCTTTTG TGGAAAATGC CAGTCATGAG TTACGAACTC CACTCGCAGT TTTGCAAAAT 360 

CGCTTAGAGA CCCTTTTTCG TAAGCCAGAA GCTACCATTA TGGATGTGAG CGAAAGCATT 4 20 

GCATCGAGTT TGGAAGAAGT CCGAAATATG CGTTTTTTAA CGACAAGCTT GCTGAACTTA 480 

GCTCGGAGAG ATGATGGGAT TAAGCCGGAG CTTGCAGAAG TTCCAACTAG CTTTTTTAAT 540 

ACAACTTTCA CAAACTACGA GATGATTGCT TCGGAAAATA ATCGTGTCTT CCGTTTTGAA 600 

AATCGTATCC ATCGAACAAT TGTCACAGAT CAGCTTCTTC TGAAACAACT GATGACCATT 660 

CTTTTCGATA ATGCCGTCAA GTATACTGAG GAGGATGGTG AAATTGATTT TCTTATCTCG 720 
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GCGACCGATC GCAATCTTTA TTTACTTGTT TCTGATAATG GAATCGGTAT TTCGACAGAA 780 

GATAAAAAGA AAATTTTTGA CCGTTTTTAT CGAGTAGACA AGGCTAGAAC CCGGCAAAAA 840 

GGTGGTTTTG GTTTAGGATT ATCCCTAGCC AAGCAAATTG TAGATGCTCT AAAAGGAACT 900 

GTTACTGTCA AAGATAATAA ACCCAAGGGA ACAATCTTTG AAGTGAAGAT TGCCATTCAG 960 

ACACCATCTA AAAAGAAAAA ATAAAAATAT CGCTCCAATT GGGGCGATAT TTTGGATTTA 1020 

TCTTCTACGT TTTCGTTTGA TAATAGACCG TTGAACTTTT AAAACAAGTA AGCTGAATCC 1080 

GATTGCTGCG GCAAAGGCAA GAGCAGTTGA TAATTTTAAT GCTAAAAAGA TAAAACTAAA 1140 

GATAGCAATA CAGATACAAA AAACAGCGAT ATTAATAAAA AATAGGATTT CCTTGAGATT 1200 

GGCATCAGAT TGCGCTTCAG GTGTATAAGC TTGGTAATGA GGAAGCTGCT GGTTTAATTC 1260 

TTCTTGATAG TCTACCTCAT AGGATTGTAA TTTTCTTACG GGCATGATTC TCTCCTTAAC 1320 

AGTACATACC TATTTTATCA TTTTTTCGGC AGAGAATTAT TACAGAAAGG TTACAAAAAG 1380 

AATAAAGTCC CTTTTCATTT TCAAAGCATG GCTGATTTTG GAGAAATGTG GTATAATTTT 1440 

TCTTATGGAA AAGATTGTCA TTACAGCAAC TGCTGAAAGT ATTGAACAAG TTGAACAACT 1500 

ACTCGAAGCT GGCGTAGACC GTATCTATGT CGGTGAGAAA GATTTTGGTC TTCGTCTGCC 1560 

AACGACCTTT AGTTATGACC AATTACGTGA AATCGCTAAG TTGGTTCATG ATGCTGGTAA 1620 

GGAATTGATC GTTGCGGTCA ATGCTCTCAT GCACCAAGAT ATGATGGACC GTATCAAGCC 1680 

TTTCTTAAAC TTCTTGGAAG AAATCAAGAC AGACTATATT ACGATTGGGG ATGCAGGCGT 1740 

CTTTTACGTA GTTAACCGCG ATGGTTATTC ATTTAAGACC ATCTACGATG CTTCAACCAT 1800 

GGTAACTAGC AGTCGTCAGA TTAACTTCTG GGGACAAAAG GCTGGCGCAT CTGAGGCTGT 1860 

TTTGGCGCGT GAAATTCCAT CAGCTGAACT TTTCAAAATG CCAGAGATTT TGGAAATTCC 1920 

TGCTGAAGTT TTGGTTTACG GTGCTAGCGT CATCCATCAT TCTAAACGTC CACTCTTGCA 1980 

AAACTACTAT AACTTTACAC ATATCGATGA TGAAAAGACG CATAAACGTG ACCTCTTCTT 2040 

GGCTGAGCCA AGTGATCCAG AGAGCCACTA TTCCATTTTT GAAGATAATC ATGGGACCCA 2100 

TATCTTTGCC AACAATGACC TTGATTTGAT GATCAAATTA ACAGAATTGG TGGAGCATGG 2160 

CTTTACTCGC TGGAAACTAG AAGGGCTCTA CACTCCTGGT CAGAACTTTG TTGAGATTGC 2220 

AAAACTCTTT ATCCAAGCGC GTAGCTTGAT TCAAGAGGGC AACTTTAGTC ATGCTCAAGC 2280 

CTTCTTGCTG GATGAAGAAG TTCGTAAACT TCACCCTAAA AACCGTTTCC TTGATACAGG 2340 

ATTTTATGAC TACGATCCTG ACATGGTTAG ATAAAATACA TGATTCGTTG AGAGAAGGAA 2400 

GATGCAAACA TTTCTTCTCT CAATTTTTCG TATTTCTTCA CTATTTTACA AAAATCAGCA 2460 

GGCTAGAATG CTCTATTCGA TGGGATTTTT AAGAAAAGTA GTGTTCTTGA GTTTGAAAAT 2520 
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TATCCTATGT TTGCAGGTGC CAAATGGCCC TTTTTTTGGT ATAATTTTTT ATAATGAAAA 2580 

CGATTGGTAA TCGCTATGTT GTGGTGGATT TAGAGGCAAC TAGCACAGGT AGTAAGGCTA 2640 

AAATTATCCA AGTGGGAATT GTCGTGATTG AGGACGGAGA AATCGTCGAT CACTATACGA 2700 

CGGATGTCAA TCCACATGAA CCCTTGGATG CTCATATCAA AGAACTGACA GGATTGACAG 2760 

ACCAACGTCT GGCGCAAGCA CCTGATTTTT CGCAAGTTGC CAGAAAAATA TTTGACTTGG 2820 

.TGGAGGATGG GATTTTTGTA GCCCATAATG TTCAGTTTGA TGCTAATCTC TTGGCGGAAA 2880 

ATTTATTTTT TGAAGGCTAT GAGCTAAGAA ACCCTCGTGT TGATACGGTC GAATTGGCCC 2940 

AGGTCTTTTT CCCTGAACTG GAAAAATATA GCTTGCCGAT TTTGTGTCGA GAATTAGGAA 3000 

TTCCTCTTAA ACACGCACAC ACAGCCCTTT CAGATGCCCA AGCTACAGCA GAATTACTTC 3060 

TTTTTTTACG GAAAAAGATG ACCCAGCTTC CTAAAGGTCT CTTGGAACGC TTGCTGGAAA 3120 

TGGCTGACGC TCTCCTATAT GAGTCCTACC TGGTTATTGA GGAAACTTAT CGCAACCAAT 3180 

CTATCCTGAG TTCTCCAGAC TTGGTCCAAG TTCAAGGTCT ATATTTTAAG AAAACGGAAG 3240 

CTTCTCTGGA GCCACGAAAA CTATCTCAAG ACTTTTCTAA AAATATTTCT CTGTTGAACC 3300 

TTGAAGTGAG GGAGGAACAA GAAAGTTTTG CTAAAGAGGT TGGCTTGCTA TTGAAAGATG 3360 

AACCTGTCTC TCTGATTCAA GCGCCGACAG GGATTGGGAA AACCTATGGC TATCTCTTAC 3420 

CCGCTTTATC TCAATCCAAA GAGCGACAAA TTGTTCTTAG TGTTCCGACA AAGATTCTTC 3480 

AAAATCAAAT CATGGAAGAA GAAGGTAAAC GCCTCAAGGA AGTGTTCCAT ACAGATATTC 3540 

ATAGCTTAAA GGGACCACAA AATTATCTGA AGTTGGATGC CTTTTATCAT TCCTTGCAGG 3600 

AAAATGATGA AAATCGCTTA TTTAGACGCT TTAAAATGCA AGTCTTGGTC TGGCTTACTG 3660 

AGACAGAGAC AGGAGATTTG GATGAAATCG GGCAACTCTA CCGTTACCAA CATTTTCTAG 3720 

CAGACCTTCG TCATGATGGG AATTTATCAT CCCAGAGCTT ATTTGTGACG GAAGATTTTT 3780 

GGAAACGTAG TCAAGAAAGG GCAGAGACTT GCAAGCTTTT AGTGACTAAT CATGCCTATC 3840 

TCGTAACCAG ACTTGAAGAT AATCCTGAAT TTGTCAGTGA CCGTTTACTG ATTATTGATG 3900 

AAGTCCAAAA GATTTTGTTA GCTCTAGAAA ATCTGCTTCA AGAGACCTAC GATATACAAT 3960 

CTATTATCGA TTTAATTGAT AAGGCTTTAG TAGGAGAAGA AAACAGGGTT CAACAACGGA 4020 

TACTAGAAAG TATTCGCTTT GAGTGTCTCT ACTTGATAGA ACAATTTCAG TCTGGCAAAT 4080 

CTAGGAAAAA TATCTTAGAT TCTCTGGACA ATCTCCATCA GTATTTTTCA GAATTGGAAG 4140 

TAGAAGACTT TGATGAGCTG GTTCGCTATT TTACAGCTGA AGGTGATTAC TGGCTTGAAG 4200 

TAACTGAAAC GAGTCAAAAG AAAATTCAGA TTTCTTCTAC AAAATCAGGC CGTACTCTTC 4260 



WO 98/18931 



PCTYUS97/19588 



1046 

TGTCCTCTTT ACTTCCTGAG AGTTGCCAAG TCTTGGGAGT ATCGGCTACT CTTGAGATTA 4 320 

GTCAGAGGGT TTCTTTGGCA GACCTTTTAG GCTATCCTGA AGCTAAATTT GTCAAGATTG 4380 

AATCTCGGGG AAAACAGGAA CAAGAAGTGG TCATGGTCAA AGATTTCCCT CTGGTAACAG 4440 

AAACCTCCTT AGAAGTCTAT GCCAGAGAGG TAGCTGCTTT ACTAGTGGAA ATTCAAGCTT 4500 

TCCAGCAACC GATTTTGGTT CTCTTTACCG CTAAAGACAT GCTTCTAGCA GTATCGGATT 4560 

TACTTACAGT TAGCCACTTG GCCCAGTATA AAAATGGGGA TGTTCATCAG CTAAAGAAAC 4620 

GCTTTGAAAA AGGTGAACAA CAAATCTTGC TTGGTGCAGC AAGTTTCTGG GAGGGAGTTG 4680 

ATTTTTCAAG CCATCCTTCT GTGATTCAAG TTGTACCGAG GCTTCCTTTC CAAAATCCTC 4740 

AAGAACCCTT GACGAAAAAG ATTAATCAAG AACTGAATCA AGAAGGGAAA AATGCCTTTT 4800 

ATGATTATCA ATTGCCAATG GCCATTATTC GTTTAAAACA GGCTTTGGGA AGAAGTATGA 4860 

GACGTGAATA CCAACGTTCC TTAACTCTTA TTTTGGATAG GAGAATCGTC GGAAAACGAT 4920 

ACGGCAAACA AATAGTAGCA TCTCTAGCAG AAGAAGCGAC TGTTAAAACC ATCTCTCGAT 4980 

CCGAAGTTGA CGAGGCTATT GATAGATTTT TTAATGAGCT TTGATAAATA GTATTGTATG 5040 

AAAGTATAAG GTTAGTATAT ATGAAACGTT CTCTCGACTC AAGAGTCGAT TACAGTTTGC 5100 

TCTTGCCAGT ATTTTTTCTA CTGGTCATCG GTGTGGTGGC TATCTATATA GCCGTTAGTC 5160 

ATGATTATCC CAATAATATT CTGCCCATTT TAGGGCAGCA GGTCGCCTGG ATTGCCTTGG 5220 

GGCTTGTGAT TGGTTTTGTG GTCATGCTCT TTAATACAGA ATTTCTTTGG AAGGTGACCC 5280 

CCTTTCTATA TATTTTAGGC TTGGGACTTA TGATCTTGCC GATTGTATTT TATAATCCAA 5340 

GCTTAGTTGC ATCAACGGGT GCCAAAAACT GGGTATCAAT AAATGGAATT ACCCTATTCC 5400 

AACCGTCAGA ATTTATGAAG ATATCCTATA TCCTCATGTT GGCTCGTGTC ATTGTCCAAT 5460 

TTACAAAGAA ACATAAGGAA TGGAGACGCA CGGTTCCGCT GGACTTTTTG TTAATTTTCT 5520 

GGATGATTCT CTTTACCATT CCAGTCCTAG TTCTTTTAGC ACTTCAAAGT GACTTGGGGA 5580 

CGGCTTTGGT TTTTGTAGCC ATTTTCTCAG GAATCGTTTT ATTATCAGGG GTTTCTTGGA 5640 

AAATTATTAT CCCAGTATTT GTGACTGCTG TAACAGGAGT TGCTGGTTTC TTAGCTATCT 1,700 

TTATTAGCAA GGACGGACGA GCTTTTCTTC ACCAGATTGG AATGCCGACC TACCAAATTA 5760 

ATCGGATTTT GGCTTGGCTC AATCCCTTTG AGTTTGCCCA AACAACGACT TACCAGCAGG 5820 

CTCAAGGGCA GATTGCCATT GGGAGTGGTG GCTTATTTGG TCAGGGATTT AATGCTTCGA 5880 

ATCTGCTTAT CCCAGTTCGA GAGTCAGATA TGATTTTTAC GGTTATTGCA GAAGATTTTG 5940 

GCTTTATTGG CTCTGTCCTG GTTATTGCCC TCTATCTCAT GTTGATTTAC CGTATGTTGA 6000 

AGATTACTCT TAAATCAAAT AACCAGTTCT ACACTTATAT TTCCACAGGT TTGATTATGA 6060 
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TGTTGCTCTT CCACATCTTT GAGAATATCG GTGCTGTGAC TGGACTACTT CCTTTGACGG 
GGATTCCCTT GCCTTTCATT TCGCAAGGGG GATCAGCTAT TATCAGTAAT CTGATTGGTG 
TTGGTTTGCT TTTATCGATG AGTTACCAGA CTAATCTAGC TGAAGAAAAG AGCGGAAAAG 
TCCCATTCAA ACGGAAAAAG GTTGTATTAA AACAAATTAA ATAAGGAGAA AATCATGGTA 
AAAGTAGCAG TTATATTAGC TCAGGGCTTT GAAGAAATTG AAGCCTTGAC AGTTGTAGAT 
. GTCTTGCGTC GAGCCAATAT CACATGTGAT ATGGTTGGTT TTGAAGAGCA AGTAACGGGT 
TCGCATGCAA TCCAAGTAAG AGCAGATCAT GTCTTTGATG GAGATTTATC AGACTATGAT 
ATGATTGTTC TTCCTGGAGG TATGCCTGGT TCTGCACATT TACGTGATAA TCAGACCTTG 
ATTCAAGAAT TGCAAAGCTT CGAGCAAGAA GGGAAGAAAC TAGCAGCCAT TTGTGCGGCA 
CCAATTGCCC TCAATCAAGC AGAGATATTG AAAAATAAGC GATACACTTG TTATGACGGC 
GTTCAAGAGC AAATCCTTGA TGGTCACTAC GTCAAGGAAA CAGTAGTGGT AGATGGTCAG 
TTGACAACCA GTCGGGGTCC TTCAACAGCC CTTGCCTTTG CCTACGAGTT GGTGGAGCAA 
CTAGGAGGGG ACGCAGAGAG TTTACGAACA GGAATGCTCT ATCGAGATGT CTTTGGTAAA 
AATCAGTAAA ACGGGAGTTA TTCTCTCGTT TTTTATGTGG AAAACTCAGG GAAATCATCG 
CTTTTTTCAT AAAAAAATGC TATAATGAAG GGTATGAAAT ATCACGATTA CATCTGGGAT 
TTAGGTGGAA CTTTACTGGA TAATTATGAA ACTTCAACAG CTGCATTTGT TGAAACATTG 
GCACTGTATG GTATCACACA AGACCATGAC AGTGTCTATC AAGCTTTAAA GGTTTCTACT 
CCTTTTGCGA TTGAGACATT CGCTCCCAAT TTAGAGAATT TTTTAGAAAA GTACAAGGAA 
AATGAAGCCA GAGAGCTTGA ACACCCGATT TTATTTGAAG GAGTTTCTGA CCTATTGGAA 
GACATTTCAA ATCAAGGTGG CCGTCATTTT TTGGTCTCTC ATCGAAATGA TCAGGTTTTG 
GAAATTTTAG AAAAAACCTC TATAGCAGCT TATTTTACAG AAGTGGTGAC TTCTAGCTCA 
GGCTTTAAGA GAAAGCCAAA TCCCGAATCC ATGCTTTATT TAAGAGAAAA GTATCAGATT 
AGCTCTGGTC TTGTCATTGG TGATCGGCCG ATTGATATCG AAGCAGGTCA AGCTGCAGGA 
CTTGATACCC ACTTGTTTAC CAGTATCGTG AATTTAAGAC AAGTATTAGA CATATAAGAA 
AAAGGAATAA GATGACAGAA GAAATCAAAA ATCTGCAGGC ACAGGATTAT GATGCCAGTC 
AAATTCAAGT TTTAGAGGGC TTAGAGGCTG TTCGTATGCG TCCAGGGATG TACATTGGAT 
CAACCTCAAA AGAAGGTCTT CACCATCTAG TCTGGGAAAT TGTTGATAAC TCAATTGACG 
AGGCCTTGGC AGGATTTGCC AGCCATATTC AAGTTTTTAT TGAGCCAGAT GATTCGATTA 
CTGTTGTGGA TGATGGGCGT GGTATCCCAG TCGATATTCA GGAAAAAACA GGCCGTCCTG 
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CTGTTGAGAC 


CGTCTTTACA 


GTCCTTCACG CTGGAGGAAA 


GTTCGGCGGT 


GGTGGATACA 


7860 


AGGTTTCAGG 


TGGTCTTCAC 


GGGGTGGGGT CGTCAGTAGT 


TAATGCCCTT 


TCCACTCAAT 


7920 


TAGACGTTCA 


TGTTCACAAA 


AATGGTAAGA TTCATTACCA 


AGAATACCGT 


CGTGGTCATG 


7980 


TTGTCGCAGA TCTTGAAATA GTTGGAGATA CGGATAAAAC AGGAACAACT GTTCACTTCA 


8040 


CACCGGACCC 


AAAAATCTTC 


ACTGAAACAA CAATCTTTGA 


TTTTGATAAA 


TTAAATAAAC 


8100 


GGATTCAAGA 


GTTGGCCTTT 


CTAAATCGCG GTCTTCAAAT 


TTCAATTACA 


GATAAGCGCC 


8160 


" AAGGTTTGGA 


ACAAACCAAG 


CATTATCATT ATGAAGGTGG 


GATTGCTAGT 


TACGTTGAAT 


8220 


ATATCAACGA 


GAACAAGGAT 


GTAATCTTTG ATACACCAAT 


CTATACAGAC 


GGTGAGATGG 


8280 


ATGATATCAC 


AGTTGAGGTA 


GCCATGCAGT ACACAACTGG 


TTACCATGAA 


AATGTCATGA 


8340 


GTTTCGCCAA 


TAATATTCAT 


ACCCATGAAG GTGGAACACA 


TGAACAAGGT 


TTCCGTACAG 


8400 


CCTTGACACG 


TGTTATCAAC 


GATTATGCTC GTAAAAATAA 


GTTACTGAAA 


GACAATGAAG 


84 60 


ATAATTTAAC 


AGGGGAAGAT 


GTTCGCGAAG GCTTAACTGC 


AGTTATCTCA 


GTTAAACACC 


8520 


CAAATCCACA 


GTTTGAAGGA 


CAAACCAAGA CCAAATTGGG 


AAATAGCGAA 


GTGGTCAAGA 


8580 


TTACCAATCG 


CCTCTTCAGT 


GAAGCTTTCT CCGATTTCCT 


CATGGAAAAT 


CCACAGATTG 




CCAAACGTAT 


CGTAGAAAAA 


GGAATTTTGG CTGCCAAGGC 


TCGTGTGGCT 


GCCAAGCGTG 


8700 


CGCGTGAAGT 


CACACGTAAA AAATCTGGTT TGGAAATTTC 


CAACCTTCCA 


GGGAAACTAG 


o/ou 


CAGACTGTTC 


TTCTAATAAC 


CCTGCTGAAA CAGAACTCTT 


CATCGTCGAA 




8820 


CTGGTGGATC AGCCAAATCT GGTCGTAACC GTGAGTTTCA GGCTATCCTT 


CCAATTCGCG 


8880 


GTAAGATTTT 


GAACGTTGAA 


AAAGCAAGTA TGGATAAGAT 


TCTAGCCAAC 


GAAGAAATTC 


8940 


GTAGTCTTTT 


CACAGCCATG 


GGAACAGGAT TTGGCGCAGA 


ATTTGATGTT 


TCGAAAGCCC 


9000 


GTTACCAAAA 


ACTCGTTTTG 


ATGACCGATG CCGATGTCGA 


TGGAGCCCAC 


ATTCGTACCC 


9060 


TTCTTTTAAC 


CTTGATTTAT 


CGTTATATGA AACCAATCCT 


AGAAGCTGGT 


TATGTTTATA 


9120 


TTGCCCAACC 


ACCAATCTAT 


GGTGTCAAGG TTGGAAGCGA 


GATTAAAGAA 


TATATCCAGC 


9180 


CGGGTGCAGA 


TCAAGAAATC 


AAACTCCAAG AAGCTTTAGC 


CCGTTATAGT 


GAAGGTCGTA 


9240 


CCAAACCGAC 


TATTCAGCGT 


TATAAGGGGC TAGGTGAAAT 


GGACGATCAT 


CAGCTGTGGG 


9300 


AAACAACCAT 


GGATCCCGAA 


CATCGCTTGA TGGCTAGAGT 


TTCTGTAGAT 


GATGTGCAGA 


9360 


AGCAGATAAA 


ATCTTTGATA 


TGTTGATGGG GATCGAGTTG 


TCCTCGTCG 




9409 


(2) INFORMATION FOR SEQ ID NO: 162: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6415 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: 

CCTGGGAAAG TCTTGAAAAT TATGATAGAA TGGTGGAAGG AAAAATTCAG GAGAGTAGTA 60 

GTGACTCAAA ATGTTGAAAG TCTTCTCGTA TCCATTGTAA TCAGTGCATA CAATGAAGAA 120 

AAATATCTGC CTGGTCTAAT TGAAGACTTA AAAAATCAAA CCTATCCTAA AGAGGATATT 180 

GAAATTCTAT TTATAAATGC TATGTCCACA GATGGGACCA CAGCTATCAT TCAGCAATTT 240 

ATAAAGGAAG ATACAGAGTT TAACTCAATT AGATTGTATA ACAATCCTAA GAAAAATCAA 300 

GCTAGTGGTT TTAACCTGGG AGTTAAACAT TCTGTAGGGG ACCTTATTTT AAAAATTGAT 360 

GCTCATTCAA AAGTTACTGA GACTTTTGTA ATGAACAATG TGGCTATTAT TCAACAAGGT 450 
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CACGGATTAC TTGGCAGTTA ATCTACTCCA GACGCAAAAA AGGTAGTGGT GATGGAGAAC 1620 

ACCGTCGGAC CTTCTTGATT GGTGCCGGTG ATGGTGGGGC TCTTTTTATG GATAGTTACC 1680 

AACATCCAAC CAGTGAATTA GAACTGGTCG GTATTTTGGA TAAGGATTCT AAGAAAAAGG 1740 

GTCAAAAACT TGGTGGTATT CCTGTTTTGG GCTCTTATGA CAATCTGCCT GAATTAGCCA 1800 

AACGCCATCA AATCGAGCGT GTCATCGTTG CGATTCCGTC GCTGGATCCG TCAGAATATG 1860 

AGCGTATCTT GCAGATGTGT AATAAGCTGG GTGTCAAATG TTACAAGATG CCTAAGGTTG 1920 
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TTGAGCTGCC CGCAGCTAGT TTCCTAGTTT GCTCTTTGAT TTTCATTGAG TATTACTTCA 3420 

TTTTCTTCTG AAATGGAATT GTTACCCAGT CTATGCTATT GAAAATACGC CAAAACTTCT 3480 

AAGGGTTTGT GAGCGATATA ATCAGGTTGA TAGTTTAGTA GATCTGCTTG CTCTCCAAAT 3540 

CCCCAAGTGA TGGCCAATTT CTGAATACCT GTTTCTCGAG. CTCCCAGCAT ATCAAACTTG 3600 

GTATCTCCGA TGATGATGGC TTGTTCTGGT GCTAGTTGAT GTGTCTGCAA GGCTTGGTGA 3660 

ATGACATCTG CCTTATGGGG TGCTTCAGGG CTAGAACCAT AAATGCCATC AAAGAAATGA 3720 

TGGATTTCCA AGTTTTTTGC CATGTCTTGA GCAGTAGATG TATCCTTTGT CGTGGTGATG 3780 

TAGAGTGGAT AACTGCTCGA TAACTCCTCA AGCAAGTCTA TAATCTGAGG AAAGAGTTGA 3840 

GCTTCATAGA TGCCTTTTGC CTTATAGTAA GAACGATATA TCTGCACGGC TTCAGAAATT 3900 

TGGTCTTTGG ACAGGCAGGT CGCAAAACTA CTTTCGAGAG GTGGTCCCAT AAAACCACGA 3960 

ATAGTTTTGG CATCAGGGCT AGGCACCCCC AGCTCTTTAA AGGTATAGGT AAAGGCATTG 4020 

TGAATCCCGA TAGAACTATC AACGAGGGTT CCATCCAAAT CGAAAAAAAT CGCTGTGATA 4080 

GAGGTCATGG TTTCTCCTAT TTGATAAGCT TATTCTCCGA AAATTTCTTT TTGGAGGCGA 4140 

CGACCAGTAG GGGTGGTAGC GAGTCCACCT TCAGCTGTTT CACGAAAGGC AGTTGGCATG 4200 

CTTGCTCCTA CTTGGTACAT GGCATCGATC ACTTCATCCA CAGGGATTTT AGATTCGATA 4260 

CCTGCCAAGG CCATGTCTGC TGCGATGAAA GCAAAGCTAG CTCCCATGGC ATTACGTTTG 4320 

ACACAGGGAA CTTCGACCAA ACCTGCAACA GGGTCACAGA TGAGGCCTAG CATATTTTTA 4 380 

ATGACAAAGG CAATAGCTTG ACTGGCCTGA TAAGGTGTTC CACCTGCAGC CAGAGTCAAG 4440 

GCGGCAGCAC TCATAGCAGA GGCTGAACCA ACTTCAGCTT GACACCCACC CTCAGCACCT 4500 

GAGATGGAGG CATTGTTTGC GATGACTAGT CCAAAGGCAC CAGCAGCAAA GAGGAAATCC 4560 

AATTGTTGCT CGTGGCTGAG GTCTAATTTT TCAATAGCAG CAGTGAGAAC GGATGGCAGA 4620 

CAGCCAGCAC TTCCAGCGGT TGGAGTGGCA CAGACCAAGC CCATTTTGGC ATTGTGTTCA 4680 

TTGACTGCGA TGGCATTTCG GGCAGCCGAG AGAATCGTAT AATCTGACAG AGTTTTTCCG 4740 

TTTTCGATGT AGTGATCCAA TTTGGCAGCA TCTCCACCTG TCAGGCCACT ACGAGATTTA 4800 

TTTTCATTGA GGCCAAGTTG GACAGAGGCT TTCATAACTT CCAGATTGCG TTCCATGAGA 4860 

AGGAAGACTT CTTCACGTTC GCGACCGGTC AATTCAAACT CTGTTGTAAT CATGAGTTCT 4920 

GCGACATTTC CTTGAAAGTC CAGATCTGCT TGCTCGACCA ATTCTTTGAT AGAATAAAAC 4980 

ATGCTTCCTC CTATTTAAAG AAATTGACAT TGTGGAGATG AGGGATTTTT CGAATTTCTT 5040 

CGATAGCCTC ATCACAGTTG CGACTGTCAA CTTCGATAAT CATAATGGCT TTTTCACCAG 5100 
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CTTTTTCACG AGTGACATTC ATCTGGGCGA TATTGATACC ATAGCGGGAA AGCGCCTCTG 5160 

TAACAAGGGC AATCATACCT GGAATATCTT GATGAACGAT GATGATAGTC GGTGTATTCA 5220 

TATTGAGAGA GACGGCAAAA CCATTGAGTT CGGTTACCTG AATATTTCCT CCACCGATAG 5280 

AAATACCAGT CACGCTGATG GTCTTGTGGG CATTTTTAAC AG T AATTTT A GTGGTGTTAG 5340 

GGTGAGGGGC ATTGCTGTCT TTCTGAATGG TCCAGACAAT CTTGATACCA CGCTTGTGGG 5400 

CAATTTCCAG ACTATTTGGA ATTTCAGGAT CATCTGTATC CATTCCTAAA ATACCTGCAA 54 60 

CAAGGGCTAG GTCTGTTCCG TGACCACGAT AGGTCTTGGC AAATGAGTTA AAAAGTTGGA 5520 

ATTCAACTTC TGTCGGAGTA TCATCAAAAA TGGAAGAGAC AATCTTCCCA ATACGAACAG 5580 

CACCAGCGGT ATGGCTACTA GATGGGCCAA TCATAACTGG TCCGATGATA TCAAAGACAG 5640 

ATTGAAAACG AAGTGATTTC ATCAGTTTCC CCTTATAAAA ATTCTTATCT CTATTATATC 5700 

AAAGAATGAG GGGCTTGGCT TTAATTGTGG ATGAAAACCT TTCTAATACC TCAAATAGCA 5760 

TAAAAATAGT ATCTTTTATG ACAAAAAACA CCTTATTTAG GGAAATAAAA AATAATTTTG 5820 

TAATATTTCT ACATAAAAGT GTCAAGAAAC GGTAATATTT AAAGGGTATG ATAGAACTAT 5880 

AGAAAGAAGG AGAATTTTCG AATATGAAAT CAATAACTAA AAAGATTAAA GCAACTCTTG 5940 

CAGGAGTAGC TGCCTTGTTT GCAGTATTTG CTCCATCATT TGTATCTGCT CAAGAATCAT 6000 

CAACTTACAC TGTTAAAGAA GGTGATACAC TTTCAGAAAT CGCTGAAACT CACAACACAA 6060 

CAGTTGAAAA ATTGGCAGAA AACAACCACA TTGATAACAT TCATTTGATT TATGTTGATC 6120 

AAGAGTTGGT TATCGATGGC CCTGTAGCGC CTGTTGCAAC ACCAGCGCCA GCTACTTATG 6180 

CGGCACCAGC CGCTCAAGAT GAAACTGTTT CAGCTCCAGT AGCAGAAACT CCAGTAGTAA 6240 

GTGAAACAGT TGTTTCAACT GTAAGCGGAT CTGAAGCAGA AGCCAAAGAA TGGATCGCTC 6300 

AAAAAGAATC AGGTGGTAGT ATACAGCTAC AAATGGACGT TATATCGGAC GTTACCAATT 6360 

AACAGATTCA TACCTGAACG GTGACTACTC AGCTGAAAAC CAAGAACGGG TACCG 6415 
(2) INFORMATION FOR SEQ ID NO: 163: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8494 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
{DJ TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: 

TACCCCTTTC GAATTTTGGC AAAAATTCGG TAAGGCTTTG ATGGTAGTTA TCGCGGTTAT 60 

GCCGGCTGCT GGTTTGATGA TTTCAATCGG TAAGTCTATC GTGATGATTA ACCCAACCTT 120 
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TGCACCACTT GTCATCACAG GTGGAATTCT TGAGCAAATC GGTTGGGGGG TTATCGGTAA 180 

CCTTCACATT TTGTTTGCCC TAGCCATTGG AGGAAGCTGG GCTAAAGAAC GTGCTGGTGG 240 

TGCTTTCGCC GCTGGTCTTG CCTTCATCTT GATTAACCGT ATCACTGGTA CAATCTTTGG 300 

TGTATCAGGC GATATGTTGA AAAATCCAGA TGCTATGGTA ACTACTTTCT TTGGTGGTTC 360 

AATCAAAGTT GCTGATTACT TTATCAGTGT TCTTGAAGCT CCAGCCTTGA ACATGGGGGT 420 

ATTCGTAGGG ATTATCTCAG GTTTTGTAGG GGCAACTGCT TACAACAAAT ACTACAACTT 4 80 

CCGTAAACTT CCTGATGCAC TTTCATTCTT CAACGGGAAA CGTTTCGTAC CATTTGTAGT 540 

TATTCTTCGT TCAGCAATCG CTGCAATTCT ACTTGCTGCT TTCTGGCCAG TAGTTCAAAC 600 

AGGTATCAAT AACTTCGGTA TCTGGATTGC CAACTCACAA GAAACTGCTC CAATTCTTGC 660 

ACCATTCTTG TATGGTACTT TGGAACGTTT GCTCTTGCCA TTTGGTCTTC ACCACATGTT 720 

GACTATCCCA ATGAACTACA CAGCTCTTGG TGGTACTTAT GACATTTTAA CTGGTGCAGC 780 

TAAAGGTACT CAAGTATTCG GTCAAGACCC ACTATGGCTT GCATGGGTAA CAGACCTTGT 840 

AAACCTTAAA GGTACTGATG CTAGTCAATA TCAACACTTG TTAGATACAG TACATCCAGC 900 

TCGTTTCAAA GTTGGACAAA TGATCGGTTC ATTCGGTATC TTGATGGGTG TGATTGTTGC 960 

TATCTACCGT AATGTTGATG CTGACAAGAA ACATAAATAC AAAGGTATGA TGATTGCAAC 1020 

AGCTCTTGCA ACATTCTTGA CAGGGGTTAC TGAACCAATC GAATACATGT TCATGTTCAT 1080 

CGCAACACCT ATGTATCTTG TTTACTCACT TGTTCAAGGT GCTGCCTTCG CTATGGCTGA 1140 

CGTCGTAAAC CTACGTATGC ACTCATTCGG TTCAATCGAG TTCTTGACTC GTACACCTAT 1200 

TGCAATCAGT GCTGGTATTG GTATGGATAT CGTTAACTTC GTTTGGGTAA CTGTTCTCTT 1260 

TGCTGTAATC ATGTACTTTA TCGCAAACTT CATGATTCAA AAATTCAACT ACGCAACTCC 1320 

AGGGCGCAAC GGAAACTACG AAACTGCTGA AGGTTCAGAA GAAACCAGCA GCGAAGTGAA 1380 

AGTTGCAGCA GGCTCTCAAG CTGTAAACAT TATCAACCTT CTTGGTGGAC GTGTAAACAT 1440 

CGTTGATGTT GATGCATGTA TGACTCGTCT TCGTGTAACT GTTAAAGATG CAGATAAAGT 1500 

AGGAAATGCA GAGCAATGGA AAGCAGAAGG AGCTATGGGT CTTGTCATGA AAGGACAAGG 1560 

GGTTCAAGCT ATCTACGGTC CAAAAGCTGA CATTTTGAAA TCTGATATCC AAGATATCCT 1620 

TGATTCAGGT GAAATCATTC CTGAAACTCT TCCAAGCCAA ATGACTGAAG CACAACAAAA 1680 

CACTGTTCAC TTCAAAGATC TTACTGAGGA AGTTTACTCA GTAGCAGACG GTCAAGTTGT 1740 

TGCTTTGGAA CAAGTAAAGG ATCCAGTATT TGCTCAAAAA ATGATGGGTG ATGGATTTGC 1800 

AGTAGAACCT GCAAATGGAA ACATTGTATC TCCAGTTTCA GGTACTGTGT CAAGCATCTT I860 
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CCCAACAAAA 


A 1 UV. Ill 1 Vj 


u 1 A 1 l\j 1 viAL 


GUAAGCAGGT 


CTTGAAGTAT 


TGGTTCACAT 


1920 


TGGTTTGGAC 


nV-rtVj l hnu 1 L 


1 IvjAAvjVj I AA 


ta a mmm tv x 

ALCATTTACA 


G TTCATGTTG 


CTGAAGGACA 


1980 


AAAAGTTGCA 


VjLnVjVinun 1 L 


1 L.L 1 i\j 1 L. AL 


ALjL I GAL ™rG 


GATGCTATCC 


GTGCAGCAGG 


2040 


ACGTGAAACT 


1 LnnLnu 1 Avt 


1 lulLl 1L.AL- 


iv 7v Ti rn/^/^rrv^ tv m 
AAA A GG TGA T 


r> » » rrvn tv tv tv 

GCAATTAAAT 


CAGTTAAGTT 


2100 


AGAAAAAACA 




PAPpfpa imp 
L.ALjL. 1 AAAAL 


AGCAG1 IGCT 


ta l\ T\ Ofti iv tv iv rn 

AAAGTAGAAT 


TGTAATATAC 


2160 


TTGAGGTTGG 


AAfiPfpTATT 


LLAALLl HI 


tv mm mmv" < k o 

ATTTTGGGAG 


AAAAGAATGA 


AATTTTTAAC 


2220 


ACTCAATACT 


P AP AP/PTfifi A 


1 WiAbAAAIjA 


TV r^f~* TV TV Z"* 1 ^ TV TV 

AGL AG AGG AA 


TV. IV IV fTWTl^O TV ^ TV 

AAATTCCAGA 


TTTTGCTTGA 


2280 


AGATATTCTT 


(i A A A & f2P a PT> 


A rt»A 1 1 luA 1 


TTGTTTTCAA 


/"> TV TV tv m/^ TV TV m/t 

GAAATCAATC 


AGGAGATGAC 


2340 


v— x v»vj i ^rtuAU 


uiouAuui 1A 


A IXj ACCTTTA 


TCAAGCTTTG 


CCAGCAGCTG 


AGCCTATTCA 


2400 


CPAAGAPPA'P 


Inlul 1 AuAL 


TCTTGGTTGA 


AAAGTTGTCT 


GAGCAAGGGA 


AAAATTACTA 


2460 


L 1 uunUW loo 


PPPTR H>TV TV /"»0 

LjL.L. 1 A I AAL.C 


ATATCGGCTA 


TAACCGCTAC 


CACGAAGGTG 


TGGCTATCTT 


2520 


Oil l/UvuiLA 


PPTiTVPfti^TV jv 


CCAGAGAAAT 


TTTGGTTTCA 


GATGTGGATG 


ATCCAACAGA 


2580 


pt & tv* & t a pt 

LI Al 1 AL 1 


LuL.L.G TGTTG 


CCCTAGCTGA 


AACTGTAGTC 


GATGGCAAGG 


AGCTAGCAGT 


2640 


r> p tv p mp mm 
l\>V.Lnu lull 


/■> ta m m^nw niti 

I LTCTCTT 


GGTGGGATAA 


AGGTTTCCAA 


GAAGAATGGG 


CACGATTTGA 


2700 


VtviL.1u1L.JI l(j 


AAAAAATTGA 


ACAAGCCACT 


TTTACTAGCT 


GGAGATTTCA 


ACAATCCGGC 


2760 


louALAuuAA 


GGTTACCAAG 


CTATTTTAGC 


TAGTCCATTA 


GGCTTACAAG 


ACGCATTTGA 


2820 


Af2 r nv2P m P a & 


P A IV TV TV IV Offl/l 

uAGAAAAGTG 


GTAGCTATAC 


TGTTCCGCCT 


GAAATTGATG 


GCTGGAAAGG 


2880 


flAArAPTfit & 
1 uAA 


LL.L.L TTCGAA 


TCGATTATGT 


CTTTACTACC 


AAAGAGTTAG 


CGGTGGAAAA 


2940 




ulAl 1 IajAI u 


GTAACAAGAG 


TCCACAAGTG 


AGTGATCACT 


ATGGCTTGAA 


3000 


TGCTATATTA 


a a ptp.p aR&Ti 


AATAACTGAA 


AAGAGGTTGG 


fv Tv ^ifn tv nn ta tv tx iv 

AACTATAAAA 


TTCCAGCCTT 


3060 


TTCTTAPTAfi 


Afi A AfiPT 21 p*p 


GGAAATAGCC 


TAAATAAGTG 


AGACTACTGT 


AATGGAATAA 


3120 


AATATGGTAT 


AATTGATAAR 


GTAGATAGAA TCGAGGATGT 


TATGTCATTT 


TV /-»/"• x *v TV mrtvp/-» 
AL.L>AAA i 1 I L 


3180 


AATTTAAAAA 


CTATATTAGA 


GAAGCCTTGA AGGAGTTAAA 


ATTTACAACT 


CCAACAGAGG 


3240 


TGCAAGACAA 


GTTGATTCCT 


ATTGTTTTGG 


CAGGTCGTGA 


CCTAGTAGGA 


GAATCAAAAA 


3300 


CAGGTTCAGG 


TAAGACTCAT 


ACTTTCTTGT 


TACCGATTTT 


CCAGCAATTA 


GATGAAGCTA 


3360 


GCGATAGTGT 


ACAAGCAGTG 


ATTACTGCAC 


CGAGTCGTGA 


GTTGGCTACT 


CAAATTTACC 


3420 


AAGTAGCGCG 


TCAGATTTCA 


GCTCACTCAG 


ATGTCGAAGT 


TCGTGTGGTT 


AATTATGTGG 


3480 


GTGGTACGGA 


TAAGGCTCGC 


CAGATTGAGA AATTGGCAAG 


CAATCAGCCT 


CATATTGTTA 


3540 


TTGGAACACC 


AGGCCGTATC 


TACGACTTGG 


TTAAATCTGG 


TGATTTAGCT 


ATTCATAAAG 


3600 


CCAAGACATT 


TGTTGTTGAT 


GAAGCAGATA 


TGACCTTGGA 


TATGGGATTC 


TTGGAAACTG 


3660 
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TTGATAAGAT TGCTGGCAGT 


1 CTTCCAAAAG 


ACTTGCAATT 


' CATGGTCTTC 


TCAGCGACTA 


3720 


TCCCACAAAA ACTGCAACCA 


TTCTTGAAAA 


AATACTTATC 


AAATCCTGTT 


ATGGAGAAAA 


3780 


TTAAGACCAA AACGGTTATT 


TCTGACACCA 


TTGATAATTG 


GTTGATTTCG 


ACCAAGGGAC 


3840 


ATGATAAGAA TGCTCAAATT 


TACCAGTTGA 


CTCAGTTGAT 


GCAGCCGTAT 


TTGGCAATGA 


3900 


TTTTTGTTAA CACTAAAACG 


CGTGCTGATG 


AATTGCATTC 


ATATCTGACT 


GCTCAAGGCT 


3960 


, TGAAGGTTGC AAAAATCCAT 


GGCGATATTG 


CCCCTCGTGA 


ACGCAAGCGA 


ATCATGAATC 


4020 


AGGTGCAAAA TCTGGATTTT 


GAGTATATTG 


TCGCAACAGA 


TTTGGCAGCG 


CGTGGGATTG 


4080 


ACATTGAAGG TGTCAGCCAT 


GTCATCAATG 


ATGCCATTCC 


GCAAGACTTA 


TCTTTTTTTG 


4140 


TTCATCGTGT TGGTCGTACT 


GGACGAAATG 


GCCTACCAGG 


TACAGCTATT 


ACCCTTTATC 


4200 


AGCCAAGTGA TGACTCGGAT 


ATCCGTGAGT 


TGGAGAAATT 


GGGAATCAAG 


TTTAGTCCTA 


4260 


AGATGGTCAA AGACGGGGAA 


TTTCAAGATA 


CCTATGACCG 


TGATCGTCGT 


GCCAACCGTG 


4320 


AGAAAAAACA AGATAAACTT 


GATATCGAAA 


TGATTGGTTT 


GGTTAAAAAG 


AAAAAGAAAA 


4380 


AAGTCAAACC GGGTTATAAG 


AAGAAAATTC 


AATGGGCGGT 


TGATGAAAAG 


CGCCGTAAAA 


4440 


CCAAGCGTGC TGAAAATCGC 


GCTCGCGGTC 


GTGCAGAGCG 


TAAAGCTAAA 


CGCCAAACAT 


4500 


TTTAATAGAA ATTGTTGGAG 


TATTGAGCTC 


CAACTTTTTT 


ATTTATGAGA 


ACGAACTATC 


4560 


TAAACCGAAA CACTACATTA 


AAGACTGCAA 


ATTGCGATTA 


AAAATGGTAT 


AATGATAAAG 


4620 


TTATATAGTC CCGATAAGAT 


GGTAGGTATT 


TATTACGAAG 


AGTTTTCCTA 


TCAGTACTTT 


4680 


GTAACTCTAT AACAATATTT 


TTTAAGGGGG 


GACATTTTTA 


TGTCAGAGCG 


TAAATTATTC 


4740 


ACGTCTGAAT CTGTATCTGA 


GGGGCATCCG 


GATAAGATTG 


CAGACCAAAT 


TTCAGATGCG 


4800 


ATTTTGGATG CTATTTTAGC 


AAAGGATCCA 


GAGGCGCACG 


TTGCTGCTGA 


AACAGCTGTA 


4860 


TATACTGGTT CTGTCCACGT 


TTTTGGTGAA 


ATTTCTACAA 


ATGCCTATGT 


GGATATTAAC 


4920 


CGTGTGGTTC GTGATACCAT 


TGCAGAGATT 


GGTTATACCA 


ATACAGAATA 


TGGATTTTCT 


4980 


GCTGAGACGG TGGGAGTACA 


CCCATCTTTG 


GTGGAACAAT 


CTCCTGACAT 


CGCTCAAGGT 


5040 


GTTAACGAAG CCTTGGAGGT 


TCGTGGAAAT 


GCTGATCAAG 


ATCCACTGGA 


CTTGATTGGA 


5100 


GCAGGTGACC AAGGGCTCAT 


GTTTGGATTT 


GCAGTAGATG 


AAACAGAAGA 


GCTTATGCCA 


5160 


TTGCCAATTG CACTCAGTCA 


TAAATTGGTT 


CGTCGTCTGG 


CAGAACTTCG 


TAAGTCTGGA 


5220 


GAAATTAGCT ATCTCCGTCC 


AGATGCAAAA 


TCACAAGTTA 


CAGTTGAGTA 


CGATGAAAAT 


5280 


GACCGTCCGG TACGTGTAGA 


TACAGTCGTT 


ATTTCTACTC 


AGCATGATCC 


AGAGGCCACT 


5340 


AATGAACAAA TCCATCAAGA TGTGATTGAC AAGGTCATCA AAGAAGTTAT TCCATCTTCT 


5400 
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TATCTTGATG 


ATAAGACAAA 


ATTCTTTATC 


AATCCGACAG 


GTCGTTTTGT 


AATCGGTGGT 


5460 


CCTCAAGGGG 


ACTCAGGTTT 


GACTGGTCGT 


AAGATTATTG 


TAGATACTTA 


TGGTGGCTAC 


5520 


TCTCGTCATG GTGGTGGTGC CTTCTCTGGT AAAGATGCGA CTAAGGTGGA TCGTTCAGCC 


5580 


TCTTATGCGG 


CTCGCTATAT 


TGCCAAGAAT 


ATCGTTGCAG 


CAGACCTTGC 


TAAGAAGGCA 


5640 


GAAGTGCAGT 


TGGCCTATGC 


TATCGGTGTT 


GCGCAACCTG 


TTTCTGTTCG 


TATCGATACT 


5700 


TTCGGTACAG GAACAGTAGC 


TGAAAGTCAA 


CTTGAAAAAG 


CGGCTCGTCA 


AATCTTTGAC 


5760 


CTTCGCCCTG 


CAGGGATTAT 


CCAAATGCTG 


GACCTCAAGC 


GTCCAATTTA 


CCGTCAAACA 


coon 
JOZU 


TCGGCTTACG 


GTCACATGGG 


ACGTACAGAT 


ATTGATCTTC 


CATGGGAACG 


TTTGGATAAG 


coon 

JOOU 


GTAGATGCTT 


TGAAAGAAGC 


AGTAAAATAA 


GATTTTAAGA 


GGGGAACGTC 


CTCTCTTTTT 




TATAGTTTTT 


AACTATACTG 


GGATACTGTT 


CTGAAAATCC 


ATTTTGCGAA 


AGTAGAGATT 


£ Ann 

bUUU 


TACATGTATA 


GTAGATTGAA 


ACTAGAATAG 


TACACCTCAA 


CTTCTAAAAC 


ATTGTTAGCA 


6060 


ATCAATTTGA 


CTGTCCTGAT 


CGATTTCTCC 


TGTTCTTGTT 


TCATTTTACT 


ATATTTCTTT 


C 1 1 A 

biz (J 


AAAAATGATA 


AAGGTTAAGA 


TTTCTCCTCG 


TAATAGATAA 


TCTTGGGGAT 


ATTTCAATCC 


ci on 


AAAGTTTTAT 


TCGTTATCAC 


TTGACTATTG 


CAAGGTTTTC 


TAGAGCAACA 


GAGTCATGGA 




ATGGACTCAT 


GGTTGAGATT 


TCTCCTTGTT 


GCTTGGACTT 


CATTCAAAAG 


TCTGTTACCC 


O JUU 


AAGCCTTGTT 


CAAACTTCTA ATACACTAGC 


TGTTTCCATA 


GCATGACTTC 


TGTACTAGAC 


DJDU 


TTTCTTTTCC 


GAATAAATAG 


ATAGAACCAC 


AGAATCTAGT 


AAACCTAGAA 


TTAAAATTAT 


6420 


GGTATAATAT 


TAGCAATAAA 


AGAAATCTGG 


AGGATTAGAA 


TCATGGTATC 


AACGAAAACA 


6480 


CAAATTGCTG 


GTTTTGAGTT 


TGACAATTGC 


TTGATGAATG 


CAGCAGGTGT 


GGCTTGTATG 


6540 


ACGATAGAGG 


AGTTAGAAGA GGTCAAAAAC 


TCAGCGGCAG 


GAACCTTTGT 


TACTAAGACA 


6600 


GCGACCTTGG 


ACTTCCGTCA 


GGGGAATCCT 


GAGCCACGCT 


ACCAAGATGT 


TCCACTTGGT 


6660 


TCCATCAACT 


CTATGGGCTT 


GCCAAATAAT 


GGCTTAGACT 


ATTATTTGGA 


TTATCTTTTA 


6720 


GATTTGCAGG 


AAAAAGAGTC 


GAACCGAACT 


TTCTTCTTAT 


CTCTGGTCGG 


CATGTCTCCA 


6780 


GAGGAAACCC 


ATACTATTTT 


GAAAAAAGTC 


CAAGAGAGTG 


ATTTTCGTGG 


TCTGACTGAG 


6840 


CTAAATCTTT 


CCTGTCCAAA 


TGTTCCAGGT 


AAACCTCAGA 


TTGCCTATGA 


TTTTGAGACA 


6900 


ACAGACCGGA 


TTTTGGCAGA 


AGTGTTTGCT 


TACTTCACCA 


AACCTCTTGG 


AATTAAATTG 


6960 


CCACCTTATT 


TTGATATTGT 


TCACTTTGAC 


CAAGCGGCAG 


CTATTTTCAA 


CAAATATCCG 


7020 


CTCAAGTTTG 


TCAACTGCGT 


TAACTCTATC 


GGAAACGGCC 


TCTATATAGA 


AGACGAATCT 


7080 


GTCGTTATTC 


GGCCTAAGAA 


TGGTTTTGGT 


GGAATTGGTG 


GAGAATACAT 


CAAACCGACT 


7140 


GCTTTAGCCA 


ATGTTCACGC 


CTTTTATCAA 


CGTTTAAATC 


CTCAAATCCA 


AATTATCGGA 


7200 
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ACAGGTGGCG TTCTGACTGG TCGAGATGCC TTTGAACACA TCCTCTGTGG AGCAAGTATG 7260 

GTGCAGGTGG GAACGACCCT TCACAAAGAA GGCGTCAGTG CTTTTGACCG CATTACCAAT 7320 

GAACTGAAAG CAATCATGGT GGAAAAAGGC TACGAGAGCT TAGAAGATTT CCGTGGGAAA 7380 

TTGCGCTATA TTGACTAAAT TAAATCGAAA AATCTGAAGA AAGGAGAGAC GATGCTAGCC 7440 

ATTGAAGAAA GTCAGAAGTT GACTTTATCA AATTTACCGA GCCTGAGCCT ATTTACAGGG 7500 

ACAGATCAGG GTCAGTTTGA AGTGATGAAG AGTCAAATGT TGAAACAGAT TGGGTATGAT 7560 

TCTGCTGACC TCAACTTTGC CTACTTTGAT ATGAAAGAAG TAGTTTACAA GGATGTGGAA 7620 

CTGGAGTTGG TCAGCCTTCC TTTCTTTGCG GATGAGAAAA TCGTGATATT AGATTATTTT 7680 

ATGGATATCA CGACTGCTAA GAAACGCTTT TTGACAGATG ATGAGCTTAA GTCATTTGAG 7740 

GAATACCTTG ACAATCCTTC TCCAACAACC AAGTTGATAA TCTTTGCAGA AGGAAAGCTG 7800 

GATAGCAAAA GACGGTTAGT CAAATTACTT AAGCGTGATG CCAAGGCCTT CGATGCAGTA 7860 

GAAGTAAAAG AACAAGAATT GCGCCAGTAC TTCCAAAAGT GGAGTCAGAA ACAAGGTCTG 7920 

CAGTTTACCA ATCATTCTTT TGAAAATCTC CTCATCAAGT CGGGGTTTCA ATTTAGCGAA 7980 

ATCCAGAAAA ATCTTCTCTT TTTACAGTCC TATAAGGCGA ATTCTGTTAT TGAGGAAGAG 8040 

GATATTGTTA ACGCAATTCC CAAGACTTGC AGGACAATAT TTTTGATTTA ACTCAGTTTA 8100 

TTCTGACTAA AAAGATGGAT CAGGCGCGCG ATTTGGTGAG AGACTTGACC TTGCAAGGGG 8160 

AAGATGAAAT CAAACTGATT GCAGTCATGC TGGGACAATT TCGGACTTTT ACTCAGGTGA 8220 

AGATTTTGGC GGAGTCTGGC CAAACAGAAT CGCAGATTGC AAGTAGTTTA GGTAGTTATC 8280 

TGGGACGTAA CCCAAATCCT TATCAAATCA AGTTTGCATT AAGAGATTCG AGAGGACTTT 8340 

CTTTGAGCTT TTTGAAGCAA GCTATTTCCT ATTTGATTGA GACAGACTAT CAGATTAAGA 8400 

CAGGTCTTTA TGAAAAAGGT TTCCTTTTTG AAAAGGCACT CTTACAGATT GCTAGTCAGG 84 60 

TCAATTGACA TTTGTTGAAA CTACTAACCC GCGG 8494 
(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9707 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 

CCGGTCAGTT CGTTCAGTAC AAGGAATCAT AATGAACGAT CAATCAGAAA AAAAGACTAG 60 
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AAAGAAGACT 


' GTATGGATAA 


> TCGACCAATT GGTTTTTTGG 


1 ATTCGGGTGT 




1 on 


ACCGTTGTGC 


GCGAGCTCAT 


GCGCCAGCTT CCCCATGAAG 


AAATCGTCTA 


1 A 1 I UVj Alj A l 


loU 


TCGGCGCGGG 


CGCCCTATGG 


CCCCCGTCCT GCTGAGCAAA 


TTCGTGAATA 


i At- 1 TCaJCAG 


240 


CTGGTCAACT 


TTCTCTTGAC 


CAAGGATGTC AAAATGATTG 


TCATTGCTTG 


1 AAl_ At_ IajCvj 


300 


ACTGCGGTCG 


TCTGGGAAGA AATCAAGGCT CAACTAGATA TTCCTGTCTT 


/-* /~*rr\/ m * fry ft ft mm 


360 


TTGCCAGGAG 


CTTCGGCAGC 


CATCAAGTCC AGTCAAGGTG 


GGAAAATCGG 


AGTGATTGGA 


420 


ACGCCCATGA 


CGGTACAATC 


AGACATATAC CGTCAGAAAA 


TCCATGATCT 


VsQjATCCCGAC 


480 


TTACAGGTGG 


AGAGCTTGGC 


CTGTCCCAAG TTTGCTCCCT 


TGGTTGAGTC 




540 


TCAACCAGTG 


TTACCAAGAA 


GGTGGTCTAT GAAACCCTGC 


GTCCCTTGGT 


IwaAAAGGTG 


600 


GATAGCCTGA 


TTTTGGGCTG 


TACTCATTAT CCACTCCTTC 


GCCCTATTAT 


(-v_AAAATGTG 


660 


ATGGGGCCAA 


AGGTTCAGCT 


CATCGATAGT GGGGCAGAGT 


GCGTACGGGA 


TATCTCAGTC 


720 


TTACTCAATT 


ATTTTGAAAT 


CAATCGTGGT CGCGATGCTG 


GACCACTCCA 


TCACCGTTTT 


780 


TACACAACAG 


CCAGTAGCCA 


AAGTTTTGCA CAAATTGGTG 


AAGAATGGCT 


GGAAAAAGAG 


840 


ATTCATGTGG 


AGCATGTAGA 


ATTATGACAA ATAAAATTTA 


TGAATATAAG 


CsATCjACCAGG 


900 


ACTGGTATGT 


TGGGTCTTAT 


AGTATTTTTG GTGGCGTTAA 


CAGTTTGAGC 


QjACTATAAGA 


960 


CAGATTTTCC 


TCTGTTTGAA 


TTCTCCAAAA TATTTGGAGA 


TGAAGAGTAT 


111 LLLuL 


1020 


TTTCAGTTAC 


TGTTTTACGC 


TATGGTTCTA TCTACCGTTT 


GTTCTCCTTT 


Li rtiGTAGAC A 


1080 


TGCTTAATCA 


AGAAATGGGA 


uunnni, I 1 KjKj MiVo I 1 A I 1 


AtJJj i t_A 1 AjOvj 


(jCCCTGCTCT 


1140 


TGGTTGAAAA 


TGGGCAACTC 


TTGTATGTAG AATTGCCTAA 


AGAAGGGGTC 


AA 1\j ITCATG 


1200 


ATTTCTTTGA 


GACAAGCAAG 


GTCAGAGAAA CCTTGTTGAT 


TGCGACTCGT 


A/V\_ vj/\Ao<j 1 A 


1260 


AAACCAAGGA 


ATTCCGAGCT 


ATCTTTGATA AGTTAGGCTA 


CGATGTGGAA 


& A TVTUT'Ti TV TV* 
AA XV 1 i AA 1 \j 


1320 


ACTACCCTGA 


CCTGCCTGAA 


GTAGCAGAAA CAGGTATGAC 


CTTTGAAGAA 






IX AAGGC AGA 


AACCATTTCT 


CAATTAACGG GCAAGATGGT 


TTTGGCAGAT 


GATTCTGGTC 


1440 


TCAAAGTCGA 


TGTCCTTGGT 


GGCTTACCAG GCGTCTGGTC 


AGCTCGTTTC 


GCAGGTGTGG 


1500 


GAGCAACTGA 


CCGTGAAAAT 


AATGCCAAAC TCTTGCACGA 


ATTGGCCATG 


GTCTTTGAAC 


1560 


TCAAGGACCG 


CTCGGCTCAG 


TTCCACACAA CCCTAGTCGT 


AGCCAGCCCA 


AATAAGGAAA 


1620 


GTTTAGTTGT 


TGAAGCAGAC 


TGGTCAGGTT ATATTAACTT 


TGAACCTAAG 


GGTGAAAATG 


1680 


GCTTTGGCTA 


TGATCCCCTC 


TTCCTTGTAG GAGAAACAGG 


TGAGTCATCA 


GCTGAATTAA 


1740 


CCCTGGAAGA 


AAAAAATAGT 


CAATCTCACC GTGCCTTAGC 


CGTTAAGAAA 


CTTTTGGAGG 


1800 


TATTTCCATC 


ATGGCAAAGC 


AAACCATCAT TGTAATGAGC 


GATTCCCATG 


GCGATAGCTT 


1860 
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GATTGTGGAA GAAGTCCGTG ATCGCTATGT GGGCAAAGTC GATGCTGTTT TTCATAACGG 1920 

CGATTCTGAA CTACGTCCGG ATTCTCCACT TTGGGAGGGC ATCCGCGTTG TTAAAGGGAA 1980 

CATGGACTTC TACGCCGGCT ACCCAGAACG TCTGGTGACT GAGCTTGGTT CGACCAAGAT 2040 

TATCCAAACT CATGGTCACT TGTTTGACAT CAATTTCAAC TTTCAAAAGT TGGACTACTG 2100 

GGCTCAGGAG GAAGAGGCCG CTATCTGCCT CTATGGTCAC TTGCATGTGC CAAGTGCTTG 2160 

GTTGGAAGGC AAGATCCTCT TTCTAAATCC AGGTTCTATC AGTCAACCAC GAGGTACCAT 2220 

CAGAGAATGT CTCTATGCTC GTGTGGAGAT TGATGATAGT TACTTCAAAG TGGACTTTTT 2280 

GACACGAGAT CACGAGGTGT ATCCAGGTTT GTCCAAGGAG TTTAGCCGAT GATTGCCAAG 2340 

GAGTTTGAGA CTTTCTTGTT GGGGCAGGAG GAAACTTTTT TGACCCCTGC TAAAAATCTA 2400 

GCTGTGTTGA TTGATACCCA CAATGCGGAT CATGCGACCC TCTTGCTCAG TCAGATGACC 2460 

TATACCCGTG TTCCCGTTGT GACAGATGAA AAACAGTTTG TTGGGACGAT TGGACTCAGA 2520 

GATATTATGG CTTATCAGAT GGAGCATGAC TTGAGCCAAG AAATCATGGC GGATACGGAT 2580 

ATCGTTCATA TGACAAAAAC GGACGTAGCG GTTGTTTCGC CTGATTTCAC CATTACGGAG 2640 

GTCTTGCACA AGCTAGTAGA TGAGTCCTTC TTACCGGTTG TGGATGCAGA GGGTATTTTC 2700 

CAAGGGATTA TTACGCGCAA GTCCATCCTC AAGGCCGTTA ATGCCCTCTT GCATGACTTT 2760 

AGTAAGGAAT ATGAGATTCG ATGCCAATGA GAGACAGGAT TTCAGCCTTT TTAGAGGAAA 2820 

AGCAGGGCTT GTCTGTCAAT TCCAAGCAGT CCTATAAGTA TGATTTGGAG CAATTTTTAG 2880 

ACATGGTAGG TGAGCGGATT TCTGAGACCA GTCTCAAGAT TTACCAAGCC CAGCTAGCCA 2940 

ATCTAAAAAT CAGCGCCCAG AAGCGAAAGA TTTCGGCCTG TAACCAATTT CTATACTTTC 3000 

TCTATCAAAA AGGAGAGGTG GACAGCTTTT ACCGCTTGGA AT TAG CC AAA CAAGCTGAAA 3060 

AGAAGACGGA AAAGCCAGAG ATTCTATACC TAGACTCTTT TTGGCAGGAA AGCGACCATC 3120 

CAGAGGGCCG CTTGCTAGCG CTCTTAATCC TAGAAATGGG GCTCTTGCCC AGTGAGATTT 3180 

TAGCCATCAA GGTTGCGGAC ATCAATCTGG ATTTTCAGGT GTTGCGAATC AGCAAGGCTT 3240 

CCCAACAGAG GATTGTCACC ATTCCCACGG CCTTGCTTTC AGAATTGGAA CCCTTGATGG 3300 

GGCAGACCTA TCTTTTTGAA AGAGGAGAGA AACCCTATTC TCGTCAGTGG GCCTTTCGTC 3360 

AGTTAGAATC TTTTGTCAAG GAGAAAGGTT TTCCATCCTT ATCAGCTCAA GTCTTACGTG 3420 

AACAGTTTAT TCTAAGACAA ATAGAAAACA AGGTCGATTT GTACGAAATT GCAAAAAAAT 3480 

TAGGATTAAA AACAGTCCTG ACCTTAGAAA AATATAGATA ATGGATATTA AATTAAAAGA 3540 

TTTTGAAGGA CCCCTGGACT TGCTCTTGCA TCTGGTTTCT AAGTACCAGA TGGATATCTA 3600 
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CGATGTGCCC 


ATTACGGAAG 


! TCATCGAACA GTATPTAnrT 


TATGTCTCAA CCCTGCAGGC 


3660 


CATGCGTCTG 


GAAGTGACGG 


GTGAGTACAT GGTCATGGCT 


AGTCAGCTCA 


. TGCTGATTAA 


1 "71 A 


GAGTCGTAAA 


CTCCTTCCGA 


AGGTAGCAGA AGTGACAGAC 


TTGGGGGATG 


ACCTGGAGCA 


J /ou 


GGACCTCCTC 


TCTCAAATCG 


AAGAATATCG CAAGTTCAAG 


CTCTTGGGTG 


AGCACTTGGA 




AGCCAAGCAC 


CAAGAACGGG 


CCCAGTATTA TTfTAAAfiPf: 


CCGACAGAGT 


TGATTTACGA 


3900 


AGATGCGGAG 


CTTGTGCATG 


ACAAGACGAf* PATTRArrW 


TTTTTGACTT 


TTTCAAATAT 


3960 


CCTAGCCAAG 


AAAAAAGAGG 


AGTTTGCAPA AAATf 2LChfYZ 


ACGATCTTGC 


GGGATGAGTA 


4020 


TAAGATTGAG 


GACATGATGA 


■» i-njv,\ji unn nUnu X X X\j 


ATTGGACGAG 


ATCAATTGCG 


4080 


CTTGCAGGAT 


TTGTTCAAGG 


AAfW"PAf2AA T t r"VTT , TL RPAP 


GTCATCACCC 


TCTTTTTGGC 


4140 


AACCCTAGAG 


TTAATCAAAA 


C % CC7^fZtZh.fz r P f P ^'2A*TW ,, *Tv ,, ^ , ^v , 

\* V- UUAVtU X X UA A\» V. i (j lu 


CAAGAGGAGA 


GTTTTGGAGA 


4200 


TATCTATCTC 


ATGGAAAAGA 


Af^f^A Af?A llf nvn Xf , TV , /" , r*'P 
nuunnunnAu lUyUjlvA.Li 


CAAAGCTAGA 


CTTGATAGAG 


4260 


AGGAAAGATG 


AGTACTTVAf: 


^.AAAAA 1 AuA AGCGCTCTTG 


TTTGTAGCGG 


GTGAAGATGG 


4320 


GATTCGGGTC 




\- l\jAAt 1 x v-TCTCTGCCA 


CCGACAGGCA 


TCCAGCAAAG 


4380 


TTTAGGAAAA 




A Li i A 1 GAAAA GG AC C C AG AT 


TCCAGTTTGG 


CTTTGATTGA 


4440 


GACAAGTGGT 


GCTTATAGAT 


TGGTGACCAA GCCTCAATTT 


GCAGAGATTT 


TGAAGGAATA 


4500 


CTCTAAGGCG 


CCTATCAACC 


AGAGCTTGTC TCGGGCTGCC 


CTTGAGACCT 


TGTCCATTAT 


4560 


TGCCTACAAA 


CAGCCGATTA 


CGCGGATAGA AATTGATGCC 


ATCCGTGGAG 


TTAACTCGAG 


4620 


TGGAGCCTTG 


GCAAAGTTGC 


AGGCTTTTGA CCTGATAAAG 


GAAGACGGGA 


AAAAGGAAGT 


4680 


ATTGGGGCGC 


CCCAACCTCT 


ATGTGACTAC GGATTATTTC 


CTAGATTACA 


TGGGGATAAA 


4740 


CCATTTAGAA 


GAATTACCAG 


TGATTGATGA GCTTGAGATT 


CAAGCCCAAG 


AAAGCCAATT 




ATTTGGTGAA 


AGGATAGAAG 


AAGATGAGAA TCAATAAGTA 


TATTGCCCAC 


GCAGGTGTGG 


4ooU 


CCAGTAGGAG 


AAAAGCAGAA GAGCTGATTA AGCAAGGCTT GGTGACGGTT AACGGCCAAG 




lub IajCvj xajA 


ACTAGCAACC 


ACTATCAAGT CAGGCGACAA 


GGTCGAAGTT 


GAAGGTCAAC 


4980 


CTATCTACAA 


CGAAGAAAAG 


GTCTACTATC TGCTTAACAA 


ACCACGCGGT 


GTGATTTCCA 


5040 


GTGTGACAGA 


TGATAAGGGT 


CGCAAGACGG TTGTCGACCT 


CTTGCCCAAT 


GTCAAAGAGC 


5100 


GTATTTACCC 


TGTGGGTCGT 


TTGGACTGGG ATACATCAGG 


TGTCTTGATT 


TTGACCAATG 


5160 


ATGGGGACTT 


TACAGACGAG 


ATGATTCACC CTCGTAATGA GATTGACAAG 


GTTTATGTCG 


5220 


CGCGTGTTAA 


AGGTGTGGCC 


AATAAGGACA ATCTCCGCCC 


CTTGACCCGT 


GGTCTTGAGA 


5280 


TTGATGGTAA 


GAAAACCAAG 


CCAGCTGTTT ATGAAATTCT 


CAAAGTGGAC 


CCAGTCAAAA 


5340 


ATCGCTCTGT 


GGTGCAGTTG 


ACCATCCATG AAGGGCGTAA 


CCATCAGGTT 


AAAAAGATGT 


5400 
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TTGAAGCTGT TGGTCTCCAA GTAGATAAGT TGTCTCGGAC TCGTTTCGGA CACCTAGACT 5460 

TGACAGGACT CCGTCCAGGA GAATCCCGTC GTCTTAATAA AAAAGAAATC AGCCAACTAC 5520 

ACACCATGGC TGTAACTAAG AAATAATGAA ACGAATTTTA ATAGCGCCTG TGCGCTTTTA 5580 

CCAACGTTTT ATCTCACCAG TCTTTCCACC CTCTTGTCGC TTTGAGCTGA CTTGTTCCAA 5640 

CTACATGATT CAGGCTATTG AAAAACATGG GTTTAAGGGG GTATTGATGG GCTTGGCTCG 5700 

. GATTTTACGT TGTCATCCCT GGTCGAAAAC AGGTAAGGAC CCCGTTCCAG ACCGCTTTTC 5760 

CCTTAAACGA AATCAAGAAG GGGAATGAGG TGGGGTAAAT AGATTTCAAA ATGATAAAAA 5820 

CGCATCCTAT CAGGTTTGAG TGAACTTGAT AGGATGCGTT TTAGAATGTC AAAATTTTAT 5880 

ACTCTTCGAA AATCTCTTCA AACCGCGTCA GCTTTCATCT GCAACCTCAA AACAGTGTTT 5940 

TGAGCAACCT GCGGCTAGTT TCCTAGTTTG CTCTTTGATT TTCATTGAGT ATTAAATTGA 6000 

GTTTGAAGTG GCTTATTTCA AAGCTTTTTG TATGTCTTCA ATCATGAGTT TTGTTGATTC 6060 

AAGTCCGCCT CCGCTTAGAT ACCAGAGGTC TGGTGTTAGT TGGATAATCT TACCATTTTT 6120 

AGCAGCAGGT GTTTCAGCGA TAAGGGCATT TTCTAGGACA CCGTCGTTGC TAGAGTTGTC 6180 

CCCACCGATG GCAAGGGTAC GGTTGATGAC AAAGAGGATG TCAGGGTTGA TTTCTTTGAC 6240 

ACTTTCAAAG CTGACTTCTT GTCCGTGGCG TGAGTCTTCA AATTTTGTAT CAGTTGGTTT 6300 

GAATTTCAAG GTTTGGTACA AGAAAGAGAA ACGAGATTTG GCACCAAAGG CTGCCATTTT 6360 

TCCTTCATTA AGGAGGATCG CAAGGGCTTT TTTGTCAGAG CTTTCATTTT TAGTAGCGAC 6420 

TTCTTGGATG CTCTTGTCTA GCTTGGTCAA TTCTTCCTTG GCTTTCTGTG TACCAGTTTC 6480 

GCCGAAGGCA CTTGCTAAGG ATTCGATATT AGCCTTGGTA GAAGTCCAGT AGTCGTCCTT 6540 

GCTTGCTTGG AAGAGAACGG TTGGGGCGAT TTCTTTGAAT TTGTCTACGA ATTTTTGTGT 6600 

ACGTGGCGAA GCGATAATCA AATCAGGCTC AAGGGCGGCG ATAGCTTCTA AATCAGGTTC 6660 

TTTCATAGAA CCAACATTTT TGACAGTTCC CACTAGGTCT TTTAGATAAG TCGGAACAGT 6720 

TTTTGTAGGC ATTCCGACGA TATTTTTTTC AAATCCTAAA GCGCGAATAG TATCCGCAGC 6780 

GCCGAGGTCA AAGGTCACAA TCTTTTCAGG AACTTTGGAA AGTTTGACCT CGTCCAGTGA 6840 

ACTTTTAATG GTTACCTCTG TTGGAGCAGA GCTACTGGTC TCTGTCTGAC TAGTGCTTGA 6900 

GTTTGTACTA CATGCACCAA GTAGGAGCAA GAAGCTGGCC ACTAGGGCAG TGAAATAAAG 6960 

TTTAAGGGAT GTTTTCATAA TTTCTCCTTT TTAAAATGTG ATAACGATTT AGGGAGTCTC 7020 

TTAATCTTAT TGACTAAGAG ACTGAAGGTT CTCTAACTTG AGCTTTTATG TTACTAGCTA 7080 

TAGATACAGA TCTTTTTGTC ATTGATATCA GCTAGCGTGA TGGGAATCTC ATAAAGTTGA 7140 
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CTGAGCAGGT CAGCCTGCAT GATTTGATCG GTTCTTCCCT TGCTAAAGAC CTGGCCGTCC 7200 

TTGAAGGCGA CAATTTCATC TGCATACTGA CTGGCCATGT TGATATCGTG GAGGACGATG 7260 

ATAATGGTCT TGCCGAGTTC CTCCACCAGT CGTCGAAGAA TCTGCATCAT GCTGACGCTT 7320 

TGCTTGATAT CGAGATTGTT GAGTGGTTCG TCCAGCAAGA TAAAGTCCGT ATCCTGGGCC 7380 

AGTACCATAG CGATAAAGAC GCGCTGGAGT TGCCCCCCTG ACAGGCTATT GATGTAGCGG 7440 

TCTTTTAAGT TGGTCAGTTC TAAATAGTTC AGAGTTTCTC GGATTTTTTC CCAGTCTTCT 7500 

"GATCTAAGTC GACCTCGGCT GTAGGGAAAA CGTCCAAAAC TGACCAGTTC TTCAACAGTC 7560 

AATTTGGCTT GGTAATTGAT TTTCTGTTTT AGGATGGTTA GTTCTTGGGC CAGTTCTTGC 7620 

GAATTCCAGC TCTCGATTTC ACGTCCTTTG ATACTGAGAA CTCCCTGATC TTTCTTGGTT 7680 

AGCCTGCTCA TGATGGAGAG GAGAGTCGAT TTTCCAGCAC CATTTGGACC AATAAAGGCT 7740 

GTCAGTTTTT GAGGACTGAC TTCAAGCGAA ATGCCTTGCA AAATATCCTG TTTTTGAATG 7800 

GATTTGTCAA TGTTTTCCAG TTTCACTGAC GAGACCTCCT ATATAGTAAG ATAAAGAATA 7860 

AGAAGCCACC CACACTCTCA ATGATCATAC TGATACGAAT TTCCAGTGCA AAGACTCGTT 7920 

CAATCAAGGC TTGCCCCAAG GTTAAGCTAA TAAATCCAAC CAGAATGGCC ACTATAAAGA 7980 

GTAACTTGTG CTGATAGTCT TTGACAATCA GGTAGGTGAG GTTGGCCAGT ATAAAGCCGA 8040 

AGAAGGCCAT AGGTCCTACC AAGGCAGTGG CCGTTGAGGT CAAAAGCACG ATTCCCCAGA 8100 

GGAGCTCTTT CTGTTCTTTT TCAACATCGA GTCCCAATAT CTGAGCCGTT TCTCTTTGCA 8160 

GGTGCAAGAC ATCTAGAACG ACTGCTTTTC GAAAGAAAAA GATTGTCAAA GCGAGGATGA 8220 

TCAGAGAACC GATGGCTAGG ATGGAAGTGT TGAGATGTTG AAAGGAGGCA AAAAGACTAT 8280 

TTTGCAGTTT ATCGTATTCG TTTGGATCCA TTAGGACTTG AAGGAAGGTG CTGATATTTC 8340 

GAAAGAGACT TCTGAGCGCT AG AC AG AT CA GCAGGACGAA GACCAGGTCT TGCTTCATCA 8400 

GTGTCTTCAA GTAACCTTGT AAGGCGAGAA AGAAGAGGGA CTGGACAAGA AGTAAGACTA 8460 

GGAATTCTAA GATAGGGGAT TTGCCAAGTT GAAGAAACTT GCTTTCAAAA ACCAGTAG1A 8520 

GGGTTTGTAG TAGGACGTAG AAGGATTCAA TTCCCAAAAT ACTAGGCGTC AGGAAGCGAT 3580 

TTTCCGTCAG GGTTTGAAAA CTAATGGTCG AAATCCCAGT CGCGATGGCT ACCAAGAGAT 8640 

AAACGATGAT CTTTTGGGAA CGCAACTTCC AAGCAAAGGC TGACAAGTGA GTGATGGGCC 8700 

AAAAGTAGAG AAGACAAGCT CCGATGGCAA GAATAATGAG AATCCAGAAG AGCTTGGTAT 8760 

GTTTGCTTTT AGTCTGCATC TTTTCGTCCC CCTCTCCAGA GAAGTAGGAT AAAGACGAGA 8820 

CTACCGATGA TTCCTAGCAA GAGACTGACA GACAACTCAT AGGGCCTAAT CAGAACTCGG 8880 

GATAGGATAT CGCAAGCCAG AACTAGATTG GCACCAACCA GTGCGACCAT GAGTTTGGTT 8940 
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TGACTTAGAT TATCTCCATA GCGCTTGCGA ACAAGATTGG GAACGATAAC TCCGAGAAAT 9000 

GGTAGGCCAC CCACGGTAAT CATGGTGACG CTTGTCGTTA GCGCCACCAG AAAGAGGGCC 9060 

AGTTTTTCAA GTAGGGAGTA GGAAATCCCC AAACTCTCGC TGGTTTCTTT CCCTAGATTC 9120 

ATGATGGTGA AGGTTTGGGA TAATTTCCAA ACGGTTATCA GGATGATGAG GCCTAAGAAG 9180 

AGCCACTCAT ACTGATGGGT CTGAATCATG GAGAAGGAGC CCTGGGTCCA GGCAGTCATA 9240 

CTCTGAACCA GATTGAAACG ATAGGCGATA ACTTCTGTGA CTGAGCCGAT AATCCCGCTA 9300 

TAGATGATCC CAATCAGAGG CAACATCCAC CTTTCCTTTA CAGTAAAAAT GGTCATAAAG 9360 

GCTAGGAAGA AGAGGGTGAA TACGATGGAT GAAACAAAAG CGAAGAGCAT CTTGTGGGTC 9420 

AGACTAGCCG ATGGAAAGAC AAAAAGGCTC AGCACCATTC CCAGTTTGGC GGCTTCAGTC 9480 

GTTCCAACTG TACTCGGTGC AGCAAACTGA TTTTGGGTAA TAGTCTGCAT GAGAAGGCCT 9540 

GCCATACTCA TACTAGAGGC AGTCAGGAGA ATACTGATAG TTCTTGGGAG ACGGGACTCT 9600 

TGAAAGAGGA GCCAGGTCTG CTGGTCGAAA TCAAATAGCT TTCCCCATGA AAAATCACTG 9660 

GTCCCAATGC TAATAGAGAG AAAGACTAGG AGTAGAAGTA AGCCAGG 9707 

(2) INFORMATION FOR SEQ ID NO: 165: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 5910 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 

CCGCAATTAT GCTTGAAAAG GAGTATACTT ATAAGTAACG CAAACGTTTG CGTCTGAAAA 60 

ATACGCAACG TTCCATTATT TTAACACACG AGGTGCTATT ATGAAAAAAC GTCAAAGTGG 120 

TGTGTTGATG CACATCTCTT CTCTTCCAGG AGCTTACGGA ATCGGATCAT TTGGTCAAAG 180 

TGCTTACGAC TTCGTTGATT TCTTGGTCCG TACAAAACAA CGTTACTGGC AAATCCTTCC 240 

ATTAGGAGCA ACTAGTTACG GGGATTCTCC TTACCAATCT TTCTCAGCCT TCGCAGGAAA 300 

CACTCATTTT ATCGATTTAG ATATCTTGGT GGAGCAAGGT TTGTTGGAAG CAAGTGACCT 360 
TGAAGGAGTT GACTTTGGTA GCGATGCGTC TGAAGTTGAC TATGCTAAAA TCTACTATGC • 420 

ACGTCGTCCT CTTTTAGAAA AAGCGGTGAA ACGTTTCTTT GAAGTCGGAG ATGTTAAAGA 480 

TTTTGAGAAA TTTGCTCAAG ACAACCAATC ATGGCTTGAG CTCTTTGCTG AGTATATGGC 540 

TATCAAAGAG TATTTTGACA ATCTTGCTTG GACTGAATGG CCAGATGCAG ATGCTCGTGC 600 
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TCGTAAAGCT 


TCAGCACTTG AAAGCTATCG 


TGAGCAATTG 


GCAGACAAGT 


j. vjvj a x x rvv«w«v 


660 


CCGTGTGACT 


CAATACTTCT TCTTCCAACA 


ATGGTTGAAA 


TTGAAAGCTT 


APGCTAACGA 


720 


CAACCACATC 


GAAATCGTTG GGGACATGCC 


AATCTACGTA 


GCGGAAGATT 


VHAVJ X WO 1 *V X 


780 


GTGGGCAAAT 


CCACATCTCT TCAAAACAGA 


TGTCAATGGT 


• MfcVJVJ \^ X fW, X X 


GTATPft P 

vj x n 1 V-Vj^.x\\j\j 


840 


ATGCCCACCA 


GATGAGTTTT CTGTAACTGG 


TCAGCTTTGG 


GGTAATPPAA 


*TV ^ |^ » TV! R ^ w l «/T 
1 V- i. -TV luHL 1 \3 




GGAAGCAATG 


GACAAAGACG GCTACAAATG 


GTGGATTGAA 




AAAfiPTTPAA 




' AATCTACGAT 


ATCGTTCGTA TCGACCACTT 


CCGTGGCTTC 


GAATCTTACT 


GGGAAATCCP 


1020 


TGCTGGTTCC 


GATACAGCAG CACCTGGTGA 


GTGGGTGAAA 


GGTCCAGGTT 

wu X vwOvV X X 


APAAHPTTTT 
nwiAVju x x x x 


1080 


TGCAGCCGTT 


AAGGAAGAAC TTGGTGAGCT 


AAACATCATC 


GCAGAAGACC 


X X UwL X X X 


114 0 


GACAGATGAA 


GTGATCGAAT TGCGTGAACG 


TACTGGCTTC 


CCAGGAATGA 


AGATTCTTCA 


1200 


ATTTGCCTTC 


AACCCAGAAG ACGAAAGCAT 


TGATAGCCCA 


^-o.^— x x uu\>n^_ 


PTYSPT A IfPr 


L Z DU 


AGTTATGTAC 


ACAGGAACAC ACGATAACAA TACGGTTCTT 


v3\j l l uu x nLL 


u 1 /Ui loAUA 1 


i ion 


TGATGATGCG 


ACTCGTGAGT ACATGGCTCG 


TTACACGAAC 


CGTAAAGAAT 




lJOU 


GGTACACGCT 


ATGCTTCGTA CAGTATTTTC 


ATCAGTTAGC 


TTTATGGCAA 


TTGCAACTAT 


1 A A (\ 


GCAAGATTTA 


CTAGAATTGG ATGAGGCAGC 


TCGTATGAAC 


TTCCCATCTA 


CCCTTGGTGG 


XDUU 


AAACTGGTCT 


TGGCGTATGA CTGAAGATCA 


ATTGACACCA 


GCTGTCGAGG 


AAGGTTTGCT 




TGACTTGACA 


ACAATTTATC GCCGAATTAA 


TGAAAATTTG 


GTAGATTTAA 


AGAAATAAGA 




CAATAATCAG 


GAGACAACTA AACATGTTAT 


CACTACAAGA 


ATTTGTACAA 


AATCGTTACA 




ATAAAACCAT 


TGCAGAATGT AGCAATGAAG 


AGCTTTACCT 


TGCTCTTCTT 


AACTACAGCA 


1740 


AGCTTGCAAG 


CAGCCAAAAA CCAGTCAACA CTGGTAAGAA AAAAGTTTAC 


TACATCTCAG 


1800 


CTGAGTTCTT GATTGGTAAA CTCTTGTCAA ACAACTTGAT TAACCTTGGT CTTTACGACG 


1860 


ATGTTAAAAA 


AGAACTTGCA GCTGCAGGTA AAGACTTGAT 


CGAAGTTGAA 


GAAGTTGAAT 


1920 


TGGAACCATC 


TCTTGGTAAT GGTGGTTTGG 


GACGTTTGGC 


TGCCTGCTTT 


ATCGACTCAA 


1980 


TTGCTACTCT 


TGGTTTGAAT GGTGACGGTG 


TTGGTCTTAA 


CTACCACTTT 


GGTCTTTTCC 


2040 


AACAAGTTCT 


TAAAAACAAC CAACAAGAAA 


CAATTCCAAA 


TGCATGGTTG 


ACAGAGCAAA 


2100 


ACTGGTTGGT 


TCGCTCAAGC CGTAGCTACC 


AAGTACCATT 


TGCAGACTTT 


ACTTTGACAT 


2160 


CAACTCTTTA 


CGATATTGAT GTTACTGGTT 


ATGAAACAGC 


GACTAAAAAC 


CGCTTGCGTT 


2220 


TGTTTGACTT 


GGATTCAGTT GATTCTTCTA 


TTATTAAAGA 


TGGTATCAAC 


TTTGACAAGA 


2280 


CAGATATCGC 


TCGCAACTTA ACTCTCTTCC 


TTTACCCAGA 


TGATAGTGAC 


CGTCAAGGTG 


2340 


AATTGCTCCG 


TATCTTCCAA CAATACTTCA 


TGGTTTCAAA 


CGGTGCGCAA 


TTGATCATCG 


2400 
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ACGAAGCAAT 


CGAAAAAGGA 


AGCAACTTGC 


ATGACCTTGC 


TGACTACGCA 


GTTGTCCAAA 


2460 


TCAACGATAC 


TCACCCATCA 


ATGGTGATTC 


CTGAATTGAT 


TCGTCTTTTG 


ACTGCACGTG 


2520 


GTATCGATCT 


TGACGAAGCA 


ATCTCAATTG 


TTCGTAGCAT 


GACTGCCTAC 


ACTAACCACA 


2580 


CAATCCTTGC 


TGAAGCGCTT 


GAAAAATGGC 


CTCTTGAATT 


CTTGCAAGAA 


GTGGTTCCTC 


2640 


ACTTGGTACC 


AATCATCGAA 


GAATTGGACC 


GTCGTGTGAA 


GGCAGAGTAC 


AAAGATCCAG 


2700 


CTGTTCAAAT 


CATCGATGAG 


AGCGGACGTG 


TTCACATGGC 


TCACATGGAT 


ATCCACTACG 


2760 


GATACAGTGT 


TAACGGGGTT 


GCAGCACTCC 


ATACTGAAAT 


CTTGAAAAAT 


TCTGAGTTGA 


2820 


AAGCCTTCTA 


CGACCTTTAC 


CCAGAAAAGT 


TCAACAACAA 


AACAAACGGT 


ATCACTTTCC 


2860 


GTCGTTGGCT 


TATGCATGCT 


AACCCAAGAT 


TGTCTCACTA 


CTTGGATGAG 


ATTCTTGGAG 


2940 


ATGGTTGGCA 


CCATGAAGCA 


GATGAGCTTG 


AAAAACTTTT 


GTCTTATGAA 


GACAAAGCAG 


3000 


TTGTCAAAGA 


AAAATTGGAA AGCATCAAGG 


CTCACAACAA 


ACGTAAATTG 


GCTCGTCACT 


3060 


TGAAAGAACA 


CCAAGGTGTG 


GAAATCAATC 


CAAATTCTAT 


CTTTGATATC 


CAAATCAAAC 


3120 


GTCTTCACGA 


GTACAAACGC 


CAACAAATGA 


ACGCTTTGTA 


CGTGATCCAC 


AAATACCTTG 


3180 


ACATCAAAGC 


TGGTAACATC 


CCTGCTCGTC 


CAATCACAAT 


CTTCTTTGGT 


GGTAAAGCAG 


3240 


CTCCAGCCTA 


CACAATCGCT 


CAAGACATTA 


TCCATTTAAT 


CCTTTGCATG 


TCAGAAGTTA 


3300 


TTGCTAACGA 


TCCAGCAGTA 


GCTCCACACT 


TGCAAGTAGT 


TATGGTTGAA 


AACTACAACG 


3360 


TTACTGCAGC 


AAGTTTCCTT 


ATCCCAGCAT 


GTGATATCTC 


AGAACAAATC 


TCACTTGCTT 


3420 


CTAAAGAAGC 


TTCAGGTACT 


GGTAACATGA 


AATTCATGTT 


GAACGGAGCT 


TTGACACTTG 


3480 


GTACTATGGA 


CGGTGCTAAC 


GTGGAAATCG 


CTGAGTTGGT 


TGGAGAAGAA 


AACATCTACA 


3540 


TCTTCGGTGA 


AGATTCAGAA 


ACTGTTATCG 


ACCTTTACGC 


AAAAGCAGCT 


TACAAATCAA 


3600 


GCGAATTCTA 


CGCTCGTGAA 


GCTATCAAAC 


CATTGGTTGA 


CTTCATCGTT 


AGTGATGCAG 


3660 


TTCTTGCAGC 


TGGAAACAAA 


GAGCGCTTGG 


AACGTTTTTA 


CAATGAATTG 


ATCAACAAAG 


3720 


ACTGGTTCAT 


GACTCTTCTT 


GATTTGGAAG 


ACTACATCAA 


AGTCAAAGAG 


CAAATGCTTG 


3780 


CTGACTACGA 


AGACCGTGAC 


GCATGGTTGG 


ATAAAGTCAT 


CGTTAACATT 


TCTAAAGCAG 


3840 


GATTCTTCTC 


ATCTGACCGT 


ACAATCGCTC 


AGTATAACGA 


AGACATCTGG 


CACTTGAACT 


3900 


AATACTCTTC 


GAAAATCTCT 


TCAAACCACG 


TCAGCTTTAT 


CTGCAACCTC 


AAAGCAGTGC 


3960 


TTTGAGCAAC 


TGCGGCTAGC 


TTCCTAGTTT 


GCTCTTTGAT 


TTTCATTGAG 


TATAAGATAC 


4020 


AAATTTATAC 


TAATACATTT 


TGTAAAAAAG 


CGAGTTTCGA 


TTGAAATTCG 


CTTTTTTAAT 


4080 


GATGTAGATT 


TGGGTCAATC 


TTGTCTAAAA ATAGGGAAAT 


CCTAGATACA 


GTGAAGGCTT 


4140 
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TAAATGCTGG 


TTTTTACTGT 


CCTCAGCCTT 


ATATTTTTTC 


GTAGTTGGTT 


ACCTCATATC 


4200 


TATTATATTC 


GCTTACATAA 


AGTATTATAA 


TATAATTGTA 


GGAAAGAAGG 


TGTTTTTATG 


4260 


ATATACACAC 


TTAAATTGGT 


GTTGTTTATT 


ACCTTTCTTG 


TAATAAGCTT 


GrTACCTfiAT 

Ul 1 ALL 1 \9n I 


4320 


AAGATTTTTG 


GAAAAAATAA 


AAAAATTTGG 


AAAATAGTTT 


TTGCAATATT 


GACGGCAGTG 


4380 


GCAGCATTGT 


CATTTATGTA 


CTAAGTTATT 


TTAAGAATGT 


AGGGAAATAA 


ACCCTACATT 


4440 


CTTTTTAGTT 


TTTTCTGTTT 


TCTAAATTCT 


ATTTATCCAA 


GCGATTCAAC 


ATTTCTTGCT 


4500 


TCTTCGCTTC 


AAGTTCTGCA 


CGCTTTTCTT 


CGATTTCGGC 


ATGTTTTTTC 


TCGAGTTCAG 


4560 


AACAACTTGC 


ACCATTGCTA 


AATTCTTTTC 


GCCATCAGGA 


GATAGGGTGA 


GTCGACATGT 


4620 


CTATTACTCA 


CCCAAAGCAG 


TCCTACAAAG 


CAGGAATTTT 


CTGTTACTTT 


TTTGGAAATA 


fi D OU 


GTAACGTTTA TACAGCTTTG 


nunv. x i v-Vj j. t\ 




AAACACACTC 


CGAGGGGTTT 


4740 


ACAGAAAGCA 


GAAAAGGAAT 


\3t\ 1 C 1 V3U Inl 




CTTTTCyCTC TTTTTCTTTA 


4800 


AGTAATTATA 


TACAATGTAC 






Lit. r(sA TCLAL 


CACCTAAAGG 


4860 


GAACTTTAAA 


CAACATTGAT 


AAG ATA A Afl A 




CGAAAATACG 


TTATACCCAA 


4920 


TTAATTTTAT 


TGTATATCTC 


ATGATTAAAA 


Vj 1 1 nn ILL 1 1 


CCGTTGTTAG 


GAATGGCATC 


4980 


ATTTTTATCC 


CATAATTGTG 


CTAAATAAGT 




AATAAATTCA 


TAGCGAATTC 




TAAAGCAACA 


TCATTTACAA 


ACCAACTACC 


TAGATATCTA 


GAAATTGCTG 


AACGAATAGC 


m net 

D1UU 


ACTTTTTGCT 


GCATGTTTTC 


CTTTTACTTT 


AATTAGATTT 


GCAAGGCCTG 


CAGTAGTTCC 


5160 


TCCTAATGCT 


AAAGCTATTG 


CAGTATCTAA 


TAGAGCACCC 


ATTTGATTAA 


CTGTAATACC 


5220 


TTGCCAAACT 


GCTCTAAATG 


GAGAGTATGT 


AGGTGGGATT 


GTATAATCGC 


CTTGTAATTG 


5280 


TCGGTTAATT 


ACTTCTTTGA 


TCCATTGTTG 


TGAGACGTCT 


GGATGAAAAG 


ATTGGATTTC 


5340 


GTTTGCAAGT 


GTATTGATTT 


GTTCTTCTGT 


TAGAGAAGTG 


ACAGGTTGAA 


GTTCCATATT 


5400 


TGTTTCAATT 


TGTGATACTT 


GTTCAGAAGC 


GTATACAGCT 


GAAACACTTG 


GAATCGCTGA 


5460 


TACAATTAAC 


ACAATTGACG 


TCAAAAAAAC 


CGAAATAAAT 


TTCATTAATT 


TGTTCATGAG 


5520 


CTTTTCTCCT 


TTTTATTTGC 


ATCTGCTTAC 


ATTTTATCAT 


ATACTGTTAT 


TATAGTCAAA 


5580 


AAAATATGCT 


ATTATGTTAA 


AAAAATATTT 


TTCAAAATAT 


AAATGGACGG 


ATTTATTTTG 


5640 


GATTTTATTT 


GTTATTTTGA 


CCTGCCTCTA 


TATTGGTAAC 


CATGATTTGT 


TTACTCTCAA 


5700 


TCATCAAGAA 


TTCTCTTTTC 


GTGGTAGCGT 


TTGGGGTCTG 


GTACTGGCCT 


TATATCACTT 


5760 


ACTATTCATT 


GATAAGTTTG 


TTATATCGAA 


TCGAAAATAA 


AGATTAGAGC 


TATGCTTGAC 


5820 


TGTGTACTTT 


TAGGATTTAT 


TTTGGAGGAA 


GATTTTGTCT 


CTATTATTTA 


TTATTTTAAA 


5880 


TTTATTTATT 


TTGTATAAGA 


TCTATTCTTT 








5910 
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(2) INFORMATION FOR SEQ ID NO: 166: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5406 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


166: 






GGCATAGCGA 


CTCATTTTTT 


CAACTGTCCA 


GGCTGGATAC 


CAGACTAATT 


TAACCTCAGT 


60 


ATCCGTTACT 


TCTGGAACCT 


CTATCATAGC 


ATCATAAATC 


TGGTCTGTCA 


AAAGGTCTGC 


120 


TAAGGGACAA 


CCCATAGTTG 


TCAAAGTCAT 


GTCAATCTCT 


GTTTGCCCTG 


TGTCACCGTC 


180 


AAAACGAATC 


TCATAGATCA 


AACCAAGATT 


GACAATATCG 


ATTCCCAACT 


CAGGGTCGAT 


240 


GACTTCTTCC 


AAGGCTGTTA 


AAATCCGTGT 


TTTGATGTTT 


TCAATTTGCT 


CTTCTGTATA 


300 


AGCCATATTT 


TCCTCACTCT 


TAGTCTTCAA 


TAAAATCACG 


AAGCGGTTTG 


CTACGACTTG 


360 


GTTGGCGTAG 


TTTTCTCAAA 


GCCTTTGCTT 


CAATCTGACG 


GATACGCTCA 


CGAGTTACGT 


420 


TAAAGACTTT 


CCCCACATCT 


TCAAGTGTGC 


GCATTTTTCC 


ATCATCTAGT 


CCAAAACGTA 


480 


GACGCAGAAC 


ATTTTCTTCA 


CGGTCTGTAA 


GAGTATCTAA 


GATTTCATCC 


AATTGCTCAC 


540 


GCAAGACGAT 


ACGAGTCGTA 


TAATCCACTG 


GATTTTCAAT 


CACTTCATCT 


TCGATAAAGT 


600 


CTCCAAGGTG 


GCTATCGTCC 


TCTTCACCGA 


TAGGAGTTTC 


AAGAGATACT 


GGTTCTTGGG 


660 


CAATCTTCAA 


GATTTCACGA 


ACCTTATCAG 


GTGTCATATC 


CATTCGTTCA 


GCAATCTGTT 


720 


CTGGTGTCGG 


ATCTTGCCCC 


AATTCTTGAA 


GGAGATTCCG 


CTGTTCACGA 


ACCAATTTAT 


780 


TGATAGTTTC 


AACCATGTGA 


ACTGGGATAC 


GGATGGTACG 


AGCTTGGTCC 


GCAATAGCAC 


840 


GAGTGATAGC 


CTGACGAATC 


CACCAAGTTG 


CATAAGTTGA 


AAACTTGAAC 


CCTTTAGAAT 


900 


AGTCAAACTT 


GTCAACCGCC 


TTCATCAAGC 


CCATATTTCC 


TTCTTGAATC 


AAGTCAAGGA 


960 


ACTGCATACC 


ACGACCGACA 


TAGCGTTTGG 


CAATGGAAAC 


AACCAAACGA 


AGATTGGCTT 


1020 


CCGCAAGACG 


TTGTTTGGCT 


TCGATATCAC 


CAGCTTCAAC 


AGCCAGTGCC 


AACTCTTTCT 


1080 


CCTCTTCATT 


GGTCAAGAGA 


GGAACGACCC 


CTATTTCTTT 


CAAGTACATA 


CGGACAGGGT 


1140 


CATTGACCTT 


AGCAGAAGTT 


GACCCAATCA 


AGTCCTCATC 


GCTGAGTTCT 


GGTTCTTCTT 


1200 


CATTGCTGAG 


AACACGCGCA 


CTTGGATTTC 


CTTCGTTATC 


TGTGATAGAA 


ATGCCTGCAT 


1260 


CCTGAATCCG 


TTGCAAGAGA 


TCTTCAATCC 


CATCAGCGTC 


CAAGGTAAAA 


GGAATAACCA 


1320 


GACTTGCATT 


GATTTCATCA 


TCTGTTGCTG 


TCCCTTTTTG 


CTTATGATTA 


CGGATAAATT 


1380 
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CTGCTACCTG TACGTCAAAT GTTGTTACTT CTTTTTGTTT TGTTGCCATT ATTACTCCAT 1440 

TCTTCTCTTT TGGGAAATTA AACGTTCCAA TTCTTCTAGG GCTGTATCTG TATCTCCTAC 1500 

ATGGCTAGCT TCCTGCACCT TCTTTTTGAT TCTCATATTG TCCTGATTCA AGAGAGCCTT 1560 

GTTTCGAGTC ATCTCTACTT CACTAAGTTC CTGCGGCGAT ATCTCAGCAG GCAAATCCTG 1620 

AGCTAAAACT TGGTACCAAG CTCTTTCAAC TTCCTCTGTC TGCTCTGCTA AAACTTCTGG 1680 

AGGAAGATTT CCATACTGGC CAAGCAAGTC ATATAAGACC TGAAATTCAG GTGTAGCAAA 1740 

TGCAAAGTCT TCTCGCAAAC GGTAATCGTT CAAAACAAGA GGGGATTCCA TCATCCGATA 1800 

GAGTAGATGG GCTTCTGCCC TCATAATAGC CGATAACTGC TTGGTGACAG GCATGGTGAT 1860 

TGGCGTCGGT CTGGAAATTC CTTCCATGCG ATTCTGCCTT TGCACCTGAC GACTCTCATT 1920 

AACAATCTGC TCAATCTGGG TATAATCAAA GGACGCCAGA CTGTCAGCTA AAATATGAAT 1980 

ATAGCTGTTT TGAGCAGCGA TGGACTTTTC TTGAACAATC AAGGGAGCTA TTTTTTCAAG 2 040 

AAACTCAATC TGAGCCTGCA GATTTTCACT GTTTTCAGGT TTGTACTGAT GAATGTAGAA 2100 

CTCAATCGGA CTAATACGAG TTTTCGTTAA TAGATAGGCC AAGTCTTCTG GACCATTTTT 2160 

TTGTAGATAC TCATCAGGAT CCAAGTTATC AGGCATGCTG ACGATTTGCA CAGGCATATC 2220 

ACCAATTTCA TCCAATGCTT TCAATGTCGC GGCTTGCCCA GCCTTATCTC CATCGTAAAC 2280 

AAGAACCAAT TTCTTGGTTA ACCTTTTCAG ATGCTCAACA TGCTCTCGAC TCAAGGCTGT 2340 

TCCCATCGAC GCCACAGCAT TTTCGATTCC AGCCCGATAG GCTGCAATAA CATCCATGAA 2400 

TCCTTCCATC AGGTAAATCT CACTAGCTTT TCCAGAAGAT CTTTTTGCCC TATCCATATG 2460 

ATATAATTCG TAACTTTTGT TAAAAATTGC AGTCGATCGG CTGTTTTTAT ACTTAGAAGT 2520 

TTGTGAATCC GTTTTTTGCC AGATACGACC TGAGAAGGCA ATGACCTTTC CTTGGTCATT 2580 

TGTCAGGGGA AACATAATGC GATTGTGAAA GGTGTCTACA AATTGATTGG CATC CG AG AG 2640 

ATAAAACAGG CCTGAATCCA GTAAATCCTC TTCACGATAC TGATCAGACA AACGTTGATA 2700 

GAGATAGTTT CGTTCTGGAG GTGCTAAACC AATCCAAAAA TGTTTAAGCA CTTCATCTGT 2760 

CAACCCCCGC TGATAAAGGT AATTTCTGGC CTCTTCGCCC ATAGTCGTTG TCATGAGAAT 2820 

AGCATGGTAA AATTTGGCTG CATCTTCGTG CATATCATAA AGAGCTTGGT GAGGTGAGGC 2880 

TGACTTCTGC TCACTATAAA GCGGTTTTTC AACCTCAATT CCAACACGCT GACCTAAGAT 2940 

TTGGACTGCT TCTATAAAGG GAACCCCTTG GTACTCCTCG ATGAACTTAA AGACATCACC 3000 

TGAGCGACCA CAACCGAAAC AGTGATAAAA CTGCTTGTCC TCTACAACAT TGAAAGATGG 3060 

TGTTTTTTCA CCATGAAAAG GACAGAGCCC TAGATAGTTC CGTCCTGCCT TTTGTAAAGA 3120 

AATCACATCT CCTATGACTT CCACAATGTT GGCATTGTTT TTGATTTCTT CAATGACTTG 3180 
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TTTGTCAACC ATACACAATA CCTCCATGTT ATCATAGTTT ACTTTATATA GTATACTTTA 3240 

TTTCAGAAAA AAAGTAAACC ATTTCACTCA TTTTCCCTAC TTTATTCAAA GAGTTGATAA 3300 

TAATCAGAGA TTTTCATTTT TGCTTTTTCT TCTTGGTTTA AATCTTGGAT AATTCGTCCT 3360 

TCTTTCATGA CAATCAAGCG ATTGCCGTAT TTGAGAGCAT CTTCCATATG ATGAGTAATC 3420 

ATAAGGGCTG TTAGCTGATC TTTCTTAACA AATTCATCTG TCAATTCCAT CAAAGCAACA 3480 

CTAGTCTTTG GATCCAGGGC AGCAGTATGC TCATCTAACA GGAGTAATTC AGGTCGCTTC 3540 

AAGGTTGCCA TCAAGAGACT CAAAGCCTGT CTTTGTCCAC CTGATAAGAA CTCAATCGGT 3600 

GTATTCAAGT GTTTCTCAAG ACCATTTCCT ACTTTTTCAA TGGTTGCCTG AAATTCATCC 3660 

TTATAGCTAG TCAAGCGTCG TGGTAACAAT CCACGCTTTT CACCACGAAA CTTGGCGATT 3720 

AAAAGATTTT CAGCGACCGT CATACGGGGA GCTGTCCCCA TCTTTGGATC TTGGAAGACA 3780 

CGAGACAGGT ACTTGGCACG CTTCTCGGGT GAAAACTTAG TGAGATCTTC ACCTAAAATA 3840 

CGGATAGTTC CACTAGTTAG TGATAAGGTC CCTGCTATAG TGTTAAAGAG AGTTGATTTT 3900 

CCAGCACCAT TTCCGCCCAA AATCGTGATA AAGTCCCGTT CAAAAATTTC TAAGGAAACA 3960 

TCATTTAAAA TAATCTTTTC TTCATCAAAG CCATTTTTAA CGATTTTGGT TGCATTTTTT 4020 

AATTCTACAA TTGCTGTCAT TTGCTTAACT TGGCTCCTTT CAAGATTGTT TGCTTAAATG 4080 

TTGGAATCAT GAGGCAGACT GCTAAAATCA AGGCACTGTA TAAACGAAGG TAACTTGTAT 4140 

TAAAGCCAAG TGCGATAACT GCCCACACTA AAAATTGATA AGCGATAGAA CCTACAACGA 4200 

TAGTAACCAA ACGCTCTGCC AAGCTCAAAC TCTTGAAAAT AACTTCTCCA ATAATCAAAC 4260 

TTGCAAGCCC CACAACGATA ACCCCGATCC CTCGAGACAC ATCGGCATAA CCTTCTTGCT 4320 

GAGCAATGAG GGCACCTGCA AGGGCAATCA CACCATTTGA TAAGACCAAG CCCATGAGCT 4380 

CCATGCGTCC AGTATGAATC CCGAAACTTC TAGCCATATC AGGATTATCC CCTGTAGCAA 4440 

TATAGGCTTG TCCGAGTTTA GTGTCCAAGA AAAAGAGCAT GAGAGCAATA ACAATACTCA 4500 

CAAAGATGAG ACCTGTCAAG AGTTGATTCA AATCCGAATC AAAAGGCAAA ACATCCTGAA 4560 

TTTGCTTGGT TCCAAGCAGG CCTAAATTCG CACGTCCCAT AATCAAGAGC ATGATTGAGT 4620 

GACAAGAAGT CATCACCAAA ATCCCTGAGA GCAAGGTTGG GATCTTCCCT TTTGTATAAA 4680 

GAAGGCCTGC TGCCATTCCA GCCAAACAAC CTGCTCCTAC AGCAACAAGT GTCGCTAAAA 4740 

ATGGGTTCAC GCCTTTGGTT ATCAAAGTGA CAGCAACAGC TCCCCCAAGA GGGAAGGAAC 4800 

CTTCTGTCGT CATATCTGGA AAGTTTAAAA TCCTAAATGT CATAAAGATT CCCAGACCTA 4860 

GAATAGCCCA GACAAATCCT TGAGAAATAA TGGAAACAAT CATATTTTAT TTAATCCTTT 4920 
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CTATATTCAT CTTTTTAAAA AATGGGAAGA GTCTCCTCCT CCCTACCTTA TTTATTCGAT 4980 

GACTTGTCCT GCTTCTTTGA GAACAGACTC AGGAATAGTA ATACCTAGTT CTTGTGCTAT 5040 

TTTTTTATTG ATGACTGACT TACCAGTTGA AAAGACATTG ACTGGGGTAT CGGCTGGTTT 5100 

TGCACCTTTC AAGACTTGCA CAATCATTTT ACCTGTTGCC ACACCAAGGT CATGTTGGTC 5160 

AATTACAACT GATGCCAAAC CACCTACTTC TACCATAGCT GTCGCACTGG GATAAATTGG 5220 

TTTCTTAGAA CTTTGATTGC TAGAGACAAC CGTTGGAAAT CCTGATGCAA TGGTGTTATC 5280 

AATTGGAACC CAAATAGCAT CTACCTTGCT AGTCATAACA GTGACAGTTG AGGCAATTTC 5340 

ATTTGTTGAA GGAACTGCAA ATGTTTCCAC TGTCAGACCT GCCTTTTCAG CATAAGCCTT 5400 

AAATTC 5406 



(2) INFORMATION FOR SEQ ID NO: 167: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9711 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167: 



CAGCTTGCTC 


TTACTATTAT 


AGCAGATGTT 


ATAGCTGGAA 


TTATCTTGTA 


TTTCGTCTGC 


60 


AAATGGCTAG 


ATGGTAAGAA 


GTAGACCGAA 


TGACTAGCCT 


ATAAACACCC 


GTTAAATCGC 


120 


TAAGATACGT 


CAAAAAAGCC 


CTTAACTATG 


GCACTAGTTA 


GGGGCTTTGG 


TGTTCTAATG 


180 


AACCTTATAC 


ACTAACTACA 


TTCTAGCATA 


TAAGCCCAGA 


TATTTCAAGA 


GTTTTATTTA 


240 


TTGTTTAAAG 


TTCTGAAAGG 


TCTATAATGA 


AGTTAGCCAT 


CTAGTATCAA 


AAAACCGACT 


300 


AGCTCTTATG 


AACTAGTCGA 


TTTCTCATCA 


ATGCGCCAAC 


ATTTCTTGGG 


CGATTTCTTG 


360 


GCCAGATAGG 


TTATCTGGGT 


AGTAGGTTGG 


CCAGTTGTCC 


ATTTCTTCAA 


AGAGGGCTTC 


420 


TTGGCTTGTG 


CCTCCAAAGA 


AGATATGGAA 


ATGTTCTGCC 


TTAACTGGGG 


CAACATTGVG 


480 


GTCACTAAAC 


TGAACATACT 


TGAATTGTCC 


AGCGTCAGCA 


TCTGTGGCTT 


CAAAGAGGAA 


540 


ACGCACGCCA 


CGATTGCCTT 


TCTTGTAAGT 


CAAAATTTTC 


TTACCGACAT 


ACTTGTAAGT 


600 


GTATTTCTTG 


CTTTGTCCAC 


CTTGAACAAA 


TTCCATAGTA 


TTATCAGTAA 


TGTTAATCTT 


660 


AGTCACATCT 


GTATGATAGC 


CTTTTGTATA 


GTAAGCCTTG 


TACTCAGCCT 


GGGTCATCTT 


720 


ACCAGTCAAC 


TTAGCCTTGT 


AGTCAAAGAC 


TTGGTCAAAC 


GTGCCGTCTT 


CAAGGAAAGG 


780 


ATAAACTGAT 


TGCCAGTTAC 


CTGCATAGTC 


ACTCAAGGTG 


CGGTCCTTGA 


CAGCTGCATC 


840 


CTCGAAGTAA 


CCATTTTGGA 


CTGTCTTGGT 


ATCCTCTGCC 


TTTTCAGGTT 


CAATTGCTGG 


900 
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GCCTTCTTGG TCTGTTGTTT GTTTCAAAGC CTTGAGGTTT TTCTCCATCA CGGAAATGTA 960 

GTTTTCTCCA GCCTTGGTGT CCTCTTCTGT CAGACTTTCT AAAGGATTGA GGACATCAGT 1020 

TTTGACACCT GCTTCTTTTG AAAGTGTGTT AGCAAGGGCT TGTGAGGCAT TTCTTCAAAA 1080 

TAGATATAGG CGATTTTATT TTTCTTGACA TACTCTGTCA ATTCTGCCAA GCGAGCAGCT 1140 

GATGGCTCTG CATCTGGAGA AAGTCCTGAG ATTGCGACTT GTTTGAGTCC ATAGTCCAAG 1200 

_ GCAAGATAGT TAAAGGCTGC GTGTTGAGTC ACAAAGCTCT TTTGTTTTGC TTGAGACAAA 1260 

CCTTCTGCGT AAGCCTTATC CAAGGCTTGC AATTTTTCGA TATAGGCAGC TGCATTCTTC 1320 

TCAAAGGTCT CTTTTTTATC AGGATAATCT GCTGACAAGC TGTCGCGGAT GTGCTCTACT 1380 

AGTTTAATGG CACGAACTGG TGATAACCAA ACATGGGGGT CAAACTCATG GTGATGACCT 1440 

TCTTCTCCAT GGTCATGGTC TCCCTCTTCT TCCTCGCCAC CTGGCAAGAG CAACATATCG 1500 

CCTGTCGCCT TGATGGTTTT CACTTTTTTC TTATCCAAGG TATCTAGCAA TTTAGGTACC 1560 

CATGTTTCCA TGTTTTCATT TTCATAAACG AAGGTATCTG CATCTTGGAT TTTGGCAACT 1620 

GCCTTGGCAG ATGGTTCGTA TTCATGAGGT TCTGTCCCAG CACCGATTAG GAGTTCTACA 1680 

TTAGCCGTAT CTCCTGCGAC TTGCTTGGTA AATTCATAGA CAGGGTAAAA GGTTGTCACG 1740 

ATATTGAGTT TACCATCTGC CTGTTTTTGA TTGGAACAAG CCACTAAAAA CAAGGCACAT 1800 

AGACTGGCTA GTAATAAGCT AATTTTTTTC ACGTTCGTCT CCTATTTGAT AAAACGTCTT 1860 

ACTAAACTGA TTAGTATAAA GACAGTTACA AAAATAATGG TAATACTTGC ACTTGCAGGT 1920 

GTTTCTGCAT AGTAGGAAAT GTAAAGTCCT GCTACCATTC CCAAAAAGCC AATCGCACTG 1980 

GCAAGCAGCA TAACCGATTT AAAGTTTTTC CCCAGACGCA GGGCAATACT AGCTGGCAAG 2040 

ACCATAATGG TCGATACCAG AAGAGCTCCT GCTGCAGGAA TCATAAGGGC AATAGCCACC 2100 

CCTGTCACCA TGTTAAAAAG AATGGACATG GTACGAACTG GCAAGCCATC CACAAAGGCC 2160 

GTATCTTCGT CAAAAGTTAA GATATACATA GGACGAAGAA AGAGAAAGGT CAAAATCAAA 2220 

ACAACCGCCG CAATGACAAA GAGGGAAATG ACCTGTTCTT CACTGATAGT CACGATCGAA 2280 

CCAAAGAGAT ATTGGTCCAA ACTCATTGAA CTCGAGCTTT TACCCTTGCT CATGACAATC 2340 

AGAGAAACAG CCAGACCTGT TGACATGAGG ATAGCTGTCC CGATTTCCAT AAAGCTCTTG 2400 

TAAACCGTAC GGAGATACTC CAGAAAGACC GCCGCAATCA AGACAATGGC AATAGTAGAA 2460 

ACAGTTGGAG AAATCCCCAA AACCAGACCA AAGGCTACAC CTGAAAGTGA GACGTGGCTA 2520 

AGGGTATCAC TCATCAAACT CTGACGACGC AAGATGAGGA AGGTTCCCAA TACCGGTGAG 2580 

AAAAGACTCA TAGCAATAAC CGCCAAAAAG GCGCGTTGTA TAAAGTCGTA AGATAATAAA 2640 
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CTAAGCATGG 


PPP A PPTPPT 




anvsa ft. /"* & nyrnc 


AAAt-AACUCC 


ATGGCGAGTC 


z /UU 


TTGGTTACGG 


ACT A HATHA A 


inl luLuAIV 


LuLA 1 AA 1A.L 


TTAACTTCTT 


CAGGGTCATG 


^ /ou 


GGTAATCATC 

V3V3 i *W» A N—^l 1 V» 


n/\AAl^?Ajl_t. 1 


luL-L-A 1 v*A X Vj 


a fvw^^r' pw 


TGGTGCATGA 


GTTCGTAAAA 


2820 


TTPATTTTTA 


V— i iU^luLAl 


pp a TpprrrT 




TPTIPP ft m ft A 

i L 1 AGGAT AA 


tv tv /*< t\ m/^ * /Ny-t 

ACACATCAGG 


2880 


GTCAGAAGPA 


rtAV, A I AL.Vsk,lj 


Lnnl IALLuL 


i LuL IXjL ri T 


nvnipr<prto ft /*■ 

TGTCCCCCAG 


ATAGAGACCC 


2940 


PAAfipfyprw: 




(_ L CACATGCC 


AACTGAGTCC 


AGACTAGCCT 


TGATATGCTC 


3000 


P/TPATPATYIA 


uUil 1A_AAAI_ 


/-> a cnc a a r»p a 
^ACWiAAv-(-A 


GCCTTTTCTC 


TV m TV /** /-»r-< TV /■» 

GGATAGCGAC 


CCGACTTGAC 


3060 


a a a t**p c a t a r* 


AH_LiTACTTG 


GAAAACCAGC 


ATTAAAACTG 


GCAATTTGTT 


GAGGAAGATA 


3120 


(jtjt. i Ai ix re 


AATTTCTTAC 


CTTGCGTATT 


TGTCTTTGAA 


ATAGCCACCT 


TTCCAATGCG 


3180 


I 1 GCAGA 


ATTCCAAGAC 


TAGCCTTGAT 


GAGCGTCGTC 


TTAGCCGCTC 


CATTTTCCCC 


3240 


AOTCAAGGTA 


ACAAATTCCC 


CACTATCAAC 


ACAATAATTG 


ATATGTTCAA 


GAACAGGCTC 


3300 


CTTATCATAA 


TAGAAGGACA 


AATCCTCTAC 


CGTAATATAT 


CTCATTATTT 


GATTTCTCCT 


3360 


AC TAAAGCAG 


TCAAAAACCG 


CTGAATCACT 


TTTTGTTCAT 


TTGGAGTAAA 


CTGAGTCGCC 


3420 


ACTTGTTCAT 


AGGTTAAAAG 


TGTATGCTCA 


TGGTGATGGT 


GGTGCTCCTC 


AGCGATTGGA 


3480 


CGAGCCAAGT 


CAGTCAACTG 


ATAAAAAATC 


ACACGCGCAT 


CTTTAGAATC 


TTTAGATGTT 


3540 




CTTCCTTGAC 


CAAAGACTTA 


ATGGCCTTGG 


TAACTGCCGC 


CTGACTGACA 


3600 


TTf2AP.APP.AP 


GGGCCAATTC 


TGAATTTGTT 


AAAGATTCCT 


CTGACAAGAG 


CATAAGGATA 


3660 




TATTGGTCAG 


GGCCACCTCG 


CTAGTGCAAT 


GACCTATTAG 


GATTTCATGC 


3720 




CCTGCAAAAT 


CACCTCATTC 


AAAAAAGCAT 


TGATATCCTT 


TGCTAGCTGT 


3780 


PTPATATPTO 


ACTCCTTTCC 


TTTTAGACTT 


CTCTTTTTTA 


AGAGAAAAAT 


ACTATTCTTT 


3840 


fi & P A T T 


TTACCAGTTA 


ATTATATCAC 


AAGCAAAAAA 


AGAGTCAAGA 


AAAAACGTGA 


3900 


AAAP/TAGTTT 


CATTCTTGAA CTCTTCTATA TTATATTATC TATTGAAATT CTTTGACATC 


3960 


TCCATCATAA 


GTCGCCCAAT 


CTTTGCTGAA 


AAAGCGCTCA 


TTCAGATGGT 


AAGTCGGAGC 


4020 


TGGTGTGGGA 


TTGGATAGGA 


AAGGATCAAC 


TGCCTTGTCA 


AAAGCCAACC 


AACCCAACCA 


4080 


ACCAAGGTGA 


ATGGTGTCCT 


TCATAAAGAA 


AGGCTCCCCG 


CCGTCCTTAG 


AAAAATCTGC 


4140 


TATATTGGTA 


AAACCTTGAC 


TTTCTAACTG 


GTAGCGAATC 


TTCTGCACCG 


TTTGTTGGTA 


4200 


CATATCCTCT 


CGTAGACCAG 


CATAGTTCAT 


CCATTTTTTA 


TTAACAGGTG 


GAATGATAAA 


4260 


AATCGGGTTT 


ACCTTAGATT 


TAGAAAACTG 


TGTTAAAACC 


AACTGCAAGT 


CATTATACTC 


4320 


TGGCGACTTG 


AGATAGGTAA 


AGCTTTTCTG 


AGAATCCTTT 


AATTTCTTCA 


AATCCTTCTT 


4380 


GATCTGCTCA 


TTATAGAAAT 


AATTTTCCAT 


TCCCATCTCA 


TTATTGGAAG 


TATTTTTTTC 


4440 



